hgpu.org » StarPU
Lucas Leandro Nesi, Samuel Thibault, Luka Stanisic, Lucas Mello Schnorr
Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 1080 Ti, Performance, StarPU
September 1, 2019 by hgpu
Samuel Thibault
Tags: Computer science, CUDA, Distributed computing, Heterogeneous systems, nVidia, nVidia Quadro FX 5800, OpenCL, Operating systems, StarPU, Task scheduling, Tesla C2050, Tesla K20, Tesla M2075, Thesis
December 23, 2018 by hgpu
Dalal Sukkari, Hatem Ltaief, Mathieu Faverge, David Keyes
Tags: Algorithms, Benchmarking, Computer science, Factorization, Intel Xeon Phi, nVidia, StarPU, Task scheduling, Tesla K80, Tesla P100
September 21, 2017 by hgpu