hgpu.org » StarPU
Lucas Leandro Nesi, Samuel Thibault, Luka Stanisic, Lucas Mello Schnorr
Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 1080 Ti, Performance, StarPU
September 1, 2019 by hgpu
Samuel Thibault
Tags: Computer science, CUDA, Distributed computing, Heterogeneous systems, nVidia, nVidia Quadro FX 5800, OpenCL, Operating systems, StarPU, Task scheduling, Tesla C2050, Tesla K20, Tesla M2075, Thesis
December 23, 2018 by hgpu
Dalal Sukkari, Hatem Ltaief, Mathieu Faverge, David Keyes
Tags: Algorithms, Benchmarking, Computer science, Factorization, Intel Xeon Phi, nVidia, StarPU, Task scheduling, Tesla K80, Tesla P100
September 21, 2017 by hgpu



