hgpu.org » StarPU
Lucas Leandro Nesi, Samuel Thibault, Luka Stanisic, Lucas Mello Schnorr
Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 1080 Ti, Performance, StarPU
September 1, 2019 by hgpu
Samuel Thibault
Tags: Computer science, CUDA, Distributed computing, Heterogeneous systems, nVidia, nVidia Quadro FX 5800, OpenCL, Operating systems, StarPU, Task scheduling, Tesla C2050, Tesla K20, Tesla M2075, Thesis
December 23, 2018 by hgpu
Dalal Sukkari, Hatem Ltaief, Mathieu Faverge, David Keyes
Tags: Algorithms, Benchmarking, Computer science, Factorization, Intel Xeon Phi, nVidia, StarPU, Task scheduling, Tesla K80, Tesla P100
September 21, 2017 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels
- KEET: Explaining Performance of GPU Kernels Using LLM Agents
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- Kerncap: Automated Kernel Extraction and Isolation for AMD GPUs
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures
- CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging
- Source-to-Source Transformations for GPU Code Generation
* * *



