hgpu.org » Linear Algbera
Samuel D. Relton, Pedro Valero-Lara, Mawussi Zounon
Tags: BLAS, Computer science, CUDA, Linear Algbera, nVidia, Package, Tesla K40
August 11, 2016 by hgpu
Nicolas Weber, Michael Goesele
Tags: Computer science, CUDA, Linear Algbera, nVidia, nVidia GeForce GTX Titan X, Package, Performance, Tesla K20
April 29, 2016 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading
- KernelBench: Can LLMs Write Efficient GPU Kernels?
- The AI CUDA Engineer: Agentic CUDA Kernel Discovery, Optimization and Composition
- Seamless acceleration of Fortran intrinsics via AMD AI engines
- pyATF: Constraint-Based Auto-Tuning in Python
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
- WgPy: GPU-accelerated NumPy-like array library for web browsers
- Evaluating the Performance of the DeepSeek Model in Confidential Computing Environment
- Forecasting time series with constraints
- CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads
* * *