hgpu.org » Comuter science
Kai Zhu, Wenyi Zhao, Zhen Zheng, Tianyou Guo, Pengzhan Zhao, Junjie Bai, Jun Yang, Xiaoyong Liu, Lansong Diao, Wei Lin
Tags: Compilers, Comuter science, CUDA, Machine learning, nVidia, Tesla T4
March 14, 2021 by hgpu
Tiago Augusto Engel, Andrea Schwertner Charao, Manuele Kirsch-Pinheiro, Luiz-Angelo Steffenel
Tags: Comuter science, CUDA, Data mining, Java, Matrix multiplication, nVidia, nVidia Quadro K 2000, Package, Tesla K20, Tesla M2050
June 13, 2014 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading
- pyATF: Constraint-Based Auto-Tuning in Python
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
- WgPy: GPU-accelerated NumPy-like array library for web browsers
- CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads
- LLMPerf: GPU Performance Modeling meets Large Language Models
- The Shamrock code: I- Smoothed Particle Hydrodynamics on GPUs
- Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems
- TransCL: An Automatic CUDA-to-OpenCL Programs Transformation Framework
- Can Tensor Cores Benefit Memory-Bound Kernels? (No!)
* * *