hgpu.org » nVidia A100
Oliver Hennigh, Susheela Narasimhan, Mohammad Amin Nabian, Akshay Subramaniam, Kaustubh Tangsali, Max Rietmann, Jose del Aguila Ferrandis, Wonmin Byeon, Zhiwei Fang, Sanjay Choudhry
Tags: cfd, CUDA, Fluid dynamics, Linear Algebra, Machine learning, Neural networks, nVidia, nVidia A100, Partial differential equations, PDEs, Physics, Tesla V100, Video
December 20, 2020 by hgpu
Yuhsiang Mike Tsai, Terry Cojean, Hartwig Anzt
Tags: Computer science, CUDA, Linear Algebra, nVidia, nVidia A100, Performance, Sparse, Sparse matrix
August 23, 2020 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading
- pyATF: Constraint-Based Auto-Tuning in Python
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
- WgPy: GPU-accelerated NumPy-like array library for web browsers
- CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads
- LLMPerf: GPU Performance Modeling meets Large Language Models
- The Shamrock code: I- Smoothed Particle Hydrodynamics on GPUs
- Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems
- TransCL: An Automatic CUDA-to-OpenCL Programs Transformation Framework
- Can Tensor Cores Benefit Memory-Bound Kernels? (No!)
* * *