hgpu.org » nVidia H100
Tal Kadosh, Niranjan Hasabnis, Vy A. Vo, Nadav Schneider, Neva Krien, Abdul Wasay, Nesreen Ahmed, Ted Willke, Guy Tamir, Yuval Pinter, Timothy Mattson, Gal Oren
Tags: Code generation, Computer science, Deep learning, HPC, nVidia, nVidia A40, nVidia H100, OpenMP, Package
September 6, 2023 by hgpu
Phuong Nguyen, Pratik Nayak, Hartwig Anzt
Tags: Computer science, CUDA, Intel, Intel Data Center GPU Max 1550, nVidia, nVidia A100, nVidia H100, Package, performance portability, Physics, SYCL
August 20, 2023 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Fortran High-Level Synthesis: Reducing the barriers to accelerating HPC codes on FPGAs
- PoCL-R: An Open Standard Based Offloading Layer for Heterogeneous Multi-Access Edge Computing with Server Side Scalability
- Compute units in OpenMP: Extensions for heterogeneous parallel programming
- Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation
- Many Cores, Many Models: GPU Programming Model vs. Vendor Compatibility Overview
- Leveraging Memory Copy Overlap for Efficient Sparse Matrix-Vector Multiplication on GPUs
- Scope is all you need: Transforming LLMs for HPC Code
- Novel insights on atomic synchronization for sort-based group-by on GPUs
- Performant low-order matrix-free finite element kernels on GPU architectures
- HPAC-Offload: Accelerating HPC Applications with Portable Approximate Computing on the GPU
* * *