hgpu.org » AMD Radeon Instinct MI100
Moritz Lehmann, Mathias J. Krause, Giorgio Amati, Marcello Sega, Jens Harting, Stephan Gekle
Tags: AMD Radeon Instinct MI100, AMD Radeon VI, ATI, Fluid dynamics, lattice Boltzmann, Mixed precision, nVidia, OpenCL, Tesla K20, Tesla K40, Tesla K80, Tesla P100, Tesla V100
December 19, 2021 by hgpu
Ahmad Abdelfattah, Valeria Barra, Natalie Beams, Ryan Bleile, Jed Brown, Jean-Sylvain Camier, Robert Carson, Noel Chalmers, Veselin Dobrev, Yohann Dudouit, Paul Fischer, Ali Karakus, Stefan Kerkemeier, Tzanio Kolev, Yu-Hsiang Lan, Elia Merzari, Misun Min, Malachi Phillips, Thilina Rathnayake, Robert Rieben, Thomas Stitt, Ananias Tomboulides, Stanimire Tomov, Vladimir Tomov, Arturo Vargas, Tim Warburton, Kenneth Weiss
Tags: Algorithms, AMD Radeon Instinct MI100, ATI, Computer science, CUDA, Finite element method, nVidia, OCCA, Tesla V100
September 19, 2021 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Automatic Generation of OpenCL Code through Polyhedral Compilation with LLM
- Deep Learning and Machine Learning with GPGPU and CUDA: Unlocking the Power of Parallel Computing
- Testing GPU Numerics: Finding Numerical Differences Between NVIDIA and AMD GPUs
- Accelerating Drug Discovery in AutoDock-GPU with Tensor Cores
- miniLB: A Performance Portability Study of Lattice-Boltzmann Simulations
- Intel(R) SHMEM: GPU-initiated OpenSHMEM using SYCL
- OpenACC offloading of the MFC compressible multiphase flow solver on AMD and NVIDIA GPUs
- Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models
- Bitstream Database-Driven FPGA Programming Flow Based on Standard OpenCL
- Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores
* * *