hgpu.org » AMD Radeon Pro V620
Peter Eastman, Raimondas Galvelis, Raúl P. Peláez, Charlles R. A. Abreu, Stephen E. Farr, Emilio Gallicchio, Anton Gorenko, Michael M. Henry, Frank Hu, Jing Huang, Andreas Krämer, Julien Michel, Joshua A. Mitchell, Vijay S. Pande, João PGLM Rodrigues, Jaime Rodriguez-Guerra, Andrew C. Simmonett, Jason Swails, Ivy Zhang, John D. Chodera, Gianni De Fabritiis, Thomas E. Markland
Tags: AMD Radeon Pro V620, ATI, Chemical Physics, CUDA, HIP, Machine learning, Molecular dynamics, Molecular simulation, nVidia, nVidia A100, nVidia GeForce RTX 4080, OpenCL, Package, Physics
October 15, 2023 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- KEET: Explaining Performance of GPU Kernels Using LLM Agents
- CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- Kerncap: Automated Kernel Extraction and Isolation for AMD GPUs
- MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures
- KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators
* * *




