hgpu.org » nVidia GeForce RTX 4090
Taesu Kim, Jongho Lee, Daehyun Ahn, Sarang Kim, Jiwoong Choi, Minkyu Kim, Hyungjun Kim
Tags: Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Mixed precision, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia RTX A6000, Package
February 18, 2024 by hgpu
Mathis Bouverot-Dupuis, Mary Sheeran
Tags: Algorithms, AMD Radeon Instinct MI100, ATI, Computer science, CUDA, Haskell, nVidia, nVidia GeForce RTX 4090, Package
June 18, 2023 by hgpu
Most viewed papers (last 30 days)
- Towards Robust Agentic CUDA Kernel Benchmarking, Verification, and Optimization
- Dato: A Task-Based Programming Model for Dataflow Accelerators
- TRUST: the HPC open-source CFD platform – from CPU to GPU
- Mojo: MLIR-Based Performance-Portable HPC Science Kernels on GPUs for the Python Ecosystem
- Towards GPU Parallelism Abstractions in Rust: A Case Study with Linear Pipelines
- High-Performance Computing: from Optimization to Automation
- exa-AMD: An Exascale-Ready Framework for Accelerating the Discovery and Design of Functional Materials
- VibeCodeHPC: An Agent-Based Iterative Prompting Auto-Tuner for HPC Code Generation Using LLMs
- Evolution of Kernels: Automated RISC-V Kernel Optimization with Large Language Models
- Robust LLM Training Infrastructure at ByteDance
* * *