hgpu.org » nVidia L40s
Anne Ouyang, Simon Guo, Simran Arora, Alex L. Zhang, William Hu, Christopher Ré, Azalia Mirhoseini
Tags: AI, Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia L40s, Package, PyTorch
February 24, 2025 by hgpu
Bertil Schmidt, Felix Kallenborn, Alexander Wichmann, Alejandro Chacon, Christian Hundt
Tags: Bioinformatics, Biology, Computer science, CUDA, FPGA, Genomics, nVidia, nVidia A100, nVidia H100, nVidia L4, nVidia L40s, nVidia V100, Package
November 24, 2024 by hgpu