hgpu.org » nVidia L40s
Anne Ouyang, Simon Guo, Simran Arora, Alex L. Zhang, William Hu, Christopher Ré, Azalia Mirhoseini
Tags: AI, Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia L40s, Package, PyTorch
February 24, 2025 by hgpu
Bertil Schmidt, Felix Kallenborn, Alexander Wichmann, Alejandro Chacon, Christian Hundt
Tags: Bioinformatics, Biology, Computer science, CUDA, FPGA, Genomics, nVidia, nVidia A100, nVidia H100, nVidia L4, nVidia L40s, nVidia V100, Package
November 24, 2024 by hgpu