hgpu.org » nVidia Quadro K420
Steven W. D. Chien, Stefano Markidis, Vyacheslav Olshevsky, Yaroslav Bulatov, Erwin Laure, Jeffrey S. Vetter
Tags: Benchmarking, Computer science, CUDA, Deep learning, FFT, Heterogeneous systems, HPC, Machine learning, nVidia, nVidia Quadro K420, OpenMPI, Package, Performance, Python, TensorFlow, Tesla K80, Tesla V100
March 17, 2019 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- High-Performance Computing: from Optimization to Automation
- Accelerating cosmological simulations on GPUs: a portable approach using OpenMP
- Compiler and Runtime Systems for Generative AI Models
- EvoEngineer: Mastering Automated CUDA Kernel Code Evolution with Large Language Models
- Scalable GPU-Based Integrity Verification for Large Machine Learning Models
- ConCuR: Conciseness Makes State-of-the-Art Kernel Generation
- STARK: Strategic Team of Agents for Refining Kernels
- Tutoring LLM into a Better CUDA Optimizer
- INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
- Neptune: Advanced ML Operator Fusion for Locality and Parallelism on GPUs
* * *




