hgpu.org » nVidia Quadro K420
Steven W. D. Chien, Stefano Markidis, Vyacheslav Olshevsky, Yaroslav Bulatov, Erwin Laure, Jeffrey S. Vetter
Tags: Benchmarking, Computer science, CUDA, Deep learning, FFT, Heterogeneous systems, HPC, Machine learning, nVidia, nVidia Quadro K420, OpenMPI, Package, Performance, Python, TensorFlow, Tesla K80, Tesla V100
March 17, 2019 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Acceleration as a Service (XaaS) Source Containers
- Exploring SYCL as a Portability Layer for High-Performance Computing on CPUs
- All You Need Is Binary Search! A Practical View on Lightweight Database Indexing on GPUs
- CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
- Engineering Supercomputing Platforms for Biomolecular Applications
- chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations
- LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters
- A First Look at Bugs in LLM Inference Engines
- MemAscend: System Memory Optimization for SSD-Offloaded LLM Fine-Tuning
- HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration
* * *