nVidia Quadro K420
Steven W. D. Chien, Stefano Markidis, Vyacheslav Olshevsky, Yaroslav Bulatov, Erwin Laure, Jeffrey S. Vetter
Tags: Benchmarking, Computer science, CUDA, Deep learning, FFT, Heterogeneous systems, HPC, Machine learning, nVidia, nVidia Quadro K420, OpenMPI, Package, Performance, Python, TensorFlow, Tesla K80, Tesla V100
March 17, 2019 by hgpu
* * *
Most viewed papers (last 30 days)
- Omniwise: Predicting GPU Kernels Performance with LLMs
- P4OMP: Retrieval-Augmented Prompting for OpenMP Parallelism in Serial Code
- Engineering Supercomputing Platforms for Biomolecular Applications
- CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
- GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis
- A First Look at Bugs in LLM Inference Engines
- ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks
- Efficient GPU Implementation of Multi-Precision Integer Division
- Accelerated discovery and design of Fe-Co-Zr magnets with tunable magnetic anisotropy through machine learning and parallel computing
- chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations
* * *