hgpu.org » nVidia Quadro K420
Steven W. D. Chien, Stefano Markidis, Vyacheslav Olshevsky, Yaroslav Bulatov, Erwin Laure, Jeffrey S. Vetter
Tags: Benchmarking, Computer science, CUDA, Deep learning, FFT, Heterogeneous systems, HPC, Machine learning, nVidia, nVidia Quadro K420, OpenMPI, Package, Performance, Python, TensorFlow, Tesla K80, Tesla V100
March 17, 2019 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading
- pyATF: Constraint-Based Auto-Tuning in Python
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
- WgPy: GPU-accelerated NumPy-like array library for web browsers
- CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads
- LLMPerf: GPU Performance Modeling meets Large Language Models
- The Shamrock code: I- Smoothed Particle Hydrodynamics on GPUs
- Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems
- TransCL: An Automatic CUDA-to-OpenCL Programs Transformation Framework
- Can Tensor Cores Benefit Memory-Bound Kernels? (No!)
* * *