hgpu.org » Tela K40
Ammar Ahmad Awan, Hari Subramoni, Dhabaleswar K. Panda
Tags: Benchmarking, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Intel Xeon Phi, Machine learning, nVidia, Tela K40, Tesla K80, Tesla P100
December 24, 2017 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Using Intel oneAPI for Multi-hybrid Acceleration Programming with GPU and FPGA Coupling
- 94% on CIFAR-10 in 3.29 Seconds on a Single GPU
- SYCL in the edge: performance and energy evaluation for heterogeneous acceleration
- gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments
- Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs
- Fast Truncated SVD of Sparse and Dense Matrices on Graphics Processors
- Retargeting and Respecializing GPU Workloads for Performance Portability
- Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality
- LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers
- Seer: Predictive Runtime Kernel Selection for Irregular Problems
* * *