hgpu.org » nVidia Tesla GP100
Sungho Shin, Youngmin Jo, Jungwook Choi, Swagath Venkataramani, Vijayalakshmi Srinivasan, Wonyong Sung
Tags: Artificial intelligence, Computer science, Deep learning, Neural networks, nVidia, nVidia DGX-1, nVidia GeForce GTX Titan XP, nVidia Tesla GP100
November 11, 2018 by hgpu