Survey paper on Deep Learning on GPUs
@article{mittal2019survey,
  author={Mittal, Sparsh and Vaishay, Shraiysh},
  title={Survey paper on Deep Learning on GPUs},
  journal={Journal of Systems Architecture},
  year={2019}
}
The rise of deep learning (DL) has been fuelled by improvements in accelerators, and the GPU remains the most widely used accelerator for DL applications. We present a survey of architecture- and system-level techniques for optimizing DL applications on GPUs. We review more than 75 techniques, covering both inference and training, on both single-GPU and distributed multi-GPU systems. The survey covers pruning, tiling, batching, the impact of data layouts, data-reuse schemes, and convolution strategies (FFT, direct, GEMM, Winograd). It also covers techniques for offloading data to CPU memory to avoid GPU-memory bottlenecks during training.
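To give a flavour of one of the surveyed convolution strategies, below is a minimal NumPy sketch of GEMM-based convolution via im2col, where sliding-window patches are flattened into a matrix so the convolution becomes a single matrix product (the operation GPUs execute most efficiently). The function names and the single-channel, stride-1 simplification are illustrative choices, not taken from the paper.

```python
import numpy as np

def im2col(x, kh, kw):
    """Flatten every kh x kw patch of a 2-D input into one row of a matrix."""
    H, W = x.shape
    oh, ow = H - kh + 1, W - kw + 1
    cols = np.empty((oh * ow, kh * kw))
    for i in range(oh):
        for j in range(ow):
            cols[i * ow + j] = x[i:i + kh, j:j + kw].ravel()
    return cols

def conv2d_gemm(x, k):
    """GEMM-based convolution: one matrix product replaces the sliding window."""
    kh, kw = k.shape
    oh, ow = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    return (im2col(x, kh, kw) @ k.ravel()).reshape(oh, ow)

def conv2d_direct(x, k):
    """Reference direct (sliding-window) convolution for comparison."""
    kh, kw = k.shape
    oh, ow = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * k)
    return out
```

The trade-off this sketch exposes is the one the surveyed papers weigh: im2col duplicates input data (each pixel appears in up to kh*kw rows), spending memory to turn convolution into a dense GEMM that maps well onto GPU tensor cores.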
The paper, accepted in the Journal of Systems Architecture (2019), is available here.