Early Results of Deep Learning on the Stampede2 Supercomputer
Texas Advanced Computing Center
Texas Advanced Computing Center, Technical Report, 2017
@techreport{zhang2017early,
  title       = {Early Results of Deep Learning on the Stampede2 Supercomputer},
  author      = {Zhang, Zhao and Xu, Weijia and Gaffney, Niall and Stanzione, Daniel},
  institution = {Texas Advanced Computing Center},
  year        = {2017}
}
We present early results of deep learning work on the Stampede2 supercomputer. Our goal is to enable scalable and efficient deep learning model training and serving to expedite scientific discovery. We build three popular deep learning frameworks: Intel-Caffe, MXNet, and TensorFlow. With the built-in applications of these frameworks (CaffeNet, AlexNet, GoogLeNet, and Cifar10), we measure scalability in both the strong-scaling and weak-scaling regimes. At the time of writing, we are able to build and run Intel-Caffe, MXNet, and TensorFlow on multiple KNL nodes. While MXNet and TensorFlow performance is still being tuned, we are able to scale the aforementioned applications in Intel-Caffe to 512 KNLs with ~80% efficiency relative to single-KNL performance.
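As a point of reference for the scaling claims above, the following sketch shows how strong- and weak-scaling efficiency are conventionally computed from measured run times. The function names and the sample numbers are illustrative assumptions, not values from the report; only the ~80% efficiency at 512 KNLs is stated in the abstract.

```python
def strong_scaling_efficiency(t1, tn, n):
    """Strong scaling: total problem size is fixed, so the ideal
    time on n nodes is t1 / n. Efficiency = t1 / (n * tn)."""
    return t1 / (n * tn)

def weak_scaling_efficiency(t1, tn):
    """Weak scaling: per-node problem size is fixed, so the ideal
    time on n nodes stays t1. Efficiency = t1 / tn."""
    return t1 / tn

# Hypothetical timings chosen so the strong-scaling case matches the
# ~80% figure reported for 512 KNLs: 1000 s on one node, ~2.44 s on 512.
print(strong_scaling_efficiency(1000.0, 2.44140625, 512))  # 0.8
print(weak_scaling_efficiency(10.0, 12.5))                 # 0.8
```

An efficiency of 1.0 would mean perfectly linear scaling; values below 1.0 reflect communication and synchronization overhead that grows with node count.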