Single Server Multi-GPU Training of ConvNets
Omry Yadan, Keith Adams, Yaniv Taigman, Marc’Aurelio Ranzato
Facebook AI Group
arXiv:1312.5853 [cs.LG], (20 Dec 2013)
BibTeX
@article{2013arXiv1312.5853Y,
  author        = {{Yadan}, O. and {Adams}, K. and {Taigman}, Y. and {Ranzato}, M.},
  title         = "{Single Server Multi-GPU Training of ConvNets}",
  journal       = {ArXiv e-prints},
  archivePrefix = "arXiv",
  eprint        = {1312.5853},
  primaryClass  = "cs.LG",
  keywords      = {Computer Science - Learning, Computer Science - Neural and Evolutionary Computing},
  year          = {2013},
  month         = {dec},
  adsurl        = {http://adsabs.harvard.edu/abs/2013arXiv1312.5853Y},
  adsnote       = {Provided by the SAO/NASA Astrophysics Data System}
}
In this work we evaluate different approaches to parallelize computation of convolutional neural networks across several GPUs within the same server.
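As an illustration only (not the authors' implementation, which predates modern frameworks), the sketch below shows one common way to spread ConvNet training over several GPUs in a single server: data parallelism, where each GPU holds a model replica, processes a slice of the mini-batch, and the gradients are averaged. The model architecture and hyperparameters here are placeholders, and the code assumes PyTorch with multiple CUDA devices available.

```python
# Minimal data-parallel sketch (assumption: PyTorch, >=1 CUDA device).
# Not the paper's code; it only illustrates splitting a mini-batch across
# the GPUs of a single server via torch.nn.DataParallel.
import torch
import torch.nn as nn

# Placeholder ConvNet; the paper's actual architecture is not reproduced here.
model = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(16, 10),
)

if torch.cuda.is_available() and torch.cuda.device_count() > 1:
    # Scatter each batch over all GPUs in this server, gather the outputs,
    # and accumulate gradients back onto the primary device.
    model = nn.DataParallel(model)
device = "cuda" if torch.cuda.is_available() else "cpu"
model = model.to(device)

optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
criterion = nn.CrossEntropyLoss()

# One hypothetical training step on random data.
x = torch.randn(64, 3, 32, 32, device=device)
y = torch.randint(0, 10, (64,), device=device)

optimizer.zero_grad()
loss = criterion(model(x), y)
loss.backward()
optimizer.step()
print(f"loss: {loss.item():.4f}")
```

Model parallelism (splitting the layers of one network across GPUs) and hybrid schemes are the other options such a study typically compares; they trade inter-GPU communication of activations for the gradient synchronization cost shown above.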
December 23, 2013 by hgpu