Accelerating Large-Scale Convolutional Neural Networks with Parallel Graphics Multiprocessors
Autonomous Intelligent Systems, Institute of Computer Science VI, University of Bonn, Germany
Artificial Neural Networks – ICANN 2010, Lecture Notes in Computer Science, 2010, Volume 6354/2010, 82-91
@conference{scherer2009accelerating,
title={Accelerating large-scale convolutional neural networks with parallel graphics multiprocessors},
author={Scherer, D. and Behnke, S.},
booktitle={Proc. of NIPS 2009 Workshop on Large-Scale Machine Learning: Parallelism and Massive Datasets},
year={2009},
organization={Springer}
}
Training convolutional neural networks (CNNs) on large sets of high-resolution images is too computationally intense to be performed on commodity CPUs. Such architectures, however, achieve state-of-the-art results on low-resolution machine vision tasks such as recognition of handwritten characters. We have adapted the inherent multi-level parallelism of CNNs for Nvidia’s CUDA GPU architecture to accelerate the training by two orders of magnitude. This dramatic speedup permits to apply CNN architectures to pattern recognition tasks on datasets with high-resolution natural images.
January 2, 2011 by hgpu