high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » CUDA » Parallel implementation of artificial neural network training

Parallel implementation of artificial neural network training

S. Scanzio, S. Cumani, R. Gemello, F. Mana, P. Laface

Politec. di Torino, Turin, Italy

IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2010

DOI:10.1109/ICASSP.2010.5495108

BibTeX

Source

1980

views

In this paper we describe the implementation of a complete ANN training procedure for speech recognition using the block mode back-propagation learning algorithm. We exploit the high performance SIMD architecture of GPU using CUDA and its C-like language interface. We also compare the speed-up obtained implementing the training procedure only taking advantage of the multi-thread capabilities of multi-core processors. Our approach has been tested by training acoustic models for large vocabulary speech recognition tasks, showing a 6 times reduction of the time required to train real-world large size networks with respect to an already optimized implementation using the Intel MKL libraries.

Tags: Acoustics, CUDA, Neural networks, nVidia, Signal processing, Speech recognition

June 14, 2011 by hgpu

No votes yet.

Please wait...

high performance computing on graphics processing units: hgpu.org

Parallel implementation of artificial neural network training

Recent source codes

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)

Parallel implementation of artificial neural network training

Share this:

Recent source codes

Most viewed papers (last 30 days)