Introducing CURRENNT: The Munich Open-Source CUDA RecurREnt Neural Network Toolkit
Machine Learning & Signal Processing, Technische Universität München, 80290 Munich, Germany
Journal of Machine Learning Research, Vol. 16, 547-551, 2015
@article{weninger2015introducing,
  title={Introducing CURRENNT: The Munich Open-Source CUDA RecurREnt Neural Network Toolkit},
  author={Weninger, Felix},
  journal={Journal of Machine Learning Research},
  volume={16},
  pages={547--551},
  year={2015}
}
In this article, we introduce CURRENNT, an open-source parallel implementation of deep recurrent neural networks (RNNs) supporting graphics processing units (GPUs) through NVIDIA's Compute Unified Device Architecture (CUDA). CURRENNT supports uni- and bidirectional RNNs with Long Short-Term Memory (LSTM) memory cells, which overcome the vanishing gradient problem. To our knowledge, CURRENNT is the first publicly available parallel implementation of deep LSTM-RNNs. Benchmarks are given on a noisy speech recognition task from the 2013 2nd CHiME Speech Separation and Recognition Challenge, where LSTM-RNNs have been shown to deliver the best performance. As a result, double-digit speedups in bidirectional LSTM training are achieved with respect to a reference single-threaded CPU implementation. CURRENNT is available under the GNU General Public License.
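For background on the memory cells mentioned in the abstract, the standard LSTM forward pass is sketched below in LaTeX. This is the common textbook formulation, not code or equations taken from the CURRENNT sources; the toolkit's exact cell variant (for example, whether peephole connections are used) may differ.

% Standard LSTM forward pass for input x_t and previous hidden state h_{t-1}
% (textbook formulation, shown for illustration only; requires amsmath)
\begin{align}
  i_t &= \sigma(W_i x_t + R_i h_{t-1} + b_i) && \text{(input gate)} \\
  f_t &= \sigma(W_f x_t + R_f h_{t-1} + b_f) && \text{(forget gate)} \\
  o_t &= \sigma(W_o x_t + R_o h_{t-1} + b_o) && \text{(output gate)} \\
  \tilde{c}_t &= \tanh(W_c x_t + R_c h_{t-1} + b_c) && \text{(cell candidate)} \\
  c_t &= f_t \odot c_{t-1} + i_t \odot \tilde{c}_t && \text{(cell state)} \\
  h_t &= o_t \odot \tanh(c_t) && \text{(cell output)}
\end{align}

Because the cell state c_t is updated additively under the control of the forget gate f_t, gradients can be carried across many time steps without vanishing, which is the property the abstract refers to. In the bidirectional case, a second set of cells processes the sequence in reverse and the forward and backward hidden states are combined per frame.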
October 8, 2015 by hgpu