high performance computing on graphics processing units: hgpu.org

RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks

Patrick Doetsch, Albert Zeyer, Paul Voigtlaender, Ilya Kulikov, Ralf Schluter, Hermann Ney

Human Language Technology and Pattern Recognition, Computer Science Department, RWTH Aachen University, 52062 Aachen, Germany

arXiv:1608.00895 [cs.LG], (2 Aug 2016)

@article{doetsch2016returnn,
  title={RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks},
  author={Doetsch, Patrick and Zeyer, Albert and Voigtlaender, Paul and Kulikov, Ilya and Schluter, Ralf and Ney, Hermann},
  year={2016},
  month={aug},
  eprint={1608.00895},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}


In this work we release our extensible and easily configurable neural network training software. It provides a rich set of functional layers, with a particular focus on efficient training of recurrent neural network topologies on multiple GPUs. The source of the software package is public and freely available for academic research purposes; it can be used as a framework or as a standalone tool that supports flexible configuration. The software can train state-of-the-art deep bidirectional long short-term memory (LSTM) models on both one-dimensional data such as speech and two-dimensional data such as handwritten text. It can be applied to a variety of natural language processing tasks and also supports more exotic components such as attention-based end-to-end networks and associative LSTMs.
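For readers unfamiliar with the bidirectional LSTM models mentioned in the abstract, the following is a minimal NumPy sketch of one such layer applied to a one-dimensional sequence. It is an illustration only: it does not use RETURNN or reproduce its API, and the function names (`lstm_forward`, `bidirectional_lstm`, `init_params`), the gate ordering, and the toy sizes are all assumptions made for this example.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_forward(x, W, U, b):
    """Run a single LSTM layer over a sequence.

    x: (T, n_in) input frames
    W: (n_in, 4 * n_hid) input weights
    U: (n_hid, 4 * n_hid) recurrent weights
    b: (4 * n_hid,) bias
    Returns the hidden states, shape (T, n_hid).
    """
    T = x.shape[0]
    n_hid = U.shape[0]
    h = np.zeros(n_hid)  # hidden state
    c = np.zeros(n_hid)  # cell state
    out = np.zeros((T, n_hid))
    for t in range(T):
        z = x[t] @ W + h @ U + b
        # Assumed gate order: input, forget, output, candidate.
        i, f, o, g = np.split(z, 4)
        i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)
        g = np.tanh(g)
        c = f * c + i * g
        h = o * np.tanh(c)
        out[t] = h
    return out


def bidirectional_lstm(x, fw_params, bw_params):
    """Concatenate a forward pass and a time-reversed backward pass."""
    fw = lstm_forward(x, *fw_params)
    # Process the reversed sequence, then flip back so frames line up.
    bw = lstm_forward(x[::-1], *bw_params)[::-1]
    return np.concatenate([fw, bw], axis=1)


def init_params(rng, n_in, n_hid):
    """Small random weights for the toy example (hypothetical helper)."""
    W = rng.normal(0.0, 0.1, size=(n_in, 4 * n_hid))
    U = rng.normal(0.0, 0.1, size=(n_hid, 4 * n_hid))
    b = np.zeros(4 * n_hid)
    return W, U, b


rng = np.random.default_rng(0)
x = rng.normal(size=(5, 3))          # 5 frames, 3 features per frame
fw_params = init_params(rng, 3, 4)   # forward direction, 4 hidden units
bw_params = init_params(rng, 3, 4)   # backward direction, 4 hidden units
y = bidirectional_lstm(x, fw_params, bw_params)
print(y.shape)  # (5, 8)
```

Each output frame combines a forward state that has seen only the past frames with a backward state that has seen only the future frames, which is why the output width is twice the hidden size. A deep model in the sense of the abstract would stack several such layers, feeding `y` into the next one.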

Tags: Computer science, CUDA, Machine learning, Neural and Evolutionary Computing, nVidia, nVidia GeForce GTX 980, Package, RNN, Speech recognition

August 4, 2016 by hgpu

