16377

RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks

Patrick Doetsch, Albert Zeyer, Paul Voigtlaender, Ilya Kulikov, Ralf Schluter, Hermann Ney
Human Language Technology and Pattern Recognition, Computer Science Department, RWTH Aachen University, 52062 Aachen, Germany
arXiv:1608.00895 [cs.LG], (2 Aug 2016)

@article{doetsch2016returnn,

   title={RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks},

   author={Doetsch, Patrick and Zeyer, Albert and Voigtlaender, Paul and Kulikov, Ilya and Schluter, Ralf and Ney, Hermann},

   year={2016},

   month={aug},

   archivePrefix={"arXiv"},

   primaryClass={cs.LG}

}

In this work we release our extensible and easily configurable neural network training software. It provides a rich set of functional layers with a particular focus on efficient training of recurrent neural network topologies on multiple GPUs. The source of the software package is public and freely available for academic research purposes and can be used as a framework or as a standalone tool which supports a flexible configuration. The software allows to train state-of-the-art deep bidirectional long short-term memory (LSTM) models on both one dimensional data like speech or two dimensional data like handwritten text. It can be applied to a variety of natural language processing tasks and also supports more exotic components such as attention-based end-to-end networks or associative LSTMs.
Rating: 2.1/5. From 30 votes.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: