RASR/NN: The RWTH Neural Network Toolkit for Speech Recognition

Simon Wiesler, Alexander Richard, Pavel Golik, Ralf Schlüter, Hermann Ney
Human Language Technology and Pattern Recognition, Computer Science Department, RWTH Aachen University, Aachen, Germany
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014









This paper describes the new release of RASR, the open source version of the well-proven speech recognition toolkit developed and used at RWTH Aachen University. The focus is on the implementation of the NN module for training neural network acoustic models. We describe the code design, configuration, and features of the NN module. Its key features are high flexibility regarding the network topology, choice of activation functions, training criteria, and optimization algorithm, as well as built-in support for efficient GPU computing. Run-time performance and recognition accuracy are evaluated, by way of example, with a deep neural network as the acoustic model in a hybrid NN/HMM system. The results show that RASR achieves state-of-the-art performance on a real-world large vocabulary task, while offering a complete pipeline for building and applying large-scale speech recognition systems.
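The abstract mentions a hybrid NN/HMM system without spelling out how the network output feeds the HMM. A minimal sketch of the standard hybrid approach follows, assuming the network emits one logit per HMM state and that state priors have been estimated separately. The function names, the `prior_scale` parameter, and the toy numbers are illustrative assumptions and are not taken from RASR or its configuration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_log_likelihoods(logits, state_priors, prior_scale=1.0):
    """Turn per-frame network outputs into HMM emission scores.

    Hybrid approach: by Bayes' rule, p(x|s) is proportional to p(s|x) / p(s),
    so in the log domain the score is log p(s|x) - prior_scale * log p(s).
    prior_scale is a hypothetical tuning knob; 1.0 gives plain Bayes' rule.
    """
    posteriors = softmax(logits)
    return [
        math.log(post) - prior_scale * math.log(prior)
        for post, prior in zip(posteriors, state_priors)
    ]


# Toy frame with three HMM states (made-up numbers, for illustration only).
logits = [2.0, 1.0, 0.1]
priors = [0.5, 0.3, 0.2]
scores = scaled_log_likelihoods(logits, priors)
```

The resulting `scores` take the place of the emission log-likelihoods during decoding. Dividing by the prior keeps frequently occurring states from being favored twice, once by the network and once by the HMM.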
