Multi-user real-time speech recognition with a GPU

Jungsuk Kim
Department of Electrical Engineering and Computer Science, Seoul National University, 599 Gwanangno, Gwanak-gu, Seoul, 151-744, Korea
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012


   title={Multi-user real-time speech recognition with a GPU},

   author={Kim, J. and Sung, W.},

   booktitle={Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on},





Download Download (PDF)   View View   Source Source   



We have developed a multi-user large vocabulary speech recognition system employing a fully composed one-level weighted finite state transducer (WFST) based network on a Graphics Processing Unit (GPU). This system improves the overall throughput and latency of speech recognition engine which processes multiple users’ utterances at the same time with efficient scheduling, parameter sharing, and communication overhead reduction techniques. We conduct both batch speech simulation and trace driven online simulation to access the performance of the developed system. Traces are generated based on a queueing model.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: