high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Multi-Lingual Speech Recognition with Low-Rank Multi-Task Deep Neural Networks

Multi-Lingual Speech Recognition with Low-Rank Multi-Task Deep Neural Networks

Aanchan Mohan, Richard Rose

Department of Electrical and Computer Engineering, McGill University, Montreal, Canada

IEEE International Conference on Acoustics, Speech and Signal Processing, 2015

@article{mohan2015multi,

title={Multi-Lingual Speech Recognition with Low-Rank Multi-Task Deep Neural Networks},

author={Mohan, Aanchan and Rose, Richard},

year={2015}

}

Download (PDF)

View

Source

1843

views

Multi-task learning (MTL) for deep neural network (DNN) multilingual acoustic models has been shown to be effective for learning parameters that are common or shared between multiple languages[1, 2]. In the MTL paradigm, the number of parameters in the output layer is large and scales with the number of languages used in training. This output layer becomes a computational bottleneck. For mono-lingual DNNs, low-rank matrix factorization (LRMF) of weight matrices have yielded large computational savings[3, 4]. The LRMF proposed in this work for MTL, is for the original language-specific block matrices to "share" a common matrix, with resulting low-rank language specific block matrices. The impact of LRMF is presented in two scenarios, namely: (a) improving performance in a target language when auxiliary languages are included during multi-lingual training; and (b) cross-language transfer to an unseen language with only 1 hour of transcribed training data. A 44% parameter reduction in the final layer, manifests itself in providing a lower memory footprint and faster training times. An experimental study shows that the LRMF multi-lingual DNN provides competitive performance compared to a full-rank multi-lingual DNN in both scenarios.

Tags: Computer science, CUDA, Deep learning, Machine learning, Neural networks, nVidia, Speech recognition, Tesla K20

April 7, 2015 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

* * *

high performance computing on graphics processing units: hgpu.org

Multi-Lingual Speech Recognition with Low-Rank Multi-Task Deep Neural Networks

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)

Multi-Lingual Speech Recognition with Low-Rank Multi-Task Deep Neural Networks

Share this:

Recent source codes

Most viewed papers (last 30 days)