GPU-Friendly Local Regression for Voice Conversion

Taylor Berg-Kirkpatrick, Dan Klein
Computer Science Division, University of California, Berkeley
Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT), 2015


   title={GPU-Friendly Local Regression for Voice Conversion},

   author={Berg-Kirkpatrick, Taylor and Klein, Dan},



Download Download (PDF)   View View   Source Source   



Voice conversion is the task of transforming a source speaker’s voice so that it sounds like a target speaker’s voice. We present a GPUfriendly local regression model for voice conversion that is capable of converting speech in real-time and achieves state-of-the-art accuracy on this task. Our model uses a new approximation for computing local regression coefficients that is explicitly designed to preserve memory locality. As a result, our inference procedure is amenable to efficient implementation on the GPU. Our approach is more than 10X faster than a highly optimized CPU-based implementation, and is able to convert speech 2.7X faster than real-time.
Rating: 2.3/5. From 3 votes.
Please wait...

* * *

* * *

* * *

HGPU group © 2010-2022 hgpu.org

All rights belong to the respective authors

Contact us: