high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Optimized MFCC Feature Extraction on GPU

Optimized MFCC Feature Extraction on GPU

Haofeng Kou, Weijia Shang, Ian Lane, Jike Chong

Santa Clara University

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013

@article{kou2013optimized,

title={OPTIMIZED MFCC FEATURE EXTRACTION ON GPU},

author={Kou, Haofeng and Shang, Weijia and Lane, Ian and Chong, Jike},

year={2013}

}

Download (PDF)

View

Source

2796

views

In this paper, we update our previous research for Mel-Frequency Cepstral Coefficient (MFCC) feature extraction [1] and describe the optimizations required for improving throughput on the Graphics Processing Units (GPU). We not only demonstrate that the feature extraction process is suitable for GPUs and a substantial reduction in computation time can be obtained by performing feature extraction on these platforms, but also discus about the optimized algorithm. Using one GTX580 GPU our approach is shown to be approximately 97x faster than a sequential CPU implementation, enabling feature extraction to be performed at under 0.01% real-time. This is significantly faster than prior reported results implemented on GPUs, DSPs and FPGAs. Furthermore we demonstrate that multiple MFCC features can be generated for a set of predefined Vocal Tract Length Normalization (VTLN) alpha parameters with little degradation in throughput, along with the optimization for filter bank and reductions.

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia GeForce GTX 580, Speech recognition

July 13, 2013 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Optimized MFCC Feature Extraction on GPU

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

Optimized MFCC Feature Extraction on GPU

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)