Implementation of large-scale FIR adaptive filters on NVIDIA GeForce graphics processing unit
Kanazawa University, Kakuma-Machi, Kanazawa, 920-1192, Japan
International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), 2010
@inproceedings{hirano2010implementation,
title={Implementation of large-scale FIR adaptive filters on nVIDIA GeForce graphics processing unit},
author={Hirano, A. and Nakayama, K.},
booktitle={Proceedings of the 2010 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)},
pages={5704666},
year={2010}
}
This paper presents implementations of an FIR adaptive filter with a large number of taps on an NVIDIA GeForce graphics processing unit (GPU) using the CUDA software development environment. To overcome the long access latency of slow off-chip memory, memory accesses are reduced by re-ordering and by vector load/store operations, and the number of threads is increased. A tree adder is introduced to reduce the cost of summing up the thread outputs. Simultaneous execution of multiple filters is also examined. On a low-cost platform such as an Atom/ION nettop, the GPU accelerates the computation by almost a factor of three. For simultaneous multiple simulations such as ensemble averaging, a GPU with a large number of processing elements outperforms a dual-core CPU: almost six times faster for 16 runs.
July 11, 2011 by hgpu