The nonequispaced FFT on graphics processing units

Susanne Kunis, Stefan Kunis
University of Osnabrueck, 49076 Osnabrueck
Proceedings in Applied Mathematics and Mechanics, 2012









Without doubt, the fast Fourier transform (FFT) is one of the algorithms with a large impact on science and engineering. By means of appropriate approximations, this scheme has been generalized to arbitrary spatial sampling points. This so-called nonequispaced FFT is the core of the sequential NFFT3 library, and we discuss its computational costs in detail. On the other hand, programmable graphics processing units have evolved into highly parallel, multithreaded, manycore processors with enormous computational capacity and very high memory bandwidth. Using the so-called Compute Unified Device Architecture (CUDA), we parallelized the nonequispaced FFT by combining the CUDA FFT library with a dedicated parallelization of the approximation scheme.
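To make concrete what a nonequispaced transform evaluates, the following minimal sketch computes the sum directly, without any fast algorithm. The abstract does not state the formula, so the convention used here (N coefficients for frequencies k = -N/2, ..., N/2 - 1, nodes x_j in [-1/2, 1/2), and the sign of the exponent) is an assumption based on the common NFFT formulation. The function name `ndft` is illustrative and is not part of NFFT3 or CUDA.

```python
import cmath


def ndft(f_hat, nodes):
    """Directly evaluate f(x_j) = sum_k f_hat[k] * exp(-2*pi*i*k*x_j).

    Assumed convention (not taken from the abstract):
    f_hat holds N coefficients (N even) for frequencies
    k = -N/2, ..., N/2 - 1, and nodes are arbitrary points x_j.
    The cost is O(N * M) for M nodes, which is the cost a fast
    approximate scheme is meant to reduce.
    """
    n = len(f_hat)
    freqs = range(-n // 2, n // 2)
    return [
        sum(c * cmath.exp(-2j * cmath.pi * k * x) for c, k in zip(f_hat, freqs))
        for x in nodes
    ]


if __name__ == "__main__":
    # Four coefficients, three arbitrary (non-uniform) sampling points.
    coefficients = [0.5, 1.0, 2.0, -1.0]
    sampling_points = [-0.41, 0.07, 0.3]
    for x, value in zip(sampling_points, ndft(coefficients, sampling_points)):
        print(x, value)
```

Each output value depends on every coefficient, and the nodes need not lie on a grid, so the regular structure a standard FFT exploits is not available. This is why an approximation step is combined with an ordinary FFT in the fast variant.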
