15180

Parallel 3D Fast Wavelet Transform comparison on CPUs and GPUs

Gregorio Bernabe
Computing Engineering, University of Murcia,Campus de Espinardo, 30071 Murcia, Spain
Annals of Multicore and GPU Programming, Vol 2, No 1, 2015

@article{bernabe2015parallel,

   title={Parallel 3D Fast Wavelet Transform comparison on CPUs and GPUs},

   author={Bernab{‘e}, Gregorio},

   journal={Annals of Multicore and GPU Programming},

   volume={2},

   number={1},

   pages={1–14},

   year={2015}

}

Download Download (PDF)   View View   Source Source   

1714

views

We present in this paper several implementations of the 3D Fast Wavelet Transform (3D-FWT) on multicore CPUs and manycore GPUs. On the GPU side, we focus on CUDA and OpenCL programming to develop methods for an efficient mapping on manycores. On multicore CPUs, OpenMP and Pthreads are used as counterparts to maximize parallelism, and renowned techniques like tiling and blocking are exploited to optimize the use of memory. We evaluate these proposals and make a comparison between a new Fermi Tesla C2050 and an Intel Core 2 QuadQ6700. Speedups of the CUDA version are the best results, improving the execution times on CPU, ranging from 5.3x to 7.4x for different image sizes, and up to 81 times faster when communications are neglected. Meanwhile, OpenCL obtains solid gains which range from 2x factors on small frame sizes to 3x factors on larger ones.
Rating: 1.5/5. From 2 votes.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: