15427

Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation

Gang Mei, Hong Tian
School of Engineering and Technolgy, China University of Geosciences, No. 29 Xueyuan Road, Beijing 100083, China
SpringerPlus, 5:104, 2016

@article{mei2016impact,

   title={Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation},

   author={Mei, Gang and Tian, Hong},

   journal={SpringerPlus},

   volume={5},

   number={1},

   pages={104},

   year={2016},

   publisher={Springer}

}

This paper focuses on evaluating the impact of different data layouts on the computational efficiency of GPU-accelerated Inverse Distance Weighting (IDW) interpolation algorithm. First we redesign and improve our previous GPU implementation that was performed by exploiting the feature of CUDA dynamic parallelism (CDP). Then we implement three versions of GPU implementations, i.e., the naive version, the tiled version, and the improved CDP version, based upon five data layouts, including the Structure of Arrays (SoA), the Array of Structures (AoS), the Array of aligned Structures (AoaS), the Structure of Arrays of aligned Structures (SoAoS), and the Hybrid layout. We also carry out several groups of experimental tests to evaluate the impact. Experimental results show that: the layouts AoS and AoaS achieve better performance than the layout SoA for both the naive version and tiled version, while the layout SoA is the best choice for the improved CDP version. We also observe that: for the two combined data layouts (the SoAoS and the Hybrid), there are no notable performance gains when compared to other three basic layouts. We recommend that: in practical applications, the layout AoaS is the best choice since the tiled version is the fastest one among three versions. The source code of all implementations are publicly available.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: