Optimization Techniques for CUDA Application

Tushar Athawale, Xie Xu
Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL, US
Project Report of CIS6930 GPU: Parallel Architecture and Programming, 2012


   title={Optimization Techniques for CUDA Application},

   author={Athawale, T. and Xu, X.},



Download Download (PDF)   View View   Source Source   



In this paper, we summarize our experiment results of applying various optimization techniques for CUDA application running on NVIDIA Fermi GPUs. Our experiments on matrix multiplication and breadth first search algorithms show that optimization techniques such as coalesced global memory access, conflict-free shared memory access and data pre-fetching improve the performance of applications running on GPUs.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: