An Automatic Host and Device Memory Allocation Method for OpenMPC

Hiroaki Uchiyama, Tomoaki Tsumura, Hiroshi Matsuo
Nagoya Institute of Technology, Gokiso, Showa, Nagoya, Japan
3rd Int’l. Conf. on Networking and Computing (ICNC’12), 2012


   author={Kosuke SOBUE and Tomoaki TSUMURA and Hiroshi MATSUO},

   title={An Efficient Thread Recombinnig at Program Phase Changes},

   booktitle={Proc. 3rd Int’l Workshop on Advances in Networking and Computing (WANC’12)},






   location={Okinawa, Japan}


Download Download (PDF)   View View   Source Source   



The CUDA programming model provides better abstraction for GPU programming. However, it is still hard to write programs with CUDA because both some specific techniques and knowledge about GPU architecture is required. Hence, many programming frameworks for CUDA have been developed. OpenMPC is one of them based on OpenMP. OpenMPC s an easy-to-write framework for programmers familiar wih traditional OpenMP, but still requires programmers to use the special directives for utilizing fast device memories. To solve this problem, this paper proposes a method for allocating appropriate device memories automatically. This paper also proposes a method for automatically allocating page locaked memory for the data which are transferred between host and device. The evaluation results with several prgrams show that proposed methods can reduce 52% execution time in maximum.
No votes yet.
Please wait...

You must be logged in to post a comment.

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: