18780

On the Portability of GPU-Accelerated Applications via Automated Source-to-Source Translation

Paul Sathre, Mark Gardner, Wu-chun Feng
Virginia Tech, Dept. of Computer Science, Blacksburg, Virginia, USA
International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia), 2019

@inproceedings{sathre2019portability,

   title={On the Portability of CPU-Accelerated Applications via Automated Source-to-Source Translation},

   author={Sathre, Paul and Gardner, Mark and Feng, Wu-chun},

   booktitle={Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region},

   pages={1–8},

   year={2019},

   organization={ACM}

}

Over the past decade, accelerator-based supercomputers have grown from 0% to 42% performance share on the TOP500. Ideally, GPUaccelerated code on such systems should be "write once, run anywhere," regardless of the GPU device (or for that matter, any parallel device, e.g., CPU or FPGA). In practice, however, portability can be significantly more limited due to the sheer volume of code implemented in non-portable languages. For example, the tremendous success of CUDA, as evidenced by the vast cornucopia of CUDAaccelerated applications, makes it infeasible to manually rewrite all these applications to achieve portability. Consequently, we achieve portability by using our automated CUDA-to-OpenCL source-tosource translator called CU2CL. To demonstrate the state of the practice, we use CU2CL to automatically translate three medium-tolarge, CUDA-optimized codes to OpenCL, thus enabling the codes to run on other GPU-accelerated systems (as well as CPU- or FPGAbased systems). These automatically translated codes deliver performance portability, including as much as three-fold performance improvement, on a GPU device not supported by CUDA.
Rating: 2.0/5. From 1 vote.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: