high performance computing on graphics processing units: hgpu.org

Portable Mapping of Data Parallel Programs to OpenCL for Heterogeneous Systems

Dominik Grewe, Zheng Wang, Michael F.P. O’Boyle

School of Informatics, University of Edinburgh

International Symposium on Code Generation and Optimization (CGO), 2013

@article{grewe2013portable,
  title={Portable Mapping of Data Parallel Programs to OpenCL for Heterogeneous Systems},
  author={Grewe, D. and Wang, Z. and O’Boyle, M.F.P.},
  year={2013}
}

General-purpose GPU-based systems are highly attractive because they offer potentially massive performance at little cost. Realizing that potential is challenging due to the complexity of programming them. This paper presents a compiler-based approach that automatically generates optimized OpenCL code for GPUs from data-parallel OpenMP programs. The approach combines the benefits of a clear high-level language (OpenMP) with an emerging standard (OpenCL) for heterogeneous multi-cores. A key feature of our scheme is that it leverages existing transformations, especially data transformations, to improve performance on GPU architectures, and uses predictive modeling to determine automatically whether it is worthwhile to run the OpenCL code on the GPU or the OpenMP code on the multi-core host. We applied our approach to the entire NAS parallel benchmark suite and evaluated it on two distinct GPU-based systems: Core i7/NVIDIA GeForce GTX 580 and Core i7/AMD Radeon 7970. We achieved average speedups of 4.51x and 4.20x (up to 143x and 67x), respectively, over a sequential baseline. On average, this is 1.63 and 1.56 times faster than a hand-coded, GPU-specific OpenCL implementation developed by independent expert programmers.
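To make the OpenMP-to-OpenCL correspondence concrete, the following is a minimal sketch of a data-parallel OpenMP loop next to a hand-written OpenCL kernel that computes the same result. It is an illustration only, not output of the compiler described in the abstract: the names `vec_add_omp`, `vec_add`, and `VEC_ADD_CL_SRC` are hypothetical, and the kernel is an assumption about what an equivalent kernel could look like. The host code to build and launch the kernel is omitted.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example: a data-parallel OpenMP loop.
   Every iteration is independent, so the loop can be split across
   host threads. Without -fopenmp the pragma is ignored and the
   loop simply runs sequentially. */
void vec_add_omp(const float *a, const float *b, float *out, size_t n)
{
    assert(a != NULL && b != NULL && out != NULL);
    #pragma omp parallel for
    for (long i = 0; i < (long)n; ++i) {
        out[i] = a[i] + b[i];
    }
}

/* Hand-written OpenCL C counterpart, kept as a source string
   (an assumption for illustration, not generated code).
   Each loop iteration becomes one work-item; the loop index is
   replaced by get_global_id(0), and the bounds check guards
   against a global size rounded up past n. */
const char *const VEC_ADD_CL_SRC =
    "__kernel void vec_add(__global const float *a,\n"
    "                      __global const float *b,\n"
    "                      __global float *out,\n"
    "                      const unsigned int n)\n"
    "{\n"
    "    size_t i = get_global_id(0);\n"
    "    if (i < n) out[i] = a[i] + b[i];\n"
    "}\n";
```

The sketch covers only the code-generation side of the mapping. Whether the GPU version is actually faster than the host loop depends on problem size and data-transfer cost, which is the decision the abstract assigns to a predictive model.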

Tags: ATI, ATI Radeon HD 7970, Compilers, Computer science, Heterogeneous systems, Machine learning, nVidia, nVidia GeForce GTX 580, OpenCL, Programming Languages

January 8, 2013 by hgpu
