high performance computing on graphics processing units: hgpu.org

Automatic compilation of MATLAB programs for synergistic execution on heterogeneous processors

Ashwin Prasad, Jayvant Anantpur, R. Govindarajan

Indian Institute of Science, Bangalore, India

Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, PLDI ’11, 2011

DOI:10.1145/1993498.1993517

@inproceedings{prasad2011automatic,
  title={Automatic compilation of MATLAB programs for synergistic execution on heterogeneous processors},
  author={Prasad, Ashwin and Anantpur, Jayvant and Govindarajan, R.},
  booktitle={Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation},
  pages={152--163},
  year={2011},
  organization={ACM}
}

MATLAB is an array language that was initially popular for rapid prototyping but is now increasingly used to develop production code for numerical and scientific applications. Typical MATLAB programs have abundant data parallelism. These programs also have control-flow-dominated scalar regions that affect the program's execution time. Today's computer systems have tremendous computing power in the form of traditional CPU cores and throughput-oriented accelerators such as graphics processing units (GPUs). Thus, an approach that maps the control-flow-dominated regions to the CPU and the data-parallel regions to the GPU can significantly improve program performance. In this paper, we present the design and implementation of MEGHA, a compiler that automatically compiles MATLAB programs to enable synergistic execution on heterogeneous processors. Our solution is fully automated and does not require programmer input for identifying data-parallel regions. We propose a set of compiler optimizations tailored for MATLAB. Our compiler identifies data-parallel regions of the program and composes them into kernels. The problem of combining statements into kernels is formulated as a constrained graph clustering problem. Heuristics are presented to map identified kernels to either the CPU or the GPU so that kernel execution on the two processors happens synergistically and the amount of data transfer needed is minimized. To ensure the required data movement for dependencies across basic blocks, we propose a data flow analysis and edge splitting strategy. Thus, our compiler automatically handles the composition of kernels, the mapping of kernels to the CPU and GPU, scheduling, and the insertion of required data transfers. We implemented the proposed compiler, and experimental evaluation on a set of MATLAB benchmarks shows that our approach achieves a geometric mean speedup of 19.8X for data-parallel benchmarks over native execution of MATLAB.
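
To make the pipeline described in the abstract more concrete (compose statements into kernels, map kernels to a processor, insert data transfers), here is a minimal Python sketch. It is not MEGHA's algorithm: the paper formulates kernel composition as constrained graph clustering and uses mapping heuristics, whereas this sketch substitutes a simple greedy fusion of consecutive data-parallel statements with a size cap standing in for the constraint, and it always places data-parallel kernels on the GPU. All names (`Stmt`, `compose_kernels`, `plan_transfers`, `max_stmts_per_kernel`) and the four-statement example program are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stmt:
    name: str
    data_parallel: bool  # True for array operations, False for scalar/control code
    reads: frozenset
    writes: frozenset


def compose_kernels(stmts, max_stmts_per_kernel=3):
    """Greedily fuse consecutive data-parallel statements into kernels.

    The size cap is an assumed stand-in for a real clustering constraint.
    Each scalar statement ends the current kernel and forms its own region.
    """
    kernels, current = [], []
    for s in stmts:
        if s.data_parallel and len(current) < max_stmts_per_kernel:
            current.append(s)
            continue
        if current:
            kernels.append(current)
        current = [s] if s.data_parallel else []
        if not s.data_parallel:
            kernels.append([s])
    if current:
        kernels.append(current)
    return kernels


def plan_transfers(kernels):
    """Map each kernel to a device and list the copies needed before it runs.

    Assumed mapping: data-parallel kernels go to the GPU, scalar regions to
    the CPU. Program inputs are assumed to start in CPU memory.
    """
    valid = {}  # variable -> set of devices holding an up-to-date copy
    plan = []   # (variable, source device, destination device)
    for kernel in kernels:
        device = "GPU" if kernel[0].data_parallel else "CPU"
        other = "CPU" if device == "GPU" else "GPU"
        for s in kernel:
            for v in sorted(s.reads):
                holders = valid.setdefault(v, {"CPU"})
                if device not in holders:
                    plan.append((v, other, device))
                    holders.add(device)
            for v in s.writes:
                valid[v] = {device}  # a write invalidates the other copy
    return plan


# Hypothetical program, written as MATLAB-like statements:
#   S1: a = x .* y     S2: b = a + 1     S3: s = b(1) * 2     S4: c = b .* s
program = [
    Stmt("S1", True, frozenset({"x", "y"}), frozenset({"a"})),
    Stmt("S2", True, frozenset({"a"}), frozenset({"b"})),
    Stmt("S3", False, frozenset({"b"}), frozenset({"s"})),
    Stmt("S4", True, frozenset({"b", "s"}), frozenset({"c"})),
]
kernels = compose_kernels(program)
plan = plan_transfers(kernels)
print([[s.name for s in k] for k in kernels])
print(plan)
```

In this example, S1 and S2 are fused into one GPU kernel, so the intermediate `a` never leaves GPU memory. Only `b` is copied back for the scalar statement S3, and only `s` is copied to the GPU for S4. Avoiding transfers in this way is what the abstract's mapping heuristics aim for.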

Tags: Benchmarking, Clustering, Computer science, CUDA, Data parallelism, Flow analysis, Heterogeneous systems, nVidia, nVidia GeForce 8800 GTS, Optimization

September 20, 2011 by hgpu
