high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » CRINK: Automatic CUDA code generation for affine C programs

CRINK: Automatic CUDA code generation for affine C programs

Akanksha Singh

Department of Computer Science and Engineering, Indian Institute of Technology, Kanpur

Indian Institute of Technology, 2014

@phdthesis{singh2014crink,

title={CRINK: Automatic CUDA code generation for affine C programs},

author={Singh, Akanksha},

year={2014},

school={Indian Institute of Technology Kanpur}

}

Download (PDF)

View

Source

1647

views

Parallel programming has largely evolved as an efficient solution to a large number of compute intensive applications. Graphics Processing Unit (GPUs), traditionally designed to process computer graphics, are now widely applied to process large chunks of data parallely in many computationally expensive applications. While developing parallel programs to run on parallel computing platforms, such as CUDA, OpenCL, etc. requires knowledge of platform-specific concepts, it becomes very convenient if the process of parallelizing compute intensive sections of the program can be automated. We develop a tool CRINK, an end-to-end code transformation system, to convert sequential C programs to their parallel counterparts in CUDA. CRINK targets to parallelize the expensive sections (sections within loops) of the program while converting C programs to CUDA C programs. It incorporates handling of both irregular and regular kernels. We use concepts of Cycle Shrinking and Extended Cycle Shrinking for parallelism extractions and loop transformations. To analyse the performance, we run CRINK over the expensive sections taken from ZERO RC, SPEC, SANDIA RULES, Treepack and Higbie standard benchmarks. Analysis is done over 66 varied configurations of the benchmarks and datasets where we observe that drastic drops in computation times are achieved as the number of threads are increased while execution of the code transformed by CRINK.

Tags: Code generation, Computer science, CUDA, nVidia, Tesla C1060, Thesis

August 10, 2015 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

CRINK: Automatic CUDA code generation for affine C programs

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

CRINK: Automatic CUDA code generation for affine C programs

Share this:

Recent source codes

Most viewed papers (last 30 days)