high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Interleaving and Lock-Step Semantics for Analysis and Verification of GPU Kernels

Interleaving and Lock-Step Semantics for Analysis and Verification of GPU Kernels

Peter Collingbourne, Alastair F. Donaldson, Jeroen Ketema, Shaz Qadeer

Imperial College London

22nd European Symposium on Programming (ESOP 2013), 2013

@article{collingbourne2013interleaving,

title={Interleaving and Lock-Step Semantics for Analysis and Verification of GPU Kernels},

author={Collingbourne, P. and Donaldson, A.F. and Ketema, J. and Qadeer, S.},

year={2013}

}

Download (PDF)

View

Source

Source codes

Package:

GPUVerify

2622

views

We study semantics of GPU kernels – the parallel programs that run on Graphics Processing Units (GPUs). We provide a novel lock-step execution semantics for GPU kernels represented by arbitrary reducible control flow graphs and compare this semantics with a traditional interleaving semantics. We show for terminating kernels that either both semantics compute identical results or both behave erroneously. The result induces a method that allows GPU kernels with arbitrary reducible control flow graphs to be verified via transformation to a sequential program that employs predicated execution. We implemented this method in the GPUVerify tool and experimentally evaluated it by comparing the tool with the previous version of the tool based on a similar method for structured programs, i.e., where control is organised using if and while statements. The evaluation was based on a set of 163 open source and commercial GPU kernels. Among these kernels, 42 exhibit unstructured control flow which our novel method can handle fully automatically, but the previous method could not. Overall the generality of the new method comes at a modest price: Verification across our benchmark set was 2.25 times slower overall; however, the median slow down across all kernels was 0.77, indicating that our novel technique yielded faster analysis in many cases.

Tags: Benchmarking, Computer science, nVidia, nVidia GeForce 9400 M, OpenCL, Package

January 15, 2013 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

Interleaving and Lock-Step Semantics for Analysis and Verification of GPU Kernels

Package:

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

Interleaving and Lock-Step Semantics for Analysis and Verification of GPU Kernels

Package:

Share this:

Recent source codes

Most viewed papers (last 30 days)