high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » An Execution Model for OpenCL 2.0

An Execution Model for OpenCL 2.0

Benedict R. Gaster

Qualcomm, USA

Computer Sciences Technical Report 2014-02, 2014

BibTeX

Download (PDF)

View

Source

2661

views

A popular approach to programming manycore GPUs is the Single Instruction Multiple Thread (SIMT) abstraction. SIMT has the benefit of presenting a "single thread" view, alleviating the complexity of explicitly vectorizing the source code. However, due to the SIMD nature of the underlying hardware it is often difficult to fully hide all aspects from the developer. An example of "leaks", is OpenCL’s barrier, which requires all workitems (i.e. threads) to reach and execute the "same" barrier. But what does it mean to reach and execute the same barrier? OpenCL provides very little information about the underlying semantics. In this paper we describe a simple execution model for OpenCL 2.0 that captures precisely the semantics of operations like barrier and the more advanced features of subgroups, recently introduced to expose SIMD in a portable manner.

Tags: Computer science, OpenCL, Overview

November 9, 2014 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

An Execution Model for OpenCL 2.0

Your response

Recent source codes

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Most viewed papers (last 30 days)

An Execution Model for OpenCL 2.0

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)