high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Stack-less SIMT reconvergence at low cost

Stack-less SIMT reconvergence at low cost

Sylvain Collange

ENS de Lyon, Universite de Lyon, LIP (UMR 5668 CNRS – ENS de Lyon – INRIA – UCBL), Ecole Normale Superieure de Lyon, 46 allee d’Italie, 69364 Lyon Cedex 07, France

hal-00622654, version 1, 2011

BibTeX

Download (PDF)

View

Source

1882

views

Parallel architectures following the SIMT model such as GPUs benefit from application regularity by issuing concurrent threads running in lockstep on SIMD units. As threads take different paths across the control-flow graph, lockstep execution is partially lost, and must be regained whenever possible in order to maximize the occupancy of SIMD units. In this paper, we propose a technique to handle SIMT control divergence that operates in constant space and handles indirect jumps and recursion. We describe a possible implementation which leverage the existing memory divergence management unit, ensuring a low hardware cost. In terms of performance, this solution is at least as efficient as existing techniques.

Tags: ATI, Computer science, Hardware Architecture, Memory, nVidia, Performance

September 30, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Stack-less SIMT reconvergence at low cost

Your response

Recent source codes

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Most viewed papers (last 30 days)

Stack-less SIMT reconvergence at low cost

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)