high performance computing on graphics processing units: hgpu.org

Active thread compaction for GPU path tracing

Ingo Wald

Intel

Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics, HPG ’11, 2011

DOI:10.1145/2018323.2018331

@inproceedings{wald2011active,
  title={Active thread compaction for GPU path tracing},
  author={Wald, I.},
  booktitle={Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics},
  pages={51--58},
  year={2011},
  organization={ACM}
}

Modern GPUs like NVidia’s Fermi internally operate in a SIMD manner by ganging multiple (32) scalar threads together into SIMD warps; if a warp’s threads diverge, the warp serially executes both branches, temporarily disabling threads that are not on that path. In this paper, we explore and thoroughly analyze the concept of active thread compaction—i.e., the process of taking multiple partially-filled warps and compacting them to fewer but fully utilized warps—in the context of a CUDA path tracer. Our results show that this technique can indeed lead to significant improvements in SIMD utilization, and corresponding savings in the amount of work performed; however, they also show that certain inadequacies of today’s hardware wipe out most of the achieved gains, leaving bottom-up speed-ups of a mere 12–16%. We believe our analysis of why this is the case will provide insight to other researchers experimenting with this technique in different contexts.
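
To make the idea in the abstract concrete, here is a minimal CPU-side sketch in Python. It is not the paper's CUDA implementation. It assumes a hypothetical pool of 128 paths (4 warps of 32 threads) in which every fourth path is still alive, and compares SIMD utilization before and after packing the surviving paths into contiguous lanes. The function names (`warps_needed`, `simd_utilization`, `compact`) and the survival pattern are illustrative assumptions.

```python
WARP_SIZE = 32  # threads ganged into one warp on Fermi, per the abstract


def warps_needed(active):
    """Count warps that contain at least one active thread."""
    return sum(
        1
        for start in range(0, len(active), WARP_SIZE)
        if any(active[start:start + WARP_SIZE])
    )


def simd_utilization(active):
    """Fraction of lanes doing useful work among the warps that must run."""
    warps = warps_needed(active)
    return sum(active) / (warps * WARP_SIZE) if warps else 0.0


def compact(active):
    """Stable compaction: indices of the active threads, in original order."""
    return [i for i, alive in enumerate(active) if alive]


# Hypothetical state: 4 warps, every 4th path still alive (assumption).
active = [i % 4 == 0 for i in range(4 * WARP_SIZE)]

# After compaction the survivors occupy the first lanes of the pool.
survivors = compact(active)
packed = [True] * len(survivors) + [False] * (len(active) - len(survivors))

print(simd_utilization(active))  # 0.25 (4 warps, 8 of 32 lanes busy in each)
print(simd_utilization(packed))  # 1.0  (1 fully occupied warp)
```

The sketch counts lane occupancy only. It ignores the cost of the compaction step itself and any hardware effects, so it says nothing about the net speed-up the abstract reports.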

Tags: 3D Graphics and Realism, Computer science, CUDA, nVidia, nVidia GeForce GTX 480, Presentation, Raytracing

September 29, 2011 by hgpu
