An instruction-systolic programmable shader architecture for multi-threaded 3D graphics processing

Jung-Wook Park, Hoon-Mo Yang, Gi-Ho Park, Shin-Dug Kim, Charles C. Weems

Department of Computer Science, C532, Yonsei University, 134 Shinchon-dong Seoul, 120-749, Republic of Korea

Journal of Parallel and Distributed Computing, Volume 70, Issue 11, November 2010, Pages 1110-1118 (14 July 2010)

DOI:10.1016/j.jpdc.2010.07.002

To meet both the performance and programmability demands of 3D graphics applications, recent graphics processing units have employed vector and multithreaded SIMD architectures. This paper introduces a novel instruction-systolic array architecture, which transfers an instruction stream in a pipelined fashion to efficiently share the expensive functional resources of a graphics processor. In these parallel architectures, cache misses and dynamic branches can cause additional latency and complicate management. To address this problem, we combine a systolic execution scheme with on-demand warp activation that handles cache miss latency and branch divergence efficiently without significantly increasing hardware resources, in terms of either logic or register space. Simulation indicates that the proposed architecture offers 25% better performance than a traditional SIMD architecture with the same resources, and requires significantly fewer resources to match the performance of a typical modern vector multi-threaded GPU architecture.

Tags: Algorithms, Computer science, Hardware, Optimization

November 20, 2010 by hgpu
