high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism

Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism

Sangpil Lee, Won Woo Ro

School of Electrical and Electronic Engineering, Yonsei University, Seoul, Republic of Korea

The 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2013), 2013

@article{lee2013parallel,

title={Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism},

author={Lee, Sangpil and Ro, Won Woo},

year={2013}

}

Download (PDF)

View

Source

1910

views

GPU computing is at the forefront of highperformance computing, and it has greatly affected current studies on parallel software and hardware design because of its massively parallel architecture. Therefore, numerous studies have focused on the utilization of GPUs in various fields. However, studies of GPU architectures are constrained by the lack of a suitable GPU simulator. Previously proposed GPU simulators do not have sufficient simulation speed for advanced software and architecture studies. In this paper, we propose a new parallel simulation framework and a parallel simulation technique called work-group parallel simulation in order to improve the simulation speed for modern many-core GPUs. The proposed framework divides the GPU architecture into parallel and shared components, and it determines which GPU component can be effectively parallelized and can work correctly in multithreaded simulation. In addition, the work-group parallel simulation technique effectively boosts the performance of parallelized GPU simulation by eliminating the synchronization overhead. Experimental results obtained using a simulator with the proposed framework show that the proposed parallel simulation technique has a speed-up of up to 4.15 as compared to an existing sequential GPU simulator on an 8-core machine providing minimized cycle errors.

Tags: Computer science, CUDA, GPGPU-sim, nVidia, nVidia GeForce GTX 285

April 17, 2013 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

* * *

high performance computing on graphics processing units: hgpu.org

Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)

Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism

Share this:

Recent source codes

Most viewed papers (last 30 days)