NUPAR: A Benchmark Suite for Modern GPU Architectures

hgpu.org » Applications » Computer science » NUPAR: A Benchmark Suite for Modern GPU Architectures

NUPAR: A Benchmark Suite for Modern GPU Architectures

Yash Ukidave, Fanny Nina Paravecino, Leiming Yu, Charu Kalra, Amir Momeni, Zhongliang Chen, Nick Materise, Brett Daley, Perhaad Mistry, David Kaeli

Advanced Micro Devices Inc. (AMD), Boxborough, MA

6th ACM/SPEC International Conference on Performance Engineering, ICPE 2015

@inproceedings{ukidave2015nupar,

title={NUPAR: A Benchmark Suite for Modern GPU Architectures},

author={Ukidave, Yash and Paravecino, Fanny Nina and Yu, Leiming and Kalra, Charu and Momeni, Amir and Chen, Zhongliang and Materise, Nick and Daley, Brett and Mistry, Perhaad and Kaeli, David},

booktitle={Proceedings of the 6th ACM/SPEC International Conference on Performance Engineering},

pages={253–264},

year={2015},

organization={ACM}

}

Download (PDF)

View

Source

Source codes

Package:

NUPAR Benchmark suite for modern GPU architectures

3146

views

Heterogeneous systems consisting of multi-core CPUs, Graphics Processing Units (GPUs) and many-core accelerators have gained widespread use by application developers and data-center platform developers. Modern day heterogeneous systems have evolved to include advanced hardware and software features to support a spectrum of application patterns. Heterogeneous programming frameworks such as CUDA, OpenCL, and OpenACC have all introduced new interfaces to enable developers to utilize new features on these platforms. In emerging applications, performance optimization is not only limited to effectively exploiting data-level parallelism, but includes leveraging new degrees of concurrency and parallelism to accelerate the entire application. To aid hardware architects and application developers in effectively tuning performance on GPUs, we have developed the NUPAR benchmark suite. The NUPAR applications belong to a number of different scientific and commercial computing domains. These benchmarks exhibit a range of GPU computing characteristics that consider memory-bandwidth limitations, device occupancy and resource utilization, synchronization latency and device-specific compute optimizations. The NUPAR applications are specifically designed to stress new hardware and software features that include: nested parallelism, concurrent kernel execution, shared hostdevice memory and new instructions for precise computation and data movement. In this paper, we focus our discussion on applications developed in CUDA and OpenCL, and focus on high-end server class GPUs. We describe these benchmarks and evaluate their interaction with different architectural features on a GPU. Our evaluation examines the behavior of the advanced hardware features on recently-released GPU architectures.

Tags: ATI, ATI Radeon HD 7970, Benchmarking, Computer science, CUDA, Heterogeneous systems, nVidia, OpenACC, OpenCL, Package, Tesla C2070, Tesla K40

February 13, 2015 by hgpu

Rating: 0.5/5. From 1 vote.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org