high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking

Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking

Saktheesh Subramoniapillai Ajeetha

The Ohio State University

The Ohio State University, 2012

@phdthesis{ajeetha2012architectural,

title={Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking},

author={Ajeetha, S.S.},

year={2012},

school={The Ohio State University}

}

Download (PDF)

View

Source

2790

views

Emergence of new Graphical Processors for general purpose computing presents new challenges for application developers. Graphical Processors vary in terms of number of processor cores per chip, processor speed and memory subsystems. NVIDIA’s CUDA provides a C-like abstraction layer for software developers to implement their applications on GPUs often with little knowledge of the underlying hardware and they are forced to work with high-level descriptions documented by the manufacturer. Substantial knowledge of the hardware architecture will be useful for harvesting the full potential of GPU architectures while trying to solve complex parallel programming problems. This work reports the measurements and characterization of performance of several NVIDIA GPU’s using micro benchmark analysis. Our thesis uses and adapts the CUDA Micro-benchmarks [8] and SHOC benchmarks [9] to characterize the important aspects of NVIDIA’s GTX200 series GPU- architecture machine (GTX280) and Fermi series – architecture machines (GTX580, Tesla C2050). The investigation is conducted by performing a micro architectural analysis of these machines and comparing their basic performance parameters. This thesis presents an experiment based methodology for characterizing the properties of the arithmetic pipelines. We also measure the global and shared memory latency and bandwidth of these machines and validate the hardware characteristics presented in CUDA programming guide. We hope that the insights from this work will be useful for improving the analysis and performance optimization of CUDA programs.

Tags: Benchmarking, Computer science, CUDA, nVidia, nVidia GeForce GTX 280, nVidia GeForce GTX 580, Performance, Tesla C2050, Thesis

September 6, 2012 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)