high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » MRPB: Memory Request Prioritization for Massively Parallel Processors

MRPB: Memory Request Prioritization for Massively Parallel Processors

Wenhao Jia, Kelly A. Shaw, Margaret Martonosi

Princeton University

The 20th Int. Symp. on High Performance Computer Architecture (HPCA 2014), 2014

@article{jia2014mrpb,

title={MRPB: Memory Request Prioritization for Massively Parallel Processors},

author={Jia, Wenhao and Shaw, Kelly A and Martonosi, Margaret},

year={2014}

}

Download (PDF)

View

Source

2399

views

Massively parallel, throughput-oriented systems such as graphics processing units (GPUs) offer high performance for a broad range of programs. They are, however, complex to program, especially because of their intricate memory hierarchies with multiple address spaces. In response, modern GPUs have widely adopted caches, hoping to providing smoother reductions in memory access traffic and latency. Unfortunately, GPU caches often have mixed or unpredictable performance impact due to cache contention that results from the high thread counts in GPUs. We propose the memory request prioritization buffer (MRPB) to ease GPU programming and improve GPU performance. This hardware structure improves caching efficiency of massively parallel workloads by applying two prioritization methods-request reordering and cache bypassing-to memory requests before they access a cache. MRPB then releases requests into the cache in a more cache-friendly order. The result is drastically reduced cache contention and improved use of the limited per-thread cache capacity. For a simulated 16KB L1 cache, MRPB improves the average performance of the entire PolyBench and Rodinia suites by 2.65x and 1.27x respectively, outperforming a state-of-the-art GPU cache management technique.

Tags: Computer science, CUDA, GPGPU-sim, nVidia, Performance, Tesla C2050

January 16, 2014 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

MRPB: Memory Request Prioritization for Massively Parallel Processors

Your response

Recent source codes

True 4-Bit Quantized CNN Training on CPU

cuFuzz: A GPU-oriented coverage-guided fuzzer for userland CUDA application

KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization

MSKernelBench & CUDAMaster

EvoScientist: Harness Vibe Research with Self-evolving AI Scientists

RepoLaunch: Automating Build and Test Pipeline of Code Repositories on ANY Language and ANY Platform

RepoLaunch: Automating Build and Test Pipeline of Code Repositories on ANY Language and ANY Platform

CONCUR: a benchmark designed to evaluate multithreaded Java code generated by LLMs

HIPRT: Ray Tracing using HIP

MXFP4 Training Support Codebase

Most viewed papers (last 30 days)

MRPB: Memory Request Prioritization for Massively Parallel Processors

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)