high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Fast Makespan Estimation for GPU Threads on a Single Streaming Multiprocessor

Fast Makespan Estimation for GPU Threads on a Single Streaming Multiprocessor

Kostiantyn Berezovskyi, Konstantinos Bletsas, Stefan M. Petters

CISTER Research Unit, Polytechnic Institute of Porto (ISEP-IPP), Rua Dr. Antonio Bernardino de Almeida, 431, 4200-072 Porto, Portugal

Technical Report CISTER-TR-130406, 2013

@techreport{berezovskyi2013fast,

title={Fast Makespan Estimation for GPU Threads on a Single Streaming Multiprocessor},

author={Berezovskyi, Kostiantyn and Bletsas, Konstantinos and Petters, Stefan M},

year={2013},

institution={Technical Report HURRAYTR-111215, CISTER/INESC-TEC, ISEP Research Center, Polytechnic Institute of Porto, Available at http://www. cister. isep. ipp. pt/people/Kostiantyn% 2BBerezovskyi/publications}

}

Download (PDF)

View

Source

1904

views

Graphics Processing Units (GPUs) are widely used to unload the CPUs, liberate other resources of a given computer system, and provide an alternative to multiprocessor computers as a means of processing computationally expensive parallel tasks. The recent trend of utilizing GPUs in embedded systems necessitates the development of timing analysis techniques for finding the joint worst-case execution time for a group of GPU threads of the same parallel application, on a streaming multiprocessor. The state-of-the-art approaches for computing the exact maximum makespan of GPU threads running on a single streaming multiprocessor are intractable and even pessimistic approximations usually take a long time to complete. We therefore develop a technique for finding an estimate of the maximum makespan using metaheuristics. Its simplicity, flexibility and ability for massive parallelization, determine a potential of usage for soft real-time systems.

Tags: Computer science, CUDA, Metaheuristics, nVidia, nVidia GeForce GTX 680, PTX, Timing analysis

April 22, 2013 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

Fast Makespan Estimation for GPU Threads on a Single Streaming Multiprocessor

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

Fast Makespan Estimation for GPU Threads on a Single Streaming Multiprocessor

Share this:

Recent source codes

Most viewed papers (last 30 days)