high performance computing on graphics processing units: hgpu.org

Automatic parallelization for graphics processing units

Alan Leung, Ondrej Lhotak, Ghulam Lashari

University of Waterloo, Waterloo, ON, Canada

Proceedings of the 7th International Conference on Principles and Practice of Programming in Java, PPPJ ’09

DOI:10.1145/1596655.1596670

@conference{leung2009automatic,
  title={Automatic parallelization for graphics processing units},
  author={Leung, A. and Lhot{\'a}k, O. and Lashari, G.},
  booktitle={Proceedings of the 7th International Conference on Principles and Practice of Programming in Java},
  pages={91--100},
  year={2009},
  organization={ACM}
}

Accelerated graphics cards, or Graphics Processing Units (GPUs), have become ubiquitous in recent years. On the right kinds of problems, GPUs greatly surpass CPUs in terms of raw performance. However, because they are difficult to program, GPUs are used only for a narrow class of special-purpose applications; the raw processing power made available by GPUs is unused most of the time. This paper presents an extension to a Java JIT compiler that executes suitable code on the GPU instead of the CPU. Both static and dynamic features are used to decide whether it is feasible and beneficial to offload a piece of code to the GPU. The paper presents a cost model that balances the speedup available from the GPU against the cost of transferring input and output data between main memory and GPU memory. The cost model is parameterized so that it can be applied to different hardware combinations. The paper also presents ways to overcome several obstacles to parallelization inherent in the design of the Java bytecode language: unstructured control flow, the lack of multi-dimensional arrays, precise exception semantics, and the proliferation of indirect references.
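The trade-off the abstract describes, GPU speedup weighed against the cost of moving data to and from GPU memory, can be illustrated with a small sketch. This is not the paper's cost model: the class name `OffloadCostModel`, its four parameters, and the linear form of the estimate are all assumptions chosen for illustration, and the numbers in `main` are made up rather than measured.

```java
// Illustrative sketch only: a linear offload cost estimate with
// hypothetical, hardware-specific parameters (not taken from the paper).
class OffloadCostModel {
    final double cpuNsPerIter;      // assumed CPU time per loop iteration
    final double gpuNsPerIter;      // assumed amortized GPU time per iteration
    final double transferNsPerByte; // assumed host <-> GPU copy cost per byte
    final double launchOverheadNs;  // assumed fixed cost of starting GPU work

    OffloadCostModel(double cpuNsPerIter, double gpuNsPerIter,
                     double transferNsPerByte, double launchOverheadNs) {
        this.cpuNsPerIter = cpuNsPerIter;
        this.gpuNsPerIter = gpuNsPerIter;
        this.transferNsPerByte = transferNsPerByte;
        this.launchOverheadNs = launchOverheadNs;
    }

    // Estimated time to run the loop on the CPU.
    double cpuCost(long iterations) {
        return iterations * cpuNsPerIter;
    }

    // Estimated time on the GPU: fixed overhead, input and output
    // transfers, then the computation itself.
    double gpuCost(long iterations, long bytesIn, long bytesOut) {
        return launchOverheadNs
                + (bytesIn + bytesOut) * transferNsPerByte
                + iterations * gpuNsPerIter;
    }

    // Offload only when the GPU estimate beats the CPU estimate.
    boolean shouldOffload(long iterations, long bytesIn, long bytesOut) {
        return gpuCost(iterations, bytesIn, bytesOut) < cpuCost(iterations);
    }

    public static void main(String[] args) {
        // Made-up parameters for one hypothetical CPU/GPU pairing.
        OffloadCostModel model = new OffloadCostModel(10.0, 0.5, 1.0, 50_000.0);

        // Small loop: fixed overhead dominates, so it stays on the CPU.
        System.out.println(model.shouldOffload(100, 400, 400));

        // Large loop: the per-iteration saving outweighs the transfers.
        System.out.println(model.shouldOffload(1_000_000, 4_000_000, 4_000_000));
    }
}
```

Keeping the hardware-dependent quantities as constructor parameters mirrors the abstract's point that the model is parameterized for different hardware combinations; a different CPU/GPU pairing would only need different values, not different decision logic.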

Tags: Computer science, Java, nVidia, nVidia GeForce 7800, Programming techniques, RapidMind

January 2, 2011 by hgpu
