JCudaMP: OpenMP/Java on CUDA
University of Erlangen-Nuremberg, Martensstr., Erlangen, Germany
Proceedings of the 3rd International Workshop on Multicore Software Engineering, IWMSE ’10
@inproceedings{dotzler2010jcudamp,
  title        = {JCudaMP: OpenMP/Java on CUDA},
  author       = {Dotzler, G. and Veldema, R. and Klemm, M.},
  booktitle    = {Proceedings of the 3rd International Workshop on Multicore Software Engineering},
  pages        = {10--17},
  year         = {2010},
  organization = {ACM}
}
We present an OpenMP framework for Java that can exploit an available graphics card as an application accelerator. Dynamic languages (Java, C#, etc.) pose a challenge here because of their write-once-run-everywhere approach: it is impossible to assume at compile time whether an accelerator or graphics card will be available in the system at run time, and if so, which type. We present an execution model that dynamically analyzes the running environment to find out what hardware is attached. Based on the results, it dynamically rewrites the bytecode and generates the necessary GPGPU code on-the-fly. Furthermore, we solve two extra problems caused by the combination of Java and CUDA. First, CUDA-capable hardware usually has little memory compared to main memory. However, because Java is a pointer-free language, array data can be kept in main memory and buffered in GPU memory as needed. Second, CUDA requires one to copy data to and from the graphics card’s memory explicitly. As modern languages use many small objects, a naive implementation would involve many copy operations. This is exacerbated because Java uses arrays-of-arrays to implement multi-dimensional arrays. A clever copying technique and two new array packages allow for more efficient use of CUDA.
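The copying problem mentioned in the abstract stems from Java's array layout: a float[n][m] is n separate row objects, so transferring it to the GPU naively requires one copy per row. The sketch below is only an illustration of the flattening idea behind such array packages, assuming a row-major packed layout; the class and method names (e.g., FloatArray2D) are hypothetical and do not reflect the paper's actual API.

// Illustrative sketch only: shows how a 2D array can be backed by one
// contiguous float[] so it can be moved to GPU memory in a single copy,
// instead of one copy per row object of a plain float[rows][cols].
public class FloatArray2D {
    private final float[] data;   // one contiguous block, row-major
    private final int rows;
    private final int cols;

    public FloatArray2D(int rows, int cols) {
        this.rows = rows;
        this.cols = cols;
        this.data = new float[rows * cols];
    }

    public float get(int i, int j) {
        // index arithmetic replaces the inner-array lookup of float[][]
        return data[i * cols + j];
    }

    public void set(int i, int j, float value) {
        data[i * cols + j] = value;
    }

    // The backing array can be handed to the CUDA runtime as a single buffer.
    public float[] backingArray() {
        return data;
    }

    public static void main(String[] args) {
        FloatArray2D m = new FloatArray2D(4, 8);
        m.set(2, 3, 1.5f);
        System.out.println(m.get(2, 3));             // 1.5
        System.out.println(m.backingArray().length); // 32 elements in one block
    }
}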
August 23, 2011 by hgpu