Object support for OpenMP-style programming of GPU clusters in Java

hgpu.org » Applications » Computer science » Object support for OpenMP-style programming of GPU clusters in Java

Object support for OpenMP-style programming of GPU clusters in Java

Carolin Wolf, Georg Dotzler, Ronald Veldema, Michael Philippsen

University of Erlangen-Nuremberg, Computer Science Department, Programming Systems Group, Erlangen, Germany

27th International Conference on Advanced Information Networking and Applications Workshops (WAINA 2013), 2013

BibTeX

Download (PDF)

View

Source

2487

views

For scientists, it is advantageous to use a high level of abstraction for programming their simulations, so that they can focus on the problem at hand instead of struggling with low-level details. However, current HPC clusters with multiple GPUs per node only offer explicit communication to and from the GPUs, require manual work to keep the data consistent, and often need explicit kernel programming. Moreover, known GPU programming frameworks are limited to a single GPU or a single machine and also rarely support objects. Our system removes the above restrictions. With a slight but necessary change in Java’s semantics, we achieve automatic distribution and efficient use of objects and arrays of objects on multiple GPUs in a cluster. On benchmarks that distribute arrays of objects over five machines with 10 GPUs, we achieve speedups of up to 4.9 compared to one node.

Tags: Computer science, CUDA, GPU cluster, Java, nVidia, PC cluster, Tesla M1060

July 16, 2013 by hgpu

No votes yet.

Please wait...

high performance computing on graphics processing units: hgpu.org

Object support for OpenMP-style programming of GPU clusters in Java

Recent source codes

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)

Object support for OpenMP-style programming of GPU clusters in Java

Share this:

Recent source codes

Most viewed papers (last 30 days)