high performance computing on graphics processing units: hgpu.org

AeminiumGPU: A CPU-GPU Hybrid Runtime for the Aeminium Language

Alcides Fonseca

Departamento De Engenharia Informatica, Faculdade De Ciencias E Tecnologia, Universidade De Coimbra

Departamento De Engenharia Informatica, Faculdade De Ciencias E Tecnologia, Universidade De Coimbra, 2011

@article{fonseca2011masters,
  title={Masters’ Degree in Informatics Engineering Dissertation},
  author={Fonseca, A. and Cabral, B.},
  year={2011}
}

Because CPU clock speeds are stagnating, programmers are turning to parallelism to improve the performance of their applications. Such parallelism has usually been obtained from multicore architectures, multiple CPUs, or clusters of machines, but the GPU has since been used as an alternative. GPUs are an attractive resource because they can provide much more processing power at a fraction of the cost of CPUs. However, GPU programming is not an easy task. Developers who do not understand the programming model and the hardware architecture of a GPU will not be able to extract all of its processing potential. It is harder still to write GPU code that outperforms an optimized CPU version.

This thesis proposes a high-level programming framework for parallel programs on both CPUs and GPUs. The approach, named AeminiumGPU, draws inspiration from functional programming and currently allows developers to implement programs based on the Map-Reduce pattern. In the future, the framework can be extended with other higher-order functions. AeminiumGPU does not force developers to understand the particularities of GPU programming. They write programs in pure Java (and soon Aeminium), and specific parts of that code are compiled to OpenCL and executed on the GPU.

To generate code that performs well, AeminiumGPU applies optimizations specific to the architecture of GPUs. For instance, it avoids unnecessary compilations and data transfers. Despite these optimizations, programs will not always run faster just because they execute on the GPU; CPU code can outperform the GPU version. To handle such cases and to ensure the fastest version is always executed, AeminiumGPU automatically decides whether a particular operation should be executed on the GPU or the CPU. These decisions are based on code complexity and input data size, collected at compile time and at run time.

AeminiumGPU contributes to reducing the development time and effort required for writing GPU programs. The framework also increases the performance of Java and Aeminium code. The contributions of this thesis also include a cost model for reasoning about the fastest architecture for a given program block.
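The CPU-or-GPU decision described in the abstract can be illustrated with a small Java sketch. This is not AeminiumGPU's API or its actual cost model: the class name `HybridMapReduceSketch`, the `chooseDevice` and `mapReduce` methods, and all cost constants are hypothetical, chosen only to show how operation complexity and input size might feed such a decision. No OpenCL code is generated here; the "GPU" branch is stood in for by a parallel Java stream.

```java
import java.util.function.DoubleBinaryOperator;
import java.util.function.DoubleUnaryOperator;
import java.util.stream.DoubleStream;

public class HybridMapReduceSketch {
    public enum Device { CPU, GPU }

    // Hypothetical cost constants in arbitrary units. A real runtime would
    // calibrate values like these by benchmarking the target hardware.
    static final double GPU_FIXED_OVERHEAD = 50_000.0;   // kernel compilation and launch
    static final double GPU_TRANSFER_PER_ELEMENT = 2.0;  // host <-> device copies
    static final double GPU_SPEEDUP = 20.0;              // assumed per-element compute speedup

    // Estimated cost of running the map operation over n elements on the CPU.
    public static double cpuCost(long n, double opCost) {
        return n * opCost;
    }

    // Estimated GPU cost: fixed overhead, data transfer, then faster compute.
    public static double gpuCost(long n, double opCost) {
        return GPU_FIXED_OVERHEAD + n * GPU_TRANSFER_PER_ELEMENT + n * opCost / GPU_SPEEDUP;
    }

    // Pick the device with the lower estimated cost (ties go to the CPU).
    public static Device chooseDevice(long n, double opCost) {
        return gpuCost(n, opCost) < cpuCost(n, opCost) ? Device.GPU : Device.CPU;
    }

    // Map-Reduce over a double array. opCost is a caller-supplied estimate of
    // the complexity of the map function (assumption: known ahead of time).
    public static double mapReduce(double[] input, DoubleUnaryOperator map,
                                   double identity, DoubleBinaryOperator reduce,
                                   double opCost) {
        DoubleStream stream = DoubleStream.of(input);
        if (chooseDevice(input.length, opCost) == Device.GPU) {
            // Stand-in for offloading to the GPU; this sketch stays on the CPU.
            stream = stream.parallel();
        }
        return stream.map(map).reduce(identity, reduce);
    }

    public static void main(String[] args) {
        double[] data = new double[1000];
        for (int i = 0; i < data.length; i++) {
            data[i] = i + 1;
        }
        // Sum of squares of 1..1000 with a cheap map function.
        System.out.println(mapReduce(data, x -> x * x, 0.0, Double::sum, 1.0));
        System.out.println(chooseDevice(1_000, 1.0));         // small input, cheap operation
        System.out.println(chooseDevice(10_000_000, 1.0));    // large input, cheap operation
        System.out.println(chooseDevice(10_000_000, 100.0));  // large input, expensive operation
    }
}
```

Under these assumed constants, a cheap operation stays on the CPU even for large inputs because the transfer cost dominates, while an expensive operation over a large input is sent to the "GPU" branch. The point is only the shape of the trade-off, not the specific thresholds.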

Tags: Compilers, Computer science, Java, nVidia, nVidia GeForce GTX 285, OpenCL, Optimization, Package, Thesis

October 19, 2011 by hgpu
