high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Enabling Inter-Machine Parallelism in High-Level Languages with SEJITS and MapReduce

Enabling Inter-Machine Parallelism in High-Level Languages with SEJITS and MapReduce

Michael Driscoll, Evangelos Georganas, Penporn Koanantakool

Computer Science Division, University of California, Berkeley

University of California, 2012

@article{driscoll2012enabling,

title={Enabling Inter-Machine Parallelism in High-Level Languages with SEJITS and MapReduce},

author={Driscoll, M. and Georganas, E. and Koanantakool, P.},

year={2012}

}

Download (PDF)

View

Source

1877

views

Selective, embedded, just-in-time specialization (SEJITS) is a technique for optimizing embedded domain-specific languages through the use of specializers, or code modules developed by expert programmers that target particular accelerators such as multicore processors and GPUs via just-in-time compilation. We extend SEJITS to exploit inter-machine parallelism by targeting clusters of machines via MapReduce. Our work enables the development of specializers for large, data-parallel applications whose work flows can be cast as MapReduce operations. We present an implementation that targets Hadoop and we describe specializers for two applications. The first, a pure-Python protein docking application, requires a 1-line change to realize a 280x speedup on a cluster with 450 cores. The second, an audio processing application, demonstrates our approach’s ability to leverage clusters of GPU-equipped machines by composing parallel programming patterns. Results indicate that clusters are viable targets for specialization, and that pattern composition is a useful technique for managing multi-level parallelism.

Tags: Computer science, CUDA, MapReduce, nVidia, Programming techniques, Python, Tesla M2050

February 9, 2013 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

Enabling Inter-Machine Parallelism in High-Level Languages with SEJITS and MapReduce

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

Enabling Inter-Machine Parallelism in High-Level Languages with SEJITS and MapReduce

Share this:

Recent source codes

Most viewed papers (last 30 days)