high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Supporting Iteration in a Heterogeneous Data Flow Engine

Supporting Iteration in a Heterogeneous Data Flow Engine

Jon Currey, Simon Baker, Christopher J. Rossbach

Microsoft Research

The 3rd Workshop on Systems for Future Multicore Architectures, 2013

BibTeX

Download (PDF)

View

Source

2267

views

Dataflow execution engines such as MapReduce, DryadLINQ, and PTask have enjoyed success because they simplify development for a class of important parallel applications. These systems sacrifice generality for simplicity: while many workloads are easily expressed, important idioms like iteration and recursion are difficult to express and support efficiently. We consider the problem of extending a dataflow engine to support data-dependent iteration in a heterogeneous environment, where architectural diversity introduces data migration and scheduling challenges that complicate the problem. We propose constructs that enable a dataflow engine to efficiently support data-dependent control flow in a heterogeneous environment, implement them in a prototype system called IDEA, and use them to implement a variant of optical flow, a well-studied computer vision algorithm. Optical flow relies heavily on nested loops, making it difficult to express without explicit support for iteration. We demonstrate that IDEA enables up to 18x speedup over sequential and 32% speedup over a GPU implementation using synchronous host-based control.

Tags: Algorithms, Computer science, Computer vision, DirectCompute, Heterogeneous systems, MapReduce, nVidia, nVidia GeForce GTX 580, Optical flow

April 18, 2013 by hgpu

No votes yet.

Please wait...

high performance computing on graphics processing units: hgpu.org

Supporting Iteration in a Heterogeneous Data Flow Engine

Recent source codes

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)

Supporting Iteration in a Heterogeneous Data Flow Engine

Share this:

Recent source codes

Most viewed papers (last 30 days)