Efficient Resource Scheduling for Big Data Processing on Accelerator-based Heterogeneous Systems
Faculty of Electrical Engineering and Information Technology, RWTH Aachen University
Computer Communication & Collaboration, Vol. 3, Issue 2, 2015
@article{tarakji2015efficient,
title={Efficient Resource Scheduling for Big Data Processing on Accelerator-based Heterogeneous Systems},
author={Tarakji, Ayman and Hebbeker, David and Georgiev, Lyubomir},
journal={Computer Communication \& Collaboration},
volume={3},
number={2},
pages={1--25},
year={2015}
}
Accelerators are becoming widespread in the field of heterogeneous processing, performing computation tasks across a wide range of applications. In this paper, we examine heterogeneity in modern computing systems, in particular how to achieve a good level of resource utilization and fairness when multiple tasks with different load and computation ratios are processed. First, we present OCLSched, an OpenCL-based scheduler designed as a multiuser computing environment that exploits the full potential of the available resources in heterogeneous compute systems. Multiple tasks can be issued through a C++ API that relies on the OpenCL C++ wrapper; from that point on, the scheduler takes control and performs load scheduling. Owing to its implementation, our approach is easily applicable to a common OS. We validate our method through extensive experiments with a set of applications, which show that the low scheduling costs remain constant in total over a wide range of input sizes. We then demonstrate the ability of OCLSched to manage nontrivial, complex applications. Besides the general computation tasks used in our experiments, we present a new implementation concept for DenStream, a recent stream data clustering algorithm. Based on a specially designed task concept, the clustering functionality is described as a single user process within our scheduler, achieving its defined targets asynchronously to other general-purpose computations. In addition to different CPUs, a variety of modern GPU and other accelerator architectures are used in this work, including AMD's Graphics Core Next, NVIDIA's Kepler, and Intel's MIC (Many Integrated Core) architecture.
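To make the scheduling idea concrete, the following is a minimal, self-contained C++ sketch of submitting tasks with different load ratios to heterogeneous devices using a greedy least-loaded heuristic. The `Device`, `Task`, and `Scheduler` names and the device strings are illustrative assumptions, not the actual OCLSched API, and real OpenCL device management and kernel dispatch are omitted.

```cpp
#include <algorithm>
#include <string>
#include <utility>
#include <vector>

// Illustrative sketch only: a greedy load balancer over heterogeneous devices.
struct Device {
    std::string name;   // e.g. "CPU", "GPU (GCN)", "MIC"
    double load = 0.0;  // accumulated estimated work assigned so far
};

struct Task {
    std::string kernel; // kernel identifier
    double cost;        // estimated computational load of the task
};

class Scheduler {
public:
    explicit Scheduler(std::vector<Device> devices)
        : devices_(std::move(devices)) {}

    // Assign the submitted task to the currently least-loaded device
    // and return that device's name.
    const std::string& submit(const Task& t) {
        auto it = std::min_element(
            devices_.begin(), devices_.end(),
            [](const Device& a, const Device& b) { return a.load < b.load; });
        it->load += t.cost;
        return it->name;
    }

    // Report the accumulated load assigned to a named device.
    double load_of(const std::string& name) const {
        for (const auto& d : devices_)
            if (d.name == name) return d.load;
        return 0.0;
    }

private:
    std::vector<Device> devices_;
};
```

In a real multiuser setting, OCLSched additionally has to account for fairness between users and for device-specific computation ratios; the greedy rule above only conveys the basic load-balancing intuition.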
May 16, 2015 by hgpu