high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters

Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters

M. Mustafa Rafique, Srihari Cadambi, Kunal Rao, Ali R. Butt, Srimat Chakradhar

Dept. of Computer Science, Virginia Tech

IEEE International Conference on Cluster Computing (CLUSTER), 2011

DOI:10.1109/CLUSTER.2011.46

BibTeX

Download (PDF)

View

Source

1605

views

Coprocessors such as GPUs are increasingly being deployed in clusters to process scientific and compute-intensive jobs. In this work, we study if GPU-based heterogeneous clusters can benefit client-server applications. Specifically, we consider the practical situation where multiple client-server applications share a heterogeneous cluster (multi-tenancy), and experience unpredictable variations in incoming client request rates, including steep load spikes. Even for "compute-intensive" client-server applications, it is unclear if a GPU-based cluster can seamlessly deliver acceptable response times in the presence of multi-tenancy and load spikes. We argue that a cluster-level scheduler that is aware of application load, request deadlines and the heterogeneity is necessary in this situation. We propose a novel scheduler called Symphony that enables efficient, dynamic sharing of a GPU-based heterogeneous cluster across multiple concurrently-executing client-server applications, each with arbitrary load spikes. Symphony performs three key tasks: it (i) monitors the load on each application, (ii) collects past performance data and dynamically builds simple performance models of available processing resources and (iii) computes a priority for pending requests based on the above parameters and the requests’ slack. Based on this, it reorders client requests across different applications to achieve acceptable response times. We also define how client-server applications should interact with a scheduler such as Symphony, and develop an API to this end. We deploy Symphony as user-space middleware on a high-end heterogeneous cluster with dual quad-core Xeon CPUs and dual NVIDIA Fermi GPUs. An evaluation using representative applications shows that in the presence of load spikes (i) Symphony incurs 2-20x fewer requests that do not meet response time constraints compared with other schedulers, and (ii) in order to achieve the same performance as Symphony, other scheduler- – s need 2x more cluster nodes.

Tags: Cluster computing, Computer science, CUDA, GPU cluster, Heterogeneous systems, nVidia, Task scheduling, Tesla C2050

November 24, 2011 by hgpu

No votes yet.

Please wait...

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

high performance computing on graphics processing units: hgpu.org

Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters

Recent source codes

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Most viewed papers (last 30 days)

Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters

Share this:

Recent source codes

Most viewed papers (last 30 days)