high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters

Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters

M. Mustafa Rafique, Srihari Cadambi, Kunal Rao, Ali R. Butt, Srimat Chakradhar

Dept. of Computer Science, Virginia Tech

IEEE International Conference on Cluster Computing (CLUSTER), 2011

DOI:10.1109/CLUSTER.2011.46

BibTeX

Download (PDF)

View

Source

1614

views

Coprocessors such as GPUs are increasingly being deployed in clusters to process scientific and compute-intensive jobs. In this work, we study if GPU-based heterogeneous clusters can benefit client-server applications. Specifically, we consider the practical situation where multiple client-server applications share a heterogeneous cluster (multi-tenancy), and experience unpredictable variations in incoming client request rates, including steep load spikes. Even for "compute-intensive" client-server applications, it is unclear if a GPU-based cluster can seamlessly deliver acceptable response times in the presence of multi-tenancy and load spikes. We argue that a cluster-level scheduler that is aware of application load, request deadlines and the heterogeneity is necessary in this situation. We propose a novel scheduler called Symphony that enables efficient, dynamic sharing of a GPU-based heterogeneous cluster across multiple concurrently-executing client-server applications, each with arbitrary load spikes. Symphony performs three key tasks: it (i) monitors the load on each application, (ii) collects past performance data and dynamically builds simple performance models of available processing resources and (iii) computes a priority for pending requests based on the above parameters and the requests’ slack. Based on this, it reorders client requests across different applications to achieve acceptable response times. We also define how client-server applications should interact with a scheduler such as Symphony, and develop an API to this end. We deploy Symphony as user-space middleware on a high-end heterogeneous cluster with dual quad-core Xeon CPUs and dual NVIDIA Fermi GPUs. An evaluation using representative applications shows that in the presence of load spikes (i) Symphony incurs 2-20x fewer requests that do not meet response time constraints compared with other schedulers, and (ii) in order to achieve the same performance as Symphony, other scheduler- – s need 2x more cluster nodes.

Tags: Cluster computing, Computer science, CUDA, GPU cluster, Heterogeneous systems, nVidia, Task scheduling, Tesla C2050

November 24, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters

Your response

Recent source codes

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Most viewed papers (last 30 days)

Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)