14457

Scheduling for new computing platforms with GPUs

Florence Monna
LIP6 – Laboratoire d’Informatique de Paris 6
tel-01127919, (31 March 2015)

@phdthesis{monna:tel-01127919,

   title={Scheduling for new computing platforms with GPUs},

   author={Monna, Florence},

   url={https://tel.archives-ouvertes.fr/tel-01127919},

   number={2014PA066390},

   school={Universit{‘e} Pierre et Marie Curie – Paris VI},

   year={2014},

   month={Nov},

   keywords={GPUs; Job scheduling; Plateformes h{‘e}t{‘e}rog{‘e}nes; Algorithmes d’approximation; Calcul haute performance; Approximation duale; Ordonnancement},

   type={Theses},

   pdf={https://tel.archives-ouvertes.fr/tel-01127919/file/2014PA066390.pdf},

   hal_id={tel-01127919},

   hal_version={v1}

}

Download Download (PDF)   View View   Source Source   

1709

views

More and more computers use hybrid architectures combining multi-core processors (CPUs) and hardware accelerators like GPUs (Graphics Processing Units). These hybrid parallel platforms require new scheduling strategies. This work is devoted to a characterization of this new type of scheduling problems. The most studied objective in this work is the minimization of the makespan, which is a crucial problem for reaching the potential of new platforms in High Performance Computing. The core problem studied in this work is scheduling efficiently n independent sequential tasks with m CPUs and k GPUs, where each task of the application can be processed either on a CPU or on a GPU, with minimum makespan. This problem is NP-hard, therefore we propose approximation algorithms with performance ratios ranging from 2 to (2q+1)/(2q)+1/(2qk), q>0, and corresponding polynomial time complexities. The proposed solving method is the first general purpose algorithm for scheduling on hybrid machines with a theoretical performance guarantee that can be used for practical purposes. Some variants of the core problem are studied: a special case where all the tasks are accelerated when assigned to a GPU, with a 3/2-approximation algorithm, a case where preemptions are allowed on CPUs, the same problem with malleable tasks, with an algorithm with a ratio of 3/2. Finally, we studied the problem with dependent tasks, providing a 6-approximation algorithm. Experiments based on realistic benchmarks have been conducted. Some algorithms have been integrated into the scheduler of the xKaapi runtime system for linear algebra kernels, and compared to the state-of-the-art algorithm HEFT.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: