12587

Optimizing performance per watt on GPUs in High Performance Computing: temperature, frequency and voltage effects

D. C. Price, M. A. Clark, B. R. Barsdell, R. Babich, L. J. Greenhill
Harvard-Smithsonian Center for Astrophysics, MS 42, 60 Garden Street, Cambridge MA, 01238 USA
arXiv:1407.8116 [astro-ph.IM], (30 Jul 2014)

@article{2014arXiv1407.8116P,

   author={Price}, D.~C. and {Clark}, M.~A. and {Barsdell}, B.~R. and {Babich}, R. and {Greenhill}, L.~J.},

   title={"{Optimizing performance per watt on GPUs in High Performance Computing: temperature, frequency and voltage effects}"},

   journal={ArXiv e-prints},

   archivePrefix={"arXiv"},

   eprint={1407.8116},

   primaryClass={"astro-ph.IM"},

   keywords={Astrophysics – Instrumentation and Methods for Astrophysics, Computer Science – Distributed, Parallel, and Cluster Computing},

   year={2014},

   month={jul},

   adsurl={http://adsabs.harvard.edu/abs/2014arXiv1407.8116P},

   adsnote={Provided by the SAO/NASA Astrophysics Data System}

}

The magnitude of the real-time digital signal processing challenge attached to large radio astronomical antenna arrays motivates use of high performance computing (HPC) systems. The need for high power efficiency (performance per watt) at remote observatory sites parallels that in HPC broadly, where efficiency is an emerging critical metric. We investigate how the performance per watt of graphics processing units (GPUs) is affected by temperature, core clock frequency and voltage. Our results highlight how the underlying physical processes that govern transistor operation affect power efficiency. In particular, we show experimentally that GPU power consumption grows non-linearly with both temperature and supply voltage, as predicted by physical transistor models. We show lowering GPU supply voltage and increasing clock frequency while maintaining a low die temperature increases the power efficiency of an NVIDIA K20 GPU by up to 37-48% over default settings when running xGPU, a compute-bound code used in radio astronomy. We discuss how temperature-aware power models could be used to reduce power consumption for future HPC installations. Automatic temperature-aware and application-dependent voltage and frequency scaling (T-DVFS and A-DVFS) may provide a mechanism to achieve better power efficiency for a wider range of codes running on GPUs.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors

Contact us: