GPGPUs: How to Combine High Computational Power with High Reliability

L. Bautista Gomez, F. Cappello, L. Carro, N. DeBardeleben, B. Fang, S. Gurumurthi, S. Keckler, K. Pattabiraman, P. Rech, M. Sonza Reorda
Argonne National Laboratory, USA
Design Automation and Test in Europe Conference and Exhibition, 2014


   title={GPGPUs: How to Combine High Computational Power with High Reliability},

   author={Gomez, L Bautista and Cappello, F and Carro, L and DeBardeleben, N and Fang, B and Gurumurthi, S and Keckler, S and Pattabiraman, K and Rech, P and Reorda, M Sonza},



Download Download (PDF)   View View   Source Source   



GPGPUs are increasingly used in several domains, from gaming to different kinds of computationally intensive applications. In some cases, their reliability is becoming a serious issue and several research activities are focusing on its evaluation. This paper aims at overviewing some major results in the area. First, it shows and analyzes the results of some experiments aiming at assessing the GPGPU reliability in HPC datacenters. Secondly, it provides recent results about the reliability of some GPGPUs, derived from radiation experiments. Finally, it describes the characteristics of an advanced fault injection environment allowing one to effectively evaluate the resiliency of applications running on GPGPUs.
Rating: 1.5/5. From 2 votes.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: