2928

Software-Based ECC for GPUs

Naoya Maruyama, Akira Nukada, Satoshi Matsuoka
Tokyo Institute of Technology, JST CREST
Symposium on Application Accelerators in High Performance Computing, 2009 (SAAHPC’09)

@conference{maruyama2009software,

   title={Software-based ECC for GPUs},

   author={Maruyama, N. and Nukada, A. and Matsuoka, S.},

   booktitle={2009 Symposium on Application Accelerators in High Performance Computing (SAAHPC’09)},

   year={2009}

}

Download Download (PDF)   View View   Source Source   

1709

views

Commodity off-the-shelf GPUs lack error checking mechanisms for graphics memory, whereas conventional HPC platforms have used hardware-based ECC for DRAMs. To alleviate this reliability concern, we propose a software-based ECC for GPGPU applications. We add small program codes to normal CUDA programs that compute ECCs for data residing in graphics memory so that transient bit-flips can be detected or masked. Preliminary performance studies with 3-D FFT and the N-body problem show that error checking using ECC can take 200% and 7% of overhead, respectively. We discuss that performance overheads are derived from the cost of ECC computation on GPUs.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: