4855

GPGPU-Accelerated Parallel and Fast Simulation of Thousand-Core Platforms

Christian Pinto, Shivani Raghav, Andrea Marongiu, Martino Ruggiero, David Atienza, Luca Benini
DEIS – University of Bologna, Bologna, Italy
11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2011

@article{pintogpgpu,

   title={GPGPU-Accelerated Parallel and Fast Simulation of Thousand-core Platforms},

   author={Pinto, C. and Raghav, S. and Marongiu, A. and Ruggiero, M. and Atienza, D. and Benini, L.},

   booktitle={11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2011},

   year={2011}

}

Download Download (PDF)   View View   Source Source   

1472

views

The multicore revolution and the ever-increasing complexity of computing systems is dramatically changing sys-tem design, analysis and programming of computing platforms. Future architectures will feature hundreds to thousands of simple processors and on-chip memories connected through a network-on-chip. Architectural simulators will remain primary tools for design space exploration, software development and performance evaluation of these massively parallel architectures. However, architectural simulation performance is a serious concern, as virtual platforms and simulation technology are not able to tackle the complexity of thousands of core future scenarios. The main contribution of this paper is the development of a new simulation approach and technology for many core processors which exploit the enormous parallel processing capability of low-cost and widely available General Purpose Graphic Processing Units (GPGPU). The simulation of many-core architectures exhibits indeed a high level of parallelism and is inherently parallelizable, but GPGPU acceleration of architectural simulation requires an in-depth revision of the data structures and functional partitioning traditionally used in parallel simulation. We demonstrate our GPGPU simulator on a target architecture composed by several cores (i.e. ARM ISA based), with instruction and data caches, connected through a Network-on-Chip (NoC). Our experiments confirm the feasibility of our approach.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: