High Performance Code Generation for Stencil Computation on Heterogeneous Multi-device Architectures

Pei Li, Elisabeth Brunet, Raymond Namyst
Telecom SudParis, 9 rue Charles Fourier, 91011, Evry, France
hal-00925481, (8 January 2014)

@inproceedings{li:hal-00925481,
   hal_id={hal-00925481},
   url={http://hal.inria.fr/hal-00925481},
   title={High Performance Code Generation for Stencil Computation on Heterogeneous Multi-device Architectures},
   author={Li, Pei and Brunet, Elisabeth and Namyst, Raymond},
   language={Anglais},
   affiliation={D{\'e}partement Informatique – INF, Laboratoire Bordelais de Recherche en Informatique – LaBRI, RUNTIME – INRIA Bordeaux – Sud-Ouest},
   booktitle={HPCC-15th IEEE International Conference on High Performance Computing and Communications},
   publisher={IEEE Computer Society},
   address={Zhangjiajie, Chine},
   audience={internationale},
   year={2013},
   pdf={http://hal.inria.fr/hal-00925481/PDF/HPCC2013.pdf}
}

Heterogeneous architectures are widely used in high performance computing. On the one hand, they let a designer combine several types of computing units, each executing the tasks it is best suited for, to increase performance; on the other hand, they make programming challenging for novice users, especially on heterogeneous systems with multiple devices. In this paper, we propose STEPOCL, a code generator that produces OpenCL host programs for heterogeneous multi-device architectures. To simplify the analysis, we ask the user to describe the input and kernel parameters in an XML file; our generator then analyzes this description and automatically generates the host program. Thanks to its data partitioning and data exchange strategies, the generated host program can run on multiple devices without any change to the kernel code. Experiments with iterative stencil loop (ISL) code show that our tool is efficient: it guarantees minimal data exchange and achieves high performance on heterogeneous multi-device architectures.
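To make the idea of data partitioning and data exchange concrete, here is a minimal Python sketch of how a one-dimensional split of a stencil grid across devices might look. It is not STEPOCL's actual algorithm or output: the names `Part`, `partition_rows`, and `exchanged_rows_per_iteration`, the per-device `weights`, and the row-wise split are all assumptions made for illustration. Each device owns a contiguous block of rows and needs up to `radius` ghost (halo) rows from each neighbour, which must be refreshed after every iteration.

```python
from dataclasses import dataclass


@dataclass
class Part:
    device: int
    start: int     # first row owned by this device (inclusive)
    stop: int      # one past the last owned row (exclusive)
    halo_lo: int   # ghost rows needed from the previous neighbour
    halo_hi: int   # ghost rows needed from the next neighbour


def partition_rows(n_rows, weights, radius):
    """Split n_rows into contiguous blocks sized in proportion to weights.

    Hypothetical helper: `weights` stands in for relative device speed and
    `radius` for the stencil reach. Assumes every block ends up with at
    least `radius` rows, so halos only involve direct neighbours.
    """
    total = sum(weights)
    parts, start, acc = [], 0, 0
    for dev, w in enumerate(weights):
        acc += w
        # Cumulative rounding keeps blocks contiguous; the last block
        # always ends at n_rows so every row is covered exactly once.
        last = dev == len(weights) - 1
        stop = n_rows if last else round(n_rows * acc / total)
        # A block at the grid boundary has no neighbour on that side.
        halo_lo = min(radius, start)
        halo_hi = min(radius, n_rows - stop)
        parts.append(Part(dev, start, stop, halo_lo, halo_hi))
        start = stop
    return parts


def exchanged_rows_per_iteration(parts):
    """Total ghost rows that must be refreshed after each stencil sweep."""
    return sum(p.halo_lo + p.halo_hi for p in parts)
```

For example, `partition_rows(100, [1, 3], 1)` gives the first device rows 0 to 24 and the second rows 25 to 99, with two ghost rows exchanged per iteration (one in each direction). Under this scheme only the boundary rows cross devices, so the kernel itself never needs to know how the grid was split.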