Physis: An Implicitly Parallel Programming Model for Stencil Computations on Large-Scale GPU-Accelerated Supercomputers

Naoya Maruyama, Tatsuo Nomura, Kento Sato, Satoshi Matsuoka
Tokyo Institute of Technology, JST, CREST, 2-12-1 Ookayama, Meguro-ku, Tokyo, Japan
Proceedings of the 2011 ACM/IEEE conference on Supercomputing (SC’11), 2011

@article{maruyama2011physis,
   title={Physis: An Implicitly Parallel Programming Model for Stencil Computations on Large-Scale GPU-Accelerated Supercomputers},
   author={Maruyama, N. and Nomura, T. and Sato, K. and Matsuoka, S.},
   year={2011}
}


This paper proposes a compiler-based programming framework that automatically translates user-written structured grid code into scalable parallel implementation code for GPU-equipped clusters. To enable such automatic translations, we design a small set of declarative constructs that allow the user to express stencil computations in a portable and implicitly parallel manner. Our framework translates the user-written code into actual implementation code in CUDA for GPU acceleration and MPI for node-level parallelization, with automatic optimizations such as overlapping computation with communication. We demonstrate the feasibility of such automatic translations by implementing several structured grid applications in our framework. Experimental results on the TSUBAME2.0 GPU-based supercomputer show performance comparable to that of hand-written code, with good strong and weak scalability up to 256 GPUs.
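
To make the idea of an implicitly parallel stencil concrete, here is a minimal Python sketch. It is not the Physis API and does not come from the paper: every name in it (`diffusion_kernel`, `stencil_map`, `stencil_map_decomposed`) is hypothetical, the grid is one-dimensional, and the halo exchange is simulated with list slices rather than MPI. It only illustrates the separation the abstract describes: the user writes a per-point kernel, and the surrounding machinery decides how the grid is traversed or partitioned.

```python
# Hypothetical names throughout; this is an illustration, not the Physis API.

def diffusion_kernel(get, i):
    """User-written stencil: 3-point average around index i of a 1-D grid."""
    return (get(i - 1) + get(i) + get(i + 1)) / 3.0


def stencil_map(kernel, grid):
    """Serial 'runtime': apply the kernel at every interior point."""
    out = list(grid)  # boundary values are carried over unchanged
    for i in range(1, len(grid) - 1):
        out[i] = kernel(lambda j: grid[j], i)
    return out


def stencil_map_decomposed(kernel, grid, parts):
    """Decomposed 'runtime': split the grid into `parts` chunks, give each
    chunk a one-cell halo copied from its neighbours (a stand-in for an MPI
    halo exchange), run the same kernel on each chunk, and stitch the results."""
    n = len(grid)
    bounds = [n * p // parts for p in range(parts + 1)]
    out = list(grid)
    for p in range(parts):
        lo, hi = bounds[p], bounds[p + 1]
        halo_lo, halo_hi = max(lo - 1, 0), min(hi + 1, n)
        local = grid[halo_lo:halo_hi]  # the chunk plus its halo cells
        # Update only the points this chunk owns, skipping global boundaries.
        for i in range(max(lo, 1), min(hi, n - 1)):
            out[i] = kernel(lambda j: local[j - halo_lo], i)
    return out


grid = [0.0] * 4 + [9.0] + [0.0] * 4
serial = stencil_map(diffusion_kernel, grid)
split = stencil_map_decomposed(diffusion_kernel, grid, 3)
print(serial == split)
```

The kernel is identical in both cases; only the driver changes. Overlapping computation with communication, which the abstract lists as an automatic optimization, is not modelled in this sketch.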
