Exploring Traditional and Emerging Parallel Programming Models using a Proxy Application

Ian Karlin, Abhinav Bhatele, Jeff Keasler, Bradford L. Chamberlain, Jonathan Cohen, Zachary DeVito, Riyaz Haque, Dan Laney, Edward Luke, Felix Wang, David Richards, Martin Schulz, Charles H. Still
Lawrence Livermore National Laboratory, P. O. Box 808, Livermore, California 94551 USA
IEEE International Parallel & Distributed Processing Symposium (IPDPS ’13), 2013
@article{karlin2013exploring,

   title={Exploring Traditional and Emerging Parallel Programming Models using a Proxy Application},

   author={Karlin, I. and Bhatele, A. and Keasler, J. and Chamberlain, B.L. and Cohen, J. and DeVito, Z. and Haque, R. and Laney, D. and Luke, E. and Wang, F. and others},

   year={2013}

}

Download Download (PDF)   View View   Source Source   
Parallel computing architectures are becoming more complex with increasing core counts and more heterogeneous architectures. However, the most commonly used programming models, C/C++ with MPI and/or OpenMP, make it very difficult to write source code that is easily tuned for many targets. Newer language approaches attempt to ease this burden by providing optimization features such as computation and communication overlap, message-driven execution, automatic load balancing and implicit data layout optimizations. In this paper, we compare multiple implementations of LULESH, a proxy application for a shock hydrodynamics, to determine strengths and weaknesses of four traditional (OpenMP, MPI, MPI+OpenMP, CUDA) and four emerging (Chapel, Charm++, Liszt, Loci) programming models for parallel computation. In evaluating these programming models, we focus on programmer productivity, performance and ease of applying optimizations.
VN:F [1.9.22_1171]
Rating: 4.0/5 (4 votes cast)
Exploring Traditional and Emerging Parallel Programming Models using a Proxy Application, 4.0 out of 5 based on 4 ratings

You must be logged in to post a comment.

* * *

* * *

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 11.4
  • SDK: AMD APP SDK 2.8
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 5.0.35, AMD APP SDK 2.8

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2014 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hgpu.org