8626

Vectorized Higher Order Finite Difference Kernels

Gerhard Zumbusch
Friedrich-Schiller-Universitat Jena, Institut fur Angewandte Mathematik, 07743 Jena, Germany
State-of-the-Art in Scientific and Parallel Computing (PARA), 2012

@inproceedings{Zumbusch2012Vectorized,

   author={G. Zumbusch},

   title={Vectorized Higher Order Finite Difference Kernels},

   booktitle={PARA 2012, State-of-the-Art in Scientific and Parallel Computing},

   year={2012},

   editor={P. Manninen},

   series={LNCS},

   pages={15},

   publisher={Springer},

   address={Heidelberg},

   pdf={http://cse.mathe.uni-jena.de/pub/zumbusch/para12.pdf},

   ps={http://cse.mathe.uni-jena.de/pub/zumbusch/para12.ps.gz},

   annote={refereed}

}

Download Download (PDF)   View View   Source Source   

1018

views

Several highly optimized implementations of Finite Difference schemes are discussed. The combination of vectorization and an interleaved data layout, spatial and temporal loop tiling algorithms, loop unrolling, and parameter tuning lead to efficient computational kernels in one to three spatial dimensions, truncation errors of order two to twelve, and isotropic and compact anisotropic stencils. The kernels are implemented on and tuned for several processor architectures like recent Intel Sandy Bridge, Ivy Bridge and AMD Bulldozer CPU cores, all with AVX vector instructions as well as Nvidia Kepler and Fermi and AMD Southern and Northern Islands GPU architectures, as well as some older architectures for comparison. The kernels are either based on a cache aware spatial loop or on time-slicing to compute several time steps at once. Furthermore, vector components can either be independent, grouped in short vectors of SSE, AVX or GPU warp size or in larger virtual vectors with explicit synchronization. The optimal choice of the algorithm and its parameters depend both on the Finite Difference stencil and on the processor architecture.
No votes yet.
Please wait...

* * *

* * *

Featured events

2018
November
27-30
Hida Takayama, Japan

The Third International Workshop on GPU Computing and AI (GCA), 2018

2018
September
19-21
Nagoya University, Japan

The 5th International Conference on Power and Energy Systems Engineering (CPESE), 2018

2018
September
22-24
MediaCityUK, Salford Quays, Greater Manchester, England

The 10th International Conference on Information Management and Engineering (ICIME), 2018

2018
August
21-23
No. 1037, Luoyu Road, Hongshan District, Wuhan, China

The 4th International Conference on Control Science and Systems Engineering (ICCSSE), 2018

2018
October
29-31
Nanyang Executive Centre in Nanyang Technological University, Singapore

The 2018 International Conference on Cloud Computing and Internet of Things (CCIOT’18), 2018

HGPU group © 2010-2018 hgpu.org

All rights belong to the respective authors

Contact us: