Accelerating cellular automata simulations using AVX and CUDA
Faculty of Physics and Astronomy, University of Wroclaw, pl. M. Borna 9, Wroclaw, Poland
arXiv:1208.2428v1 [cs.DC] (12 Aug 2012)
@article{2012arXiv1208.2428S,
author={Szkoda, Sebastian and Koza, Zbigniew and Tykierko, Mateusz},
title={{Accelerating cellular automata simulations using AVX and CUDA}},
journal={ArXiv e-prints},
archivePrefix={arXiv},
eprint={1208.2428},
primaryClass={cs.DC},
keywords={Distributed, Parallel, and Cluster Computing; Computational Physics; Fluid Dynamics},
year={2012},
month={aug}
}
We investigated various methods of parallelization of the Frisch-Hasslacher-Pomeau (FHP) cellular automata algorithm for modeling fluid flow. These methods include SSE, AVX, and POSIX Threads for central processing units (CPUs) and CUDA for graphics processing units (GPUs). We present implementation details of the FHP algorithm based on AVX/SSE and CUDA technologies. We found that (a) using AVX or SSE is necessary to fully utilize the potential of modern CPUs; (b) CPUs and GPUs are comparable in terms of computational and economic efficiency only if the CPU code uses AVX or SSE instructions; (c) AVX does not offer any substantial improvement relative to SSE.
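To illustrate why vector instructions suit this kind of lattice gas, here is a minimal C++ sketch of a bit-parallel collision step. It is not the authors' code. It assumes the basic FHP-I rules (head-on two-body and symmetric three-body collisions), which the abstract does not specify, and it uses a plain 64-bit word where an SSE or AVX implementation would use a wider register. The names `Dirs` and `fhp_collide` are hypothetical.

```cpp
#include <array>
#include <cstdint>

// One 64-bit word per lattice direction: bit k of a[i] is 1 if node k
// holds a particle moving in direction i (i = 0..5, 60 degrees apart).
using Dirs = std::array<std::uint64_t, 6>;

// Assumed FHP-I collision rules, applied to 64 nodes at once.
// rnd supplies one random bit per node that picks the rotation sense
// (+60 or -60 degrees) of a head-on pair.
inline Dirs fhp_collide(const Dirs& a, std::uint64_t rnd) {
    // Head-on pairs: directions i and i+3 occupied, the other four empty.
    std::array<std::uint64_t, 3> d{};
    for (int i = 0; i < 3; ++i) {
        d[i] = a[i] & a[i + 3]
             & ~(a[(i + 1) % 6] | a[(i + 2) % 6]
               | a[(i + 4) % 6] | a[(i + 5) % 6]);
    }

    // Symmetric three-body states: exactly {0,2,4} or exactly {1,3,5}.
    const std::uint64_t even = a[0] & a[2] & a[4];
    const std::uint64_t odd  = a[1] & a[3] & a[5];
    const std::uint64_t triple = (even & ~(a[1] | a[3] | a[5]))
                               | (odd  & ~(a[0] | a[2] | a[4]));

    Dirs out{};
    for (int j = 0; j < 6; ++j) {
        // Direction j changes state if the node holds a triple, if the
        // colliding pair lies along j, or if a pair rotates into j.
        const std::uint64_t flip = triple
            | d[j % 3]                    // pair along j leaves j
            | (rnd  & d[(j + 2) % 3])     // pair along j-1 rotated by +60
            | (~rnd & d[(j + 1) % 3]);    // pair along j+1 rotated by -60
        out[j] = a[j] ^ flip;
    }
    return out;
}
```

Every operation is a bitwise AND, OR, XOR, or NOT, so the same logic maps onto 128-bit or 256-bit registers and processes more nodes per instruction. Nodes that match neither collision pattern pass through unchanged, and each collision preserves the particle count and momentum at its node.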
August 14, 2012 by hgpu