Acceleration of Cellular Automata through Parallel Computing with OpenCL
Centro de Informatica – CI, Universidade Federal da Paraiba – UFPB, Joao Pessoa, Brazil
8th Workshop on Applications for Multi-Core Architectures (WAMCA), 2017
@inproceedings{pereira2017acceleration,
title={Acceleration of Cellular Automata through Parallel Computing with OpenCL},
author={Pereira, Maelso Bruno Pacheco Nunes and Pagot, Christian Azambuja and Junior, Josue da Silva Gomes and de Souza Ramos, Jorge Gabriel Gomes and Nascimento, Tiago P and Brito, Alisson V},
booktitle={2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)},
pages={73–78},
year={2017},
organization={IEEE}
}
Cellular Automata (CA) have its origins in the work of Von Neumann and, since then, have become an important research topic with a wide range of applications, ranging from DNA sequencing to ecological dynamics. One aspect that may be of interest during a CA simulation is the evolution in the number of individuals of each species along time. This analysis can give important information about the dominance of certain species in a dynamical system, or identify aspects that might favor one or more species in detriment of others. CA simulations can be computationally expensive tasks. Depending on the simulation domain size, number of dimensions or the number of individuals, these simulations can take several hours to complete. The evaluation of the number of individuals at each simulation step is an equally expensive task. Several acceleration techniques have been developed to improve the performance of CA simulations, and some of them take into account the evolution in the number of individuals along the simulation. In this work we propose an efficient CA simulator which is capable of efficiently evaluate the evolution in the number of individuals of each species. High performance is obtained through the use of the massive parallelism of GPUs. The presented approach achieved a speed-up of 44 times when compared to a sequential implementation, and 26 times when compared to a traditional approach also in GPU.
December 10, 2017 by hgpu