6294

CUDA and OpenCL-based asynchronous PSO

Youssef S. G. Nashed, Alessandro Bacchini, Stefano Cagnoni, Luca Mussi
Department of Information Engineering, University of Parma, Italy
GPUs for Genetic and Evolutionary Computation Competition at 2011 Genetic and Evolutionary Computation Conference (GECCO-2011), 2011

@article{nashed2011cuda,

   title={CUDA and OpenCL-based asynchronous PSO},

   author={Nashed, Y.S.G. and Bacchini, A. and Cagnoni, S. and Mussi, L.},

   year={2011}

}

Download Download (PDF)   View View   Source Source   

1026

views

In "synchronous" PSO, positions and velocities of all particles are updated in turn in each "generation", after which each particle’s new fitness is evaluated. The value of the social attractor is only updated at the end of each generation, when the fitness values of all particles are known. The "asynchronous" version of PSO, instead, allows the social attractors to be updated immediately after evaluating each particle’s fitness, which causes the swarm to move more promptly towards newly-found optima. In asynchronous PSO, the velocity and position update equations can be applied to any particle at any time, in no specific order. The most common GPU implementations of PSO assign one thread per particle and do not take full advantage of the GPU power in evaluating the fitness function in parallel. Parallelization only occurs on the number of particles of a swarm and ignores the dimensions of the function. In our parallel implementations: (i) we designed the thread parallelization to be as fine-grained as possible, considering that, in PSO, velocity and position update occur independently over each dimension; (ii) we implemented an "asynchronous" PSO which, despite updating all particles in parallel, allows each of them to update the social attractor without waiting for all other particles’ fitness values to be evaluated.
Rating: 2.5. From 1 vote.
Please wait...

* * *

* * *

HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors

Contact us: