high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » CUDA and OpenCL-based asynchronous PSO

CUDA and OpenCL-based asynchronous PSO

Youssef S. G. Nashed, Alessandro Bacchini, Stefano Cagnoni, Luca Mussi

Department of Information Engineering, University of Parma, Italy

GPUs for Genetic and Evolutionary Computation Competition at 2011 Genetic and Evolutionary Computation Conference (GECCO-2011), 2011

@article{nashed2011cuda,

title={CUDA and OpenCL-based asynchronous PSO},

author={Nashed, Y.S.G. and Bacchini, A. and Cagnoni, S. and Mussi, L.},

year={2011}

}

Download (PDF)

View

Source

2086

views

In "synchronous" PSO, positions and velocities of all particles are updated in turn in each "generation", after which each particle’s new fitness is evaluated. The value of the social attractor is only updated at the end of each generation, when the fitness values of all particles are known. The "asynchronous" version of PSO, instead, allows the social attractors to be updated immediately after evaluating each particle’s fitness, which causes the swarm to move more promptly towards newly-found optima. In asynchronous PSO, the velocity and position update equations can be applied to any particle at any time, in no specific order. The most common GPU implementations of PSO assign one thread per particle and do not take full advantage of the GPU power in evaluating the fitness function in parallel. Parallelization only occurs on the number of particles of a swarm and ignores the dimensions of the function. In our parallel implementations: (i) we designed the thread parallelization to be as fine-grained as possible, considering that, in PSO, velocity and position update occur independently over each dimension; (ii) we implemented an "asynchronous" PSO which, despite updating all particles in parallel, allows each of them to update the social attractor without waiting for all other particles’ fitness values to be evaluated.

Tags: Computer science, CUDA, Genetic programming, nVidia, OpenCL

November 16, 2011 by hgpu

Rating: 2.5/5. From 1 vote.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

* * *

high performance computing on graphics processing units: hgpu.org

CUDA and OpenCL-based asynchronous PSO

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)

CUDA and OpenCL-based asynchronous PSO

Share this:

Recent source codes

Most viewed papers (last 30 days)