high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » CUDA and OpenCL-based asynchronous PSO

CUDA and OpenCL-based asynchronous PSO

Youssef S. G. Nashed, Alessandro Bacchini, Stefano Cagnoni, Luca Mussi

Department of Information Engineering, University of Parma, Italy

GPUs for Genetic and Evolutionary Computation Competition at 2011 Genetic and Evolutionary Computation Conference (GECCO-2011), 2011

@article{nashed2011cuda,

title={CUDA and OpenCL-based asynchronous PSO},

author={Nashed, Y.S.G. and Bacchini, A. and Cagnoni, S. and Mussi, L.},

year={2011}

}

Download (PDF)

View

Source

2778

views

In "synchronous" PSO, positions and velocities of all particles are updated in turn in each "generation", after which each particle’s new fitness is evaluated. The value of the social attractor is only updated at the end of each generation, when the fitness values of all particles are known. The "asynchronous" version of PSO, instead, allows the social attractors to be updated immediately after evaluating each particle’s fitness, which causes the swarm to move more promptly towards newly-found optima. In asynchronous PSO, the velocity and position update equations can be applied to any particle at any time, in no specific order. The most common GPU implementations of PSO assign one thread per particle and do not take full advantage of the GPU power in evaluating the fitness function in parallel. Parallelization only occurs on the number of particles of a swarm and ignores the dimensions of the function. In our parallel implementations: (i) we designed the thread parallelization to be as fine-grained as possible, considering that, in PSO, velocity and position update occur independently over each dimension; (ii) we implemented an "asynchronous" PSO which, despite updating all particles in parallel, allows each of them to update the social attractor without waiting for all other particles’ fitness values to be evaluated.

Tags: Computer science, CUDA, Genetic programming, nVidia, OpenCL

November 16, 2011 by hgpu

Rating: 2.5/5. From 1 vote.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

CUDA and OpenCL-based asynchronous PSO

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

CUDA and OpenCL-based asynchronous PSO

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)