https://hgpu.org/?p=2563
Speedups between x70 and x120 for a generic local search (memetic) algorithm on a single GPGPU chip