https://hgpu.org/?p=14273
Automatic Optimization of Thread Mapping for a GPGPU Programming Framework