https://hgpu.org/?p=8172
Can GPGPU Programming Be Liberated from the Data-Parallel Bottleneck?