https://hgpu.org/?p=3457
A Case Study of SWIM: Optimization of Memory Intensive Application on GPGPU