https://hgpu.org/?p=2705
Data access optimized applications on the GPU using NVIDIA CUDA