https://hgpu.org/?p=14119
Automatic Data Layout Optimizations for GPUs