https://hgpu.org/?p=20210
Characterizing Optimizations to Memory Access Patterns using Architecture-Independent Program Features