https://hgpu.org/?p=17621
Toward Performance Portability for CPUs and GPUs Through Algorithmic Compositions