https://hgpu.org/?p=6087
Techniques to maximize memory bandwidth on the Rigel compute accelerator