https://hgpu.org/?p=2157
CUDA optimization strategies for compute- and memory-bound neuroimaging algorithms