https://hgpu.org/?p=1978
CUDA: Scalable parallel programming for high-performance scientific computing