https://hgpu.org/?p=28476
A portable C++ library for memory and compute abstraction on multi-core CPUs and GPUs