https://hgpu.org/?p=28074
Kernel Launcher: C++ Library for Optimal-Performance Portable CUDA Applications