https://hgpu.org/?p=14584
SKMD: Single Kernel on Multiple Devices for Transparent CPU-GPU Collaboration