https://hgpu.org/?p=14781
CLOP: A Multi-stage Compiler to Seamlessly Embed Heterogeneous Code