https://hgpu.org/?p=7929
CuNesl: Compiling Nested Data-Parallel Languages for SIMT Architectures