https://hgpu.org/?p=6554
Bringing Parallel Performance to Python with Domain-Specific Selective Embedded Just-in-Time Specialization