https://hgpu.org/?p=17995
GPU Accelerated Finite Element Assembly with Runtime Compilation