https://hgpu.org/?p=28356
Improving Performance of Iterative Applications through Interleaved Execution of Approximated CUDA Kernels