https://hgpu.org/?p=2000
MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs