https://hgpu.org/?p=29162
Parallel Gaussian process with kernel approximation in CUDA