https://hgpu.org/?p=2198
Singular value decomposition on GPU using CUDA