https://hgpu.org/?p=7747
Performance Gains in Conjugate Gradient Computation with Linearly Connected GPU Multiprocessors