https://hgpu.org/?p=14790
A general tridiagonal solver for coprocessors: Adapting g-Spike for the Intel Xeon Phi