https://hgpu.org/?p=16319
An Optimized Multiple Right-Hand Side Dslash Kernel for Intel Xeon Phi