https://hgpu.org/?p=13359
Batched Matrix Computations on Hardware Accelerators Based on GPUs