https://hgpu.org/?p=13932
AccFFT: A library for distributed-memory 3-D FFT on CPU and GPU architectures