https://hgpu.org/?p=7299
Performance analysis and optimization of the OP2 framework on many-core architectures