https://hgpu.org/?p=5791
Performance Analysis and Optimisation of the OP2 Framework on Many-core Architectures