https://hgpu.org/?p=11079
Job Parallelism using Graphical Processing Unit Individual Multi-Processors and Localised Memory