https://hgpu.org/?p=11996
Orchestrating Thread Scheduling and Cache Management to Improve Memory System Throughput in Throughput Processors