https://hgpu.org/?p=3643
Extending the Scalability of Single Chip Stream Processors with On-chip Caches