https://hgpu.org/?p=24930
Ripple: Simplified Large-Scale Computation on Heterogeneous Architectures with Polymorphic Data Layout