https://hgpu.org/?p=10445
Oncilla: A GAS Runtime for Efficient Resource Allocation and Data Movement in Accelerated Clusters