https://hgpu.org/?p=25076
TENSILE: A Tensor granularity dynamic GPU memory scheduler method towards multiple dynamic workloads system