https://hgpu.org/?p=27498
iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud