https://hgpu.org/?p=29053
LeftoverLocals: Listening to LLM Responses Through Leaked GPU Local Memory