30834

Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation

Yee Hin Chong, Jiaming Wu, Youhui Zhang, Peng Qu
Department of Computer Science and Technology, Tsinghua University, Beijing, China
arXiv:2605.26720 [cs.AI], (26 May 2026)

@misc{chong2026towards,

   title={Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation},

   author={Yee Hin Chong and Jiaming Wu and Youhui Zhang and Peng Qu},

   year={2026},

   eprint={2605.26720},

   archivePrefix={arXiv},

   primaryClass={cs.AI},

   url={https://arxiv.org/abs/2605.26720}

}

Download Download (PDF)   View View   Source Source   Source codes Source codes

416

views

Large language models (LLMs) have shown strong empirical gains as self-evolving agents for CUDA kernel generation, driven by feedback-conditioned planning across generations. However, how planning decisions attribute and combine heterogeneous feedback signals remains opaque. Standard end-to-end ablations fail to resolve this question, as iterative planning amplifies early perturbations and conflates feedback effects with trajectory-dependent drift. We introduce CUDAnalyst, a unified analysis layer for controlled, generation-level attribution of planning decisions to feedback components via trajectory freezing and selective feedback injection. CUDAnalyst enables stable generation-level evaluation and principled coalitional-style attribution of feedback effects and interactions. Our results show that explicit planning is beneficial only when feedback is aligned, that effective planning emerges from structured multi-feedback interactions, and that high-level plans from stronger reasoning models can partially transfer to weaker ones. These trends hold across reference backbones, representative workloads, and reference induction regimes, indicating that the identified feedback-to-plan structure is robust within the controlled axes studied.
No votes yet.
Please wait...

You must be logged in to post a comment.

* * *

* * *

HGPU group © 2010-2026 hgpu.org

All rights belong to the respective authors

Contact us: