Interleaved Learning and Exploration: A Self-Adaptive Fuzz Testing Framework for MLIR

hgpu.org » Applications » Computer science » Interleaved Learning and Exploration: A Self-Adaptive Fuzz Testing Framework for MLIR

Interleaved Learning and Exploration: A Self-Adaptive Fuzz Testing Framework for MLIR

Zeyu Sun, Jingjing Liang, Weiyi Wang, Chenyao Suo, Junjie Chen, Fanjiang Xu

Institute of Software, Chinese Academy of SciencesBeijing, China

arXiv:2510.07815 [cs.SE], (9 Oct 2025)

DOI:10.48550/arXiv.2510.07815

@misc{sun2025interleavedlearningexplorationselfadaptive,

title={Interleaved Learning and Exploration: A Self-Adaptive Fuzz Testing Framework for MLIR},

author={Zeyu Sun and Jingjing Liang and Weiyi Wang and Chenyao Suo and Junjie Chen and Fanjiang Xu},

year={2025},

eprint={2510.07815},

archivePrefix={arXiv},

primaryClass={cs.SE},

url={https://arxiv.org/abs/2510.07815}

}

Download (PDF)

View

Source

Source codes

Package:

Interleaved Learning and Exploration: A Self-Adaptive Fuzz Testing Framework for MLIR

235

views

MLIR (Multi-Level Intermediate Representation) has rapidly become a foundational technology for modern compiler frameworks, enabling extensibility across diverse domains. However, ensuring the correctness and robustness of MLIR itself remains challenging. Existing fuzzing approaches-based on manually crafted templates or rule-based mutations-struggle to generate sufficiently diverse and semantically valid test cases, making it difficult to expose subtle or deep-seated bugs within MLIR’s complex and evolving code space. In this paper, we present FLEX, a novel self-adaptive fuzzing framework for MLIR. FLEX leverages neural networks for program generation, a perturbed sampling strategy to encourage diversity, and a feedback-driven augmentation loop that iteratively improves its model using both crashing and non-crashing test cases. Starting from a limited seed corpus, FLEX progressively learns valid syntax and semantics and autonomously produces high-quality test inputs. We evaluate FLEX on the upstream MLIR compiler against four state-of-the-art fuzzers. In a 30-day campaign, FLEX discovers 80 previously unknown bugs-including multiple new root causes and parser bugs-while in 24-hour fixed-revision comparisons, it detects 53 bugs (over 3.5x as many as the best baseline) and achieves 28.2% code coverage, outperforming the next-best tool by 42%. Ablation studies further confirm the critical role of both perturbed generation and diversity augmentation in FLEX’s effectiveness.

Tags: Computer science, Neural networks, nVidia, nVidia GeForce RTX 4090, OpenCL, Package, Software Engineering

October 12, 2025 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

high performance computing on graphics processing units: hgpu.org