30438

hls4ml: A Flexible, Open-Source Platform for Deep Learning Acceleration on Reconfigurable Hardware

Jan-Frederik Schulte, Benjamin Ramhorst, Chang Sun, Jovan Mitrevski, Nicolò Ghielmetti, Enrico Lupi, Dimitrios Danopoulos, Vladimir Loncar, Javier Duarte, David Burnette, Lauri Laatu, Stylianos Tzelepis, Konstantinos Axiotis, Quentin Berthet, Haoyan Wang, Paul White, Suleyman Demirsoy, Marco Colombo, Thea Aarrestad, Sioni Summers, Maurizio Pierini, Giuseppe Di Guglielmo, Jennifer Ngadiuba, Javier Campos, Ben Hawks, Abhijith Gandrakota, Farah Fahim, Nhan Tran, George Constantinides, Zhiqiang Que, Wayne Luk, Alexander Tapper, Duc Hoang, Noah Paladino, Philip Harris, Bo-Cheng Lai, Manuel Valentin, Ryan Forelli, Seda Ogrenci, Lino Gerlach, Rian Flynn, Mia Liu, Daniel Diaz, Elham Khoda, Melissa Quinnan, Russell Solares, Santosh Parajuli, Mark Neubauer, Christian Herwig, Ho Fung Tsoi, Dylan Rankin, Shih-Chieh Hsu, Scott Hauck
Purdue University, USA
arXiv:2512.01463 [cs.AR], (1 Dec 2025)

@misc{schulte2025hls4mlflexibleopensourceplatform,

   title={hls4ml: A Flexible, Open-Source Platform for Deep Learning Acceleration on Reconfigurable Hardware},

   author={Jan-Frederik Schulte and Benjamin Ramhorst and Chang Sun and Jovan Mitrevski and Nicolò Ghielmetti and Enrico Lupi and Dimitrios Danopoulos and Vladimir Loncar and Javier Duarte and David Burnette and Lauri Laatu and Stylianos Tzelepis and Konstantinos Axiotis and Quentin Berthet and Haoyan Wang and Paul White and Suleyman Demirsoy and Marco Colombo and Thea Aarrestad and Sioni Summers and Maurizio Pierini and Giuseppe Di Guglielmo and Jennifer Ngadiuba and Javier Campos and Ben Hawks and Abhijith Gandrakota and Farah Fahim and Nhan Tran and George Constantinides and Zhiqiang Que and Wayne Luk and Alexander Tapper and Duc Hoang and Noah Paladino and Philip Harris and Bo-Cheng Lai and Manuel Valentin and Ryan Forelli and Seda Ogrenci and Lino Gerlach and Rian Flynn and Mia Liu and Daniel Diaz and Elham Khoda and Melissa Quinnan and Russell Solares and Santosh Parajuli and Mark Neubauer and Christian Herwig and Ho Fung Tsoi and Dylan Rankin and Shih-Chieh Hsu and Scott Hauck},

   year={2025},

   eprint={2512.01463},

   archivePrefix={arXiv},

   primaryClass={cs.AR},

   url={https://arxiv.org/abs/2512.01463}

}

We present hls4ml, a free and open-source platform that translates machine learning (ML) models from modern deep learning frameworks into high-level synthesis (HLS) code that can be integrated into full designs for field-programmable gate arrays (FPGAs) or application-specific integrated circuits (ASICs). With its flexible and modular design, hls4ml supports a large number of deep learning frameworks and can target HLS compilers from several vendors, including Vitis HLS, Intel oneAPI and Catapult HLS. Together with a wider eco-system for software-hardware co-design, hls4ml has enabled the acceleration of ML inference in a wide range of commercial and scientific applications where low latency, resource usage, and power consumption are critical. In this paper, we describe the structure and functionality of the hls4ml platform. The overarching design considerations for the generated HLS code are discussed, together with selected performance results.
No votes yet.
Please wait...

You must be logged in to post a comment.

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us: