https://hgpu.org/?p=15940
Bridging the Performance-Programmability Gap for FPGAs via OpenCL: A Case Study with OpenDwarfs