15422

A Performance Analysis Framework for Optimizing OpenCL Applications on FPGAs

Zeke Wang, Bingsheng He, Wei Zhang, Shunning Jiang
Nanyang Technological University, Singapore
The International Symposium on High-Performance Computer Architecture (HPCA), 2016

@article{wang2016performance,

   title={A Performance Analysis Framework for Optimizing OpenCL Applications on FPGAs},

   author={Wang, Zeke and He, Bingsheng and Zhang, Wei and Jiang, Shunning},

   journal={memory},

   volume={100},

   year={2016}

}

Download Download (PDF)   View View   Source Source   

1956

views

Recently, FPGA vendors such as Altera and Xilinx have released OpenCL SDK for programming FPGAs. However, the architecture of FPGA is significantly different from that of CPU/GPU, for which OpenCL is originally designed. Tuning the OpenCL code for good performance on FPGAs is still an open problem, since the existing OpenCL tools and models designed for CPUs/GPUs are not directly applicable to FPGAs. In the paper, we present an FPGA-based performance analysis framework that can shed light on the performance bottleneck and thus guide the code tuning for OpenCL applications on FPGAs. Particularly, we leverage static and dynamic analysis to develop an analytical performance model, which has captured the key architectural features of FPGA abstractions under OpenCL. Then, we provide four programmer-interpretable metrics to quantify the performance potentials of the OpenCL program with input optimization combination for the next optimization step. We evaluate our framework with a number of user cases, and demonstrate that 1) our analytical performance model can accurately predict the performance of OpenCL programs with different optimization combinations on FPGAs, and 2) our tool can be used to effectively guide the code tuning on alleviating the performance bottleneck.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: