## Exploring FPGA Optimizations to Compute Sparse Numerical Linear Algebra Kernels

Instituto de Ingeniería Eléctrica, Facultad de Ingeniería, Universidad de la República, Montevideo, Uruguay

Applied Reconfigurable Computing. Architectures, Tools, and Applications. ARC 2020. Lecture Notes in Computer Science, vol 12083. Springer, 2020

@inproceedings{favaro2020exploring,
  title={Exploring {FPGA} Optimizations to Compute Sparse Numerical Linear Algebra Kernels},
  author={Favaro, Federico and Dufrechou, Ernesto and Ezzatti, Pablo and Oliver, Juan P},
  booktitle={International Symposium on Applied Reconfigurable Computing},
  pages={258--268},
  year={2020},
  organization={Springer}
}

The solution of sparse triangular linear systems (sptrsv) is the bottleneck of many numerical methods. It is therefore crucial to have efficient implementations of this kernel, at least for commonly used platforms. Field-Programmable Gate Arrays (FPGAs) have evolved greatly in recent years and have entered the HPC hardware ecosystem, largely due to their superior energy efficiency relative to more established accelerators. Until recently, designing for FPGAs required low-level Hardware Description Languages (HDLs) such as VHDL or Verilog. Manufacturers are now making a large effort to support High-Level Synthesis from languages such as C/C++ or OpenCL, but the gap between the performance of these designs and that of HDL designs has not yet been fully studied. This work focuses on the performance that FPGAs offer for computing the sptrsv using OpenCL. To this end, we implement different parallel variants of this kernel and experimentally evaluate several setups, varying, among other parameters, the work-group size, the number of compute units, the unroll factor and the vectorization factor.
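For readers unfamiliar with the kernel, the following is a minimal serial sketch of a sparse lower-triangular solve in plain C. It is not the authors' OpenCL implementation. The CSR storage layout, the function name `sptrsv_csr_lower`, and the convention that each row stores its diagonal entry last are all assumptions made for this illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical reference routine: solves L x = b by forward substitution,
 * where L is an n x n lower triangular matrix stored in CSR format.
 * Assumption: every row is non-empty and stores its diagonal entry last. */
void sptrsv_csr_lower(size_t n, const size_t *row_ptr, const size_t *col_idx,
                      const double *val, const double *b, double *x)
{
    for (size_t i = 0; i < n; ++i) {
        /* Each row must contain at least its diagonal entry. */
        assert(row_ptr[i + 1] > row_ptr[i]);
        size_t last = row_ptr[i + 1] - 1;  /* position of the diagonal */
        assert(col_idx[last] == i && val[last] != 0.0);

        /* Subtract the contributions of the unknowns already solved. */
        double sum = b[i];
        for (size_t k = row_ptr[i]; k < last; ++k)
            sum -= val[k] * x[col_idx[k]];

        x[i] = sum / val[last];
    }
}
```

Each `x[i]` depends on previously computed entries of `x`, so the outer loop cannot be parallelized naively. Parallel variants of the kernel have to work around this dependency, which is one reason tuning parameters such as those listed in the abstract can matter.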

December 6, 2020 by hgpu