OpenCL Performance on the Intel Heterogeneous Architecture Research Platform
McKelvey School of Engineering, Washington University in St. Louis, Saint Louis, MO, USA
Proc. of IEEE High-Performance Extreme Computing Conference (HPEC), 2020
@inproceedings{harris2020opencl,
title={OpenCL Performance on the Intel Heterogeneous Architecture Research Platform},
author={Harris, Steven and Chamberlain, Roger D. and Gill, Christopher},
booktitle={Proc. of IEEE High-Performance Extreme Computing Conference (HPEC)},
year={2020}
}
The fundamental operation of matrix multiplication is ubiquitous across a myriad of disciplines. Yet, the identification of new optimizations for matrix multiplication remains relevant for emerging hardware architectures and heterogeneous systems. Frameworks such as OpenCL enable computation orchestration on existing systems, and OpenCL's availability through the Intel High Level Synthesis compiler allows users to architect new designs for reconfigurable hardware using C/C++. Using the HARPv2 as a vehicle for exploration, we investigate the utility of several traditional matrix multiplication optimizations to better understand the performance portability of OpenCL and the implications for such optimizations on cache-coherent heterogeneous architectures. Our results give targeted insights into the applicability of best practices that were designed for existing architectures when used on emerging heterogeneous systems.
October 25, 2020 by hgpu