Investigating Single Precision Floating General Matrix Multiply in Heterogeneous
McKelvey School of Engineering of Washington University
Washington University, 2020
@article{harris2020investigating,
title={Investigating Single Precision Floating General Matrix Multiply in Heterogeneous},
author={Harris, Steven},
year={2020}
}
The fundamental operation of matrix multiplication is ubiquitous across a myriad of disciplines. Yet, the identification of new optimizations for matrix multiplication remains relevant for emerging hardware architectures and heterogeneous systems. Frameworks such as OpenCL enable computation orchestration on existing systems, and its availability using the Intel High Level Synthesis compiler allows users to architect new designs for reconfigurable hardware using C/C++. Using the HARPv2 as a vehicle for exploration, we investigate the utility of several of the most notable matrix multiplication optimizations to better understand the performance portability of OpenCL and the implications for such optimizations on this and future heterogeneous architectures. Our results give targeted insights into the applicability of best practices that were for existing architectures when used on emerging heterogeneous systems.
June 7, 2020 by hgpu