Performance Evaluation and Analysis of Sparse Matrix and Graph Kernels on Heterogeneous Processors
Key Laboratory of Data Engineering and Knowledge Engineering (MOE), and School of Information, Renmin University of China, Beijing, China
CCF Transactions on High Performance Computing (THPC), 2019
@article{zhang2019performance,
title={Performance Evaluation and Analysis of Sparse Matrix and Graph Kernels on Heterogeneous Processors},
author={Zhang, Feng and Liu, Weifeng and Feng, Ningxuan and Zhai, Jidong and Du, Xiaoyong},
year={2019}
}
Heterogeneous processors integrate very distinct compute resources such as CPUs and GPUs into the same chip, thus can exploit the advantages and avoid disadvantages of those compute units. We in this work evaluate and analyze eight sparse matrix and graph kernels on an AMD CPU-GPU heterogeneous processor by using 956 sparse matrices. Five characteristics, i.e., load balancing, indirect addressing, memory reallocation, atomic operations, and dynamic characteristics are our major considerations. The experimental results show that although the CPU and GPU parts access the same DRAM, very different performance behaviors are observed. For example, though the GPU part in general outperforms the CPU part, it cannot achieve the best performance in all cases given by the CPU part. Moreover, the bandwidth utilization of atomic operations on heterogeneous processors can be much higher than a high-end discrete GPU.
June 16, 2019 by hgpu