Optimal Configuration of GPU Cache Memory to Maximize the Performance
FON University, Av. Vojvodina, 1000 Skopje, Macedonia
ICT Innovations, 2013
@article{djinevski2013optimal,
title={Optimal Configuration of GPU Cache Memory to Maximize the Performance},
author={Djinevski, Leonid and Arsenovski, Sime and Ristov, Sasko and Gusev, Marjan},
year={2013}
}
GPU devices offer great performance when dealing with algorithms that require intense computational resources. A developer can configure the L1 cache memory of the latest GPU Kepler architecture with different cache size and cache set associativity, per Streaming Multiprocessors (SM). The performance of the computation intensive algorithms can be affected by these cache parameters. In this paper, we evaluate the influence of the performance for all possible configurations of L1 cache size and associativity, for dense matrix-matrix multiplication algorithm for various problem sizes. The results show a small impact of various L1 cache memory configurations for the overall performance of the algorithm.
September 16, 2013 by hgpu