Optimal Configuration of GPU Cache Memory to Maximize the Performance

Leonid Djinevski, Sime Arsenovski, Sasko Ristov, Marjan Gusev
FON University, Av. Vojvodina, 1000 Skopje, Macedonia
ICT Innovations, 2013


   title={Optimal Configuration of GPU Cache Memory to Maximize the Performance},

   author={Djinevski, Leonid and Arsenovski, Sime and Ristov, Sasko and Gusev, Marjan},



Download Download (PDF)   View View   Source Source   



GPU devices offer great performance when dealing with algorithms that require intense computational resources. A developer can configure the L1 cache memory of the latest GPU Kepler architecture with different cache size and cache set associativity, per Streaming Multiprocessors (SM). The performance of the computation intensive algorithms can be affected by these cache parameters. In this paper, we evaluate the influence of the performance for all possible configurations of L1 cache size and associativity, for dense matrix-matrix multiplication algorithm for various problem sizes. The results show a small impact of various L1 cache memory configurations for the overall performance of the algorithm.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: