Acceleration of Block-Aware Matrix Factorization on Heterogeneous Platforms

Gregory W. Somers
School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, Canada
University of Ottawa, 2016


   title={Acceleration of Block-Aware Matrix Factorization on Heterogeneous Platforms},

   author={Somers, Gregory W},


   school={Universit{‘e} d’Ottawa/University of Ottawa}


Download Download (PDF)   View View   Source Source   



Block-structured matrices arise in several contexts in circuit simulation problems. These matrices typically inherit the pattern of sparsity from the circuit connectivity. However, they are also characterized by dense spots or blocks. Direct factorization of those matrices has emerged as an attractive approach if the host memory is sufficiently large to store the block-structured matrix. The approach proposed in this thesis aims to accelerate the direct factorization of general block-structured matrices by leveraging the power of multiple OpenCL accelerators such as Graphical Processing Units (GPUs). The proposed approach utilizes the notion of a Directed Acyclic Graph representing the matrix in order to schedule its factorization on multiple accelerators. This thesis also describes memory management techniques that enable handling large matrices while minimizing the amount of memory transfer over the PCIe bus between the host CPU and the attached devices. The results demonstrate that by using two GPUs the proposed approach can achieve a nearly optimal speedup when compared to a single GPU platform.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: