10082

Data Structures and Algorithms for Counting Problems on Graphs using GPU

Amlan Chatterjee, Sridhar Radhakrishnan, John K. Antonio
School of Computer Science, University of Oklahoma, Norman, OK, 73019, USA
International Journal of Networking and Computing, Volume 3, Number 2, pages 264-288, 2013

@article{chatterjee2013data,

   title={Data Structures and Algorithms for Counting Problems on Graphs using GPU},

   author={Chatterjee, Amlan and Radhakrishnan, Sridhar and Antonio, John K},

   journal={International Journal of Networking and Computing},

   volume={3},

   number={2},

   pages={264–288},

   year={2013}

}

Download Download (PDF)   View View   Source Source   

2371

views

The availability and utility of large numbers of Graphical Processing Units (GPUs) have enabled parallel computations using extensive multi-threading. Sequential access to global memory and contention at the size-limited shared memory have been main impediments to fully exploiting potential performance in architectures having a massive number of GPUs. After performing extensive study of data structures and complexity analysis of various data access methodologies, we propose novel memory storage and retrieval techniques that enable parallel graph computations to overcome the above issues. More specifically, given a graph G = (V,E) and an integer k <= |V |, we provide both storage techniques and algorithms to count the number of: a) connected subgraphs of size k; b) k cliques; and c) k independent sets, all of which can be exponential in number. Our storage techniques are based on creating a breadth-first search tree and storing it along with non-tree edges in a novel way. Our experiments solve the above mentioned problems by using both naive and advanced data structures on the CPU and GPU. Speedup is achieved by solving the problems on the GPU even using a brute-force approach as compared to the implementations on the CPU. Utilizing the knowledge of BFS-tree properties, the performance gain on the GPU increases and ultimately outperforms the CPU by a factor of at least 5 for graphs that completely fit in the shared memory and by a factor of 10 for larger graphs stored using the global memory. The counting problems mentioned above have many uses, including the analysis of social networks.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: