## A Tuned, Concurrent-Kernel Approach to Speed Up the APSP Problem

Dpto. Informatica, Universidad de Valladolid

13th International Conference Computational and Mathematical Methods in Science and Engineering (CMMSE 2013), 2013

@article{ortega2013tuned,

title={A Tuned, Concurrent-Kernel Approach to Speed Up the APSP Problem},

author={Ortega-Arranz, Hector and Torres, Yuri and Llanos, Diego R and Gonzalez-Escribano, Arturo},

year={2013}

}

The All-Pair Shortest-Path (APSP) problem is a well-known problem in graph theory whose objective is to find the shortest paths between any pair of nodes. Computing the distances from one source node to the rest and repeating this process for every node of the graph is an adequate solution for sparse graphs. During the last years the application of GPU devices have increased to accelerate this kind of problems. While the correctness of an NVIDIA CUDA implementation of this algorithm is easy to achieve, exploiting the GPU capabilities to obtain a good performance is a task for CUDA experienced programmers. A typical code tuning strategy is the selection of an appropriate threadBlocks size. Besides this, the concurrent deployment of several kernels that computes distances from different sources, also accelerates the execution times. In this paper we show that an adequate combination of both strategies represents a 11.5 % performance improvement between different, recommended CUDA configurations for the most costly kernel of the APSP problem.

May 25, 2013 by hgpu