Ultra-Fast Detection of Higher-Order Epistatic Interactions on GPUs
Institut fur Informatik, Johannes Gutenberg-Universitat Mainz, Germany
22nd International European Conference on Parallel and Distributed Computing (Euro-Par 2016), 2016
@article{junger2016ultrafast,
title={Ultra-Fast Detection of Higher-Order Epistatic Interactions on GPUs},
author={Junger, Daniel and Hundt, Christian and Gonzalez-Dominguez, Jorge and Schmidt, Bertil},
year={2016}
}
Detecting higher-order epistatic interactions in Genome-Wide Association Studies (GWAS) remains a challenging task in the fields of genetic epidemiology and computer science. A number of algorithms have recently been proposed for epistasis discovery. However, they suffer from a high computational cost since statistical measures have to be evaluated for each possible combination of markers. Hence, many algorithms use additional filtering stages discarding potentially non-interacting markers in order to reduce the overall number of combinations to be examined. Among others, Mutual Information Clustering (MIC) is a common pre-processing filter for grouping markers into partitions using K-Means clustering. Potentially interacting candidates for high-order epistasis are then examined exhaustively in a subsequent phase. However, analyzing real-world datasets of moderate size can still take several hours when performing analysis on a single CPU. In this work we propose a massively parallel computation scheme for the MIC algorithm targeting CUDA-enabled accelerators. Our implementation is able to perform epistasis discovery using more than 500,000 markers in just a couple of seconds in contrast to several hours when using the sequential MIC implementation. This runtime reduction by two orders-of-magnitude enables fast exploration of higher-order epistatic interactions even in large-scale GWAS datasets.
September 3, 2016 by hgpu