3864

Exact and complete short-read alignment to microbial genomes using Graphics Processing Unit programming

Jochen Blom, Tobias Jakobi, Daniel Doppmeier, Sebastian Jaenicke, Jorn Kalinowski, Jens Stoye, Alexander Goesmann
Computational Genomics, CeBiTec, Bielefeld University, Bielefeld, Germany
Bioinformatics, Vol. 27, No. 10. (15 May 2011), pp. 1351-1358.

@article{blom2011exact,

   title={Exact and complete short-read alignment to microbial genomes using Graphics Processing Unit programming},

   author={Blom, J. and Jakobi, T. and Doppmeier, D. and Jaenicke, S. and Kalinowski, J. and Stoye, J. and Goesmann, A.},

   journal={Bioinformatics},

   issn={1367-4803},

   pages={1351–1358},

   volume={27},

   issue={10},

   year={2011},

   publisher={Oxford Univ Press}

}

Source Source   Source codes Source codes

Package:

2056

views

MOTIVATION: The introduction of next-generation sequencing techniques and especially the high-throughput systems Solexa (Illumina Inc.) and SOLiD (ABI) made the mapping of short reads to reference sequences a standard application in modern bioinformatics. Short-read alignment is needed for reference based re-sequencing of complete genomes as well as for gene expression analysis based on transcriptome sequencing. Several approaches were developed during the last years allowing for a fast alignment of short sequences to a given template. Methods available to date use heuristic techniques to gain a speedup of the alignments, thereby missing possible alignment positions. Furthermore, most approaches return only one best hit for every query sequence, thus losing the potentially valuable information of alternative alignment positions with identical scores. RESULTS: We developed SARUMAN (Semiglobal Alignment of short Reads Using CUDA and NeedleMAN-Wunsch), a mapping approach that returns all possible alignment positions of a read in a reference sequence under a given error threshold, together with one optimal alignment for each of these positions. Alignments are computed in parallel on graphics hardware, facilitating an considerable speedup of this normally time-consuming step. Combining our filter algorithm with CUDA-accelerated alignments, we were able to align reads to microbial genomes in time comparable or even faster than all published approaches, while still providing an exact, complete and optimal result. At the same time, SARUMAN runs on every standard Linux PC with a CUDA-compatible graphics accelerator. AVAILABILITY: http://www.cebitec.uni-bielefeld.de/brf/saruman/saruman.html.Contact: jblom@cebitec.uni-bielefeld.deSupplementary information: Supplementary data are available at Bioinformatics online.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: