high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA

Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA

Haixiang Shi, B. Schmidt, Weiguo Liu, W. Muller-Wittig

School of Computer Engineering, Nanyang Technological University, Singapore 639798

In IPDPS ’09: Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing (May 2009), pp. 1-8

DOI:10.1109/IPDPS.2009.5160924

@conference{shi2009accelerating,

title={Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA},

author={Shi, H. and Schmidt, B. and Liu, W. and Muller-Wittig, W.},

booktitle={Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on},

pages={1–8},

issn={1530-2075},

year={2009},

organization={IEEE}

}

Download (PDF)

View

Source

1943

views

Emerging DNA sequencing technologies open up exciting new opportunities for genome sequencing by generating read data with a massive throughput. However, produced reads are significantly shorter and more error-prone compared to the traditional Sanger shotgun sequencing method. This poses challenges for de-novo DNA fragment assembly algorithms in terms of both accuracy (to deal with short, error-prone reads) and scalability (to deal with very large input data sets). In this paper we present a scalable parallel algorithm for correcting sequencing errors in high-throughput short-read data. It is based on spectral alignment and uses the CUDA programming model. Our computational experiments on a GTX 280 GPU show runtime savings between 10 and 19 times (for different error-rates using simulated datasets as well as real Solexa/Illumina datasets).

Tags: Algorithms, Computer science, CUDA, Error recovery, nVidia, nVidia GeForce GTX 280

November 27, 2010 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA

Your response

Recent source codes

Awesome LLM-Driven Kernel Generation

PhysProver: Advancing Automatic Theorem Proving for Physics

ParaCodex: A Profiling-Guided Autonomous Coding Agent for Reliable Parallel Code Generation and Translation

SeedFold: Scaling Biomolecular Structure Prediction

Tilus: A Tile-Level GPU Kernel Programming Language

Memory-Efficient Acceleration of Block Low-Rank Foundation Models on Resource Constrained GPUs

BoltzGen:Toward Universal Binder Design

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution

MATLAB Tensor Core models

Most viewed papers (last 30 days)

Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)