high performance computing on graphics processing units: hgpu.org

Quality-score guided error correction for short-read sequencing data using CUDA

Haixiang Shi, Bertil Schmidt, Weiguo Liu, Wolfgang Müller-Wittig

School of Computer Engineering, N40-2-32a Nanyang Ave., Nanyang Technological University, Singapore 639798

Procedia Computer Science, Vol. 1, No. 1. (May 2010), pp. 1129-1138.

DOI:10.1016/j.procs.2010.04.125

@article{shi2010quality,
  title={Quality-score guided error correction for short-read sequencing data using CUDA},
  author={Shi, H. and Schmidt, B. and Liu, W. and M{\"u}ller-Wittig, W.},
  journal={Procedia Computer Science},
  volume={1},
  number={1},
  pages={1129--1138},
  issn={1877-0509},
  year={2010},
  publisher={Elsevier}
}

Recently introduced sequencing technologies can produce massive amounts of short-read data. Detecting and correcting sequencing errors in this data is an important but time-consuming pre-processing step for de novo genome assembly. In this paper, we demonstrate how the quality score associated with each base call can be integrated into a CUDA-based parallel error correction algorithm. We show that quality-score guided error correction can improve the assembly accuracy of several datasets from the NCBI SRA (Short-Read Archive) in terms of N50 values as well as runtime. We further propose a number of improvements to our previously published CUDA-EC algorithm that improve its runtime by a factor of up to 1.88.
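The two quantities the abstract relies on, per-base quality scores and N50, can be illustrated with a short Python sketch. This is not the CUDA-EC implementation. It only shows the standard Phred relation between a quality score and an error probability, one plausible way to flag low-quality bases as correction candidates, and the usual definition of N50. The function names, the Phred+33 ASCII offset, and the threshold of 20 are assumptions made for illustration; the paper's actual encoding and cutoff are not given here.

```python
from typing import List


def phred_to_error_prob(q: int) -> float:
    """Standard Phred relation: P(error) = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def decode_qualities(quality_string: str, offset: int = 33) -> List[int]:
    """Decode an ASCII quality string into integer scores.

    The offset of 33 (Phred+33) is an assumption; some older
    pipelines used an offset of 64 instead.
    """
    return [ord(ch) - offset for ch in quality_string]


def suspect_positions(quality_string: str, min_quality: int = 20,
                      offset: int = 33) -> List[int]:
    """Return read positions whose quality falls below min_quality.

    Hypothetical helper: a quality-guided corrector could restrict
    its search for substitutions to these positions first.
    """
    scores = decode_qualities(quality_string, offset)
    return [i for i, q in enumerate(scores) if q < min_quality]


def n50(contig_lengths: List[int]) -> int:
    """Length L such that contigs of length >= L cover at least
    half of the total assembled bases. Returns 0 for empty input."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0


if __name__ == "__main__":
    print(suspect_positions("II#I5"))   # positions with score below 20
    print(n50([2, 3, 4, 5, 6]))         # N50 of a toy assembly
```

A higher N50 after error correction suggests that the assembler produced longer contigs from the corrected reads, which is how the abstract uses the metric.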

Tags: Bioinformatics, Biology, CUDA, Error recovery, Genetics, nVidia

January 11, 2011 by hgpu
