high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison

Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison

Lidia Kuan, Joao Neves, Frederico Pratas, Pedro Tomas, Leonel Sousa

INESC-ID, IST, University of Lisbon, Lisboa, Portugal

International Work-Conference on Bioinformatics and Biomedical Engineering (IWBBIO), 2014

BibTeX

Download (PDF)

View

Source

2619

views

Phylogenetic inference is used to derive a "tree of life" for a collection of species whose DNA sequences are known. While several software packages have already been developed to take advantage of GPUs to accelerate phylogenetic inference, they typically require significant changes to the original code, constraining code maintenance. Recently, the OpenACC API was proposed to minimize the programming efforts on accelerator devices. In this work we evaluate the applicability of the OpenACC API for phylogenetic inference using the most recent MrBayes program (version 3.2.2). A new parallelization strategy is proposed that is specifically adapted to the latest version of MrBayes and minimizes the data transfers between the host (CPU) and the accelerating device (GPU). We further implement the proposed strategy using both the OpenACC and CUDA programming frameworks. Experimental results demonstrate that significant performance gains can be achieved using OpenACC with a reduced amount of programming effort. Comparing with state-of-art GPU’s implementations, the proposed OpenACC and CUDA implementations achieve a performance gain of up to 5.2x and 5.7x, respectively. Experimental results indicate that with a reduced amount of programming effort, we achieve a performance that is only 10% inferior to one obtained with CUDA, which uses device specific optimizations.

Tags: Computer science, CUDA, nVidia, nVidia GeForce GTX 580, OpenACC, Performance, Tesla K20

September 28, 2014 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison

Your response

Recent source codes

GEAK-agent: LLM-based AI agent, which can write correct and efficient GPU kernels automatically

OpenDwarfs 2025: re-engineered version of the OpenDwarfs benchmark suite, for compatibility with modern platforms

Specx: Speculative task-based runtime system

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

Most viewed papers (last 30 days)

Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)