high performance computing on graphics processing units: hgpu.org

GPU-to-GPU and Host-to-Host multipattern string matching on a GPU

Xinyan Zha, Sartaj Sahni

Computer and Information Science and Engineering, University of Florida, Gainesville, FL 32611

IEEE Transactions on Computers, 2012

DOI:10.1109/TC.2012.61

@misc{zha2012gpu,
  title={GPU-to-GPU and Host-to-Host multipattern string matching on a GPU},
  author={Zha, X. and Sahni, S.},
  year={2012}
}

We develop GPU adaptations of the Aho-Corasick (AC) and multipattern Boyer-Moore string matching algorithms for two cases: GPU-to-GPU (the input is initially in GPU memory and the output is left in GPU memory) and host-to-host (the input and output both reside in the memory of the host CPU). For the GPU-to-GPU case, we consider several refinements to a base GPU implementation and measure the performance gain from each refinement. For the host-to-host case, we analyze two strategies for communicating between the host and the GPU and show that one is optimal with respect to run time while the other requires less device memory. Experiments conducted on a 240-core NVIDIA Tesla GT200 GPU attached to a 2.8 GHz quad-core Xeon host CPU show that, for the GPU-to-GPU case, our Aho-Corasick GPU adaptation achieves a speedup of between 8.5 and 9.5 relative to a single-threaded CPU implementation and between 2.4 and 3.2 relative to the best multithreaded implementation. For the host-to-host case, the GPU AC code achieves a speedup of 3.1 relative to a single-threaded CPU implementation. However, the GPU is unable to deliver any speedup relative to the best multithreaded code running on the quad-core host: the measured speedups for this case ranged between 0.74 and 0.83.
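For readers unfamiliar with Aho-Corasick, the following is a minimal CPU-side sketch in Python of the automaton the abstract refers to, together with one common way of splitting the input so that independent workers (for example, GPU threads) can scan it in parallel. The abstract does not describe how the paper partitions its input, so the chunk-with-overlap scheme here is an assumption, and the names `build_automaton`, `search`, and `chunked_search` are hypothetical. The sequential loop over chunks only stands in for parallel threads; it is not the authors' CUDA code.

```python
from collections import deque


def build_automaton(patterns):
    """Build Aho-Corasick goto, failure, and output tables."""
    goto = [{}]   # goto[state][char] -> next state
    fail = [0]    # failure link of each state
    out = [[]]    # indices of the patterns recognized at each state
    for idx, pat in enumerate(patterns):
        state = 0
        for ch in pat:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(idx)
    # Breadth-first pass: depth-1 states keep failure link 0.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit matches that end at the failure state.
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def search(text, automaton, start=0, end=None):
    """Scan text[start:end]; return (end_position, pattern_index) pairs."""
    goto, fail, out = automaton
    end = len(text) if end is None else end
    state = 0
    matches = []
    for i in range(start, end):
        ch = text[i]
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for idx in out[state]:
            matches.append((i, idx))
    return matches


def chunked_search(text, patterns, chunk_size):
    """Assumed partitioning: each chunk is scanned independently.

    A worker starts (longest pattern - 1) characters before its chunk so
    that matches straddling a chunk boundary are still seen, and it keeps
    only matches that end inside its own chunk to avoid duplicates.
    """
    automaton = build_automaton(patterns)
    overlap = max(len(p) for p in patterns) - 1
    matches = []
    for s in range(0, len(text), chunk_size):  # one iteration per "thread"
        e = min(s + chunk_size, len(text))
        lo = max(0, s - overlap)
        for pos, idx in search(text, automaton, lo, e):
            if pos >= s:
                matches.append((pos, idx))
    return matches
```

Calling `chunked_search("ushers", ["he", "she", "his", "hers"], 2)` reports the same matches as a single pass of `search` over the whole text, which is the property any parallel partitioning of this kind has to preserve.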

Tags: Algorithms, Computer science, CUDA, nVidia, String matching, Tesla

August 3, 2012 by hgpu
