Efficient parallel lists intersection and index compression algorithms using graphics processing units
Nankai-Baidu Joint Lab, Nankai University, 94 Weijin Road, 300071, Tianjin, China
Proceedings of the VLDB Endowment, Volume 4 Issue 8, 2011
@article{ao2011efficient,
title={Efficient parallel lists intersection and index compression algorithms using graphics processing units},
author={Ao, N. and Zhang, F. and Wu, D. and Stones, D.S. and Wang, G. and Liu, X. and Liu, J. and Lin, S.},
journal={Proceedings of the VLDB Endowment},
volume={4},
number={8},
pages={470--481},
year={2011},
publisher={VLDB Endowment}
}
Major web search engines answer thousands of queries per second requesting information about billions of web pages. The data sizes and query loads are growing at an exponential rate. To manage the heavy workload, we consider techniques for utilizing a Graphics Processing Unit (GPU). We investigate new approaches to improve two important operations of search engines — lists intersection and index compression. For lists intersection, we develop techniques for efficient implementation of the binary search algorithm for parallel computation. We inspect some representative real-world datasets and find that a sufficiently long inverted list has an overall linear rate of increase. Based on this observation, we propose Linear Regression and Hash Segmentation techniques for contracting the search range. For index compression, the traditional d-gap based compression schemata are not well-suited for parallel computation, so we propose a Linear Regression Compression schema which has an inherent parallel structure. We further discuss how to efficiently intersect the compressed lists on a GPU. Our experimental results show significant improvements in the query processing throughput on several datasets.
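The range-contraction idea from the abstract can be illustrated with a small sketch. The CUDA program below is a minimal illustration, not the authors' implementation: one thread is assigned per element of the shorter list, and it binary-searches the longer list inside a window predicted by a least-squares fit docID ≈ slope·i + intercept. The window half-width `eps` is taken here to be the maximum residual of the fit, so the contracted range is guaranteed to contain the position of any matching docID; the toy data, names, and launch parameters are all assumptions for the example.

```cuda
// intersect_lr.cu -- a minimal sketch (not the paper's code) of parallel lists
// intersection with a linear-regression-contracted binary search.
#include <cstdio>
#include <cstdlib>
#include <cmath>
#include <vector>
#include <algorithm>
#include <cuda_runtime.h>

__global__ void intersect_lr(const int *shortList, int shortLen,
                             const int *longList, int longLen,
                             float slope, float intercept, int eps,
                             int *flags)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= shortLen) return;
    int key = shortList[i];

    // Predict the position of `key` in the long list and clamp a +/- eps window.
    int guess = (int)((key - intercept) / slope);
    int lo = max(0, guess - eps);
    int hi = min(longLen - 1, guess + eps);

    // Binary search restricted to the contracted range.
    flags[i] = 0;
    while (lo <= hi) {
        int mid = (lo + hi) >> 1;
        int v = longList[mid];
        if (v == key) { flags[i] = 1; return; }
        if (v < key) lo = mid + 1; else hi = mid - 1;
    }
}

int main()
{
    // Toy posting lists (sorted docIDs); real lists come from the inverted index.
    std::vector<int> longList, shortList = {15, 42, 77, 301, 555};
    for (int d = 3; d < 1000; d += 3) longList.push_back(d);   // roughly linear docIDs
    int n = (int)longList.size(), m = (int)shortList.size();

    // Least-squares fit docID ~= slope * position + intercept over the long list.
    double sx = 0, sy = 0, sxx = 0, sxy = 0;
    for (int i = 0; i < n; ++i) {
        sx += i; sy += longList[i];
        sxx += (double)i * i; sxy += (double)i * longList[i];
    }
    double slope = (n * sxy - sx * sy) / (n * sxx - sx * sx);
    double intercept = (sy - slope * sx) / n;

    // eps = maximum residual in position space, so the window always covers
    // the true position of any docID that is present in the long list.
    int eps = 0;
    for (int i = 0; i < n; ++i) {
        int guess = (int)((longList[i] - intercept) / slope);
        eps = std::max(eps, std::abs(guess - i));
    }

    int *dLong, *dShort, *dFlags;
    cudaMalloc(&dLong, n * sizeof(int));
    cudaMalloc(&dShort, m * sizeof(int));
    cudaMalloc(&dFlags, m * sizeof(int));
    cudaMemcpy(dLong, longList.data(), n * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemcpy(dShort, shortList.data(), m * sizeof(int), cudaMemcpyHostToDevice);

    intersect_lr<<<(m + 255) / 256, 256>>>(dShort, m, dLong, n,
                                           (float)slope, (float)intercept, eps, dFlags);

    std::vector<int> flags(m);
    cudaMemcpy(flags.data(), dFlags, m * sizeof(int), cudaMemcpyDeviceToHost);
    for (int i = 0; i < m; ++i)
        if (flags[i]) printf("docID %d is in both lists\n", shortList[i]);

    cudaFree(dLong); cudaFree(dShort); cudaFree(dFlags);
    return 0;
}
```

Because every thread works on an independent, pre-bounded range, no thread has to scan from the start of the list, which is what makes the binary-search approach amenable to the GPU's massively parallel execution model.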
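The compression argument can be sketched in the same spirit. With d-gap coding, recovering the i-th docID requires summing all earlier gaps (a prefix sum), which serializes decoding; a linear-regression-style encoding instead stores a fitted line plus a small residual per position, so every docID can be reconstructed independently. The layout below (plain `int` residuals and a hand-picked line) is an illustrative assumption, not the paper's exact storage format, where residuals would be bit-packed.

```cuda
// decode_lr.cu -- an illustrative sketch of decoding a linear-regression-style
// encoding on the GPU: docID[i] = slope * i + intercept + residual[i], with no
// dependence on position i-1 (unlike d-gap decoding).
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

__global__ void decode_lr(const int *residuals, int n,
                          float slope, float intercept, int *docIDs)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        docIDs[i] = (int)(slope * i + intercept) + residuals[i];  // independent per position
}

int main()
{
    // The list {10, 19, 31, 40, 52} encoded as the line docID ~= 10*i + 10
    // plus per-position residuals (values hand-picked for the example).
    const float slope = 10.0f, intercept = 10.0f;
    std::vector<int> residuals = {0, -1, 1, 0, 2};
    int n = (int)residuals.size();

    int *dRes, *dOut;
    cudaMalloc(&dRes, n * sizeof(int));
    cudaMalloc(&dOut, n * sizeof(int));
    cudaMemcpy(dRes, residuals.data(), n * sizeof(int), cudaMemcpyHostToDevice);

    decode_lr<<<1, 256>>>(dRes, n, slope, intercept, dOut);

    std::vector<int> docIDs(n);
    cudaMemcpy(docIDs.data(), dOut, n * sizeof(int), cudaMemcpyDeviceToHost);
    for (int i = 0; i < n; ++i) printf("%d ", docIDs[i]);   // prints 10 19 31 40 52
    printf("\n");

    cudaFree(dRes); cudaFree(dOut);
    return 0;
}
```

Independent per-position decoding is what the abstract means by an "inherent parallel structure", and it is also what allows the compressed lists to be intersected directly on the GPU without first materializing the full uncompressed lists.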
December 28, 2011 by hgpu