Generic Inverted Index on the GPU

Jingbo Zhou, Qi Guo, H. V. Jagadish, Wenhao Luan, Anthony K. H. Tung, Yueji Yang, Yuxin Zheng
School of Computing, National University of Singapore
arXiv:1603.08390 [cs.DB], (28 Mar 2016)


   title={Generic Inverted Index on the GPU},

   author={Zhou, Jingbo and Guo, Qi and Jagadish, H. V. and Luan, Wenhao and Tung, Anthony K. H. and Yang, Yueji and Zheng, Yuxin},






Download Download (PDF)   View View   Source Source   Source codes Source codes



Data variety, as one of the three Vs of the Big Data, is manifested by a growing number of complex data types such as documents, sequences, trees, graphs and high dimensional vectors. To perform similarity search on these data, existing works mainly choose to create customized indexes for different data types. Due to the diversity of customized indexes, it is hard to devise a general parallelization strategy to speed up the search. In this paper, we propose a generic inverted index on the GPU (called GENIE), which can support similarity search of multiple queries on various data types. GENIE can effectively support the approximate nearest neighbor search in different similarity measures through exerting Locality Sensitive Hashing schemes, as well as similarity search on original data such as short document data and relational data. Extensive experiments on different real-life datasets demonstrate the efficiency and effectiveness of our system.
Rating: 2.1/5. From 4 votes.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: