Parsing in Parallel on Multiple Cores and GPUs
Centre for Language Sciences and Department of Computing, Macquarie University, Sydney, Australia
Proceedings of Australasian Language Technology Association Workshop, pages 29-37, 2011
@InProceedings{MarkJohnson2011ALTA2011,
author={Johnson, Mark},
title={Parsing in Parallel on Multiple Cores and GPUs},
booktitle={Proceedings of the Australasian Language Technology Association Workshop 2011},
month={December},
year={2011},
address={Canberra, Australia},
pages={29–37},
url={http://www.aclweb.org/anthology/U/U11/U11-2006}
}
This paper examines the ways in which parallelism can be used to speed the parsing of dense PCFGs. We focus on two kinds of parallelism here: Symmetric Multi-Processing (SMP) parallelism on shared-memory multicore CPUs, and Single-Instruction MultipleThread (SIMT) parallelism on GPUs. We describe how to achieve speed-ups over an already very efficient baseline parser using both kinds of technology. For our dense PCFG parsing task we obtained a 60x speed-up using SMP and SSE parallelism coupled with a cache-sensitive algorithm design, parsing section 24 of the Penn WSJ treebank in a little over 2 secs.
December 20, 2011 by hgpu