GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition

hgpu.org » Applications » Computer science » GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition

GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition

Hui Chen, Zijia Lin, Guiguang Ding, Jianguang Lou, Yusen Zhang, Borje Karlsson

Beijing National Research Center for Information Science and Technology (BNRist), School of Software, Tsinghua University, Beijing, China

arXiv:1907.05611 [cs.CL], (12 Jul 2019)

@misc{chen2019grn,

title={GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition},

author={Hui Chen and Zijia Lin and Guiguang Ding and Jianguang Lou and Yusen Zhang and Borje Karlsson},

year={2019},

eprint={1907.05611},

archivePrefix={arXiv},

primaryClass={cs.CL}

}

Download (PDF)

View

Source

1831

views

The dominant approaches for named entity recognition (NER) mostly adopt complex recurrent neural networks (RNN), e.g., long-short-term-memory (LSTM). However, RNNs are limited by their recurrent nature in terms of computational efficiency. In contrast, convolutional neural networks (CNN) can fully exploit the GPU parallelism with their feedforward architectures. However, little attention has been paid to performing NER with CNNs, mainly owing to their difficulties in capturing the long-term context information in a sequence. In this paper, we propose a simple but effective CNN-based network for NER, i.e., gated relation network (GRN), which is more capable than common CNNs in capturing long-term context. Specifically, in GRN we firstly employ CNNs to explore the local context features of each word. Then we model the relations between words and use them as gates to fuse local context features into global ones for predicting labels. Without using recurrent layers that process a sentence in a sequential manner, our GRN allows computations to be performed in parallel across the entire sentence. Experiments on two benchmark NER datasets (i.e., CoNLL2003 and Ontonotes 5.0) show that, our proposed GRN can achieve state-of-the-art performance with or without external knowledge. It also enjoys lower time costs to train and test.

Tags: Computer science, Neural networks, NLP, nVidia, Tesla P100

July 21, 2019 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

high performance computing on graphics processing units: hgpu.org