location: Current position: Home >> Scientific Research >> Paper Publications

Biomedical Named Entity Recognition Based on the Two Channels and Sentence-level Reading Control Conditioned LSTM-CRF

Hits:

Indexed by:会议论文

Date of Publication:2017-01-01

Included Journals:CPCI-S、Scopus

Volume:2017-January

Page Number:380-385

Key Words:Biomedical literature; Named entity recognition; LSTM-CRF

Abstract:Biomedical named entity recognition (Bio-NER) is an important preliminary step for many biomedical text mining tasks. The current mainstream methods for NER are based on the neural networks to avoid the complex hand-designed features derived from various linguistic analyses. However, the performance of these methods is always limited to exploring dependencies across output label and ignoring some potential word-level and sentence-level semantic information. Therefore, we propose a novel Long Short Term Memory (LSTM) Networks model SC-LSTM-CRF integrating two channels and the sentence-level embeddings for Bio-NER. In our model, two channels word embeddings are introduced as the input to obtain more abundant potential information, and the sentence-level reading control gate (SC) is inserted into the networks to integrate the implicit meaning of an entire sentence. The experimental results conducted on the BioCreative II GM corpus show that our method can achieve an F-score of 89.49%, which outperforms all state-of-the-art systems and is 0.88% higher than the best performing neural networks.

Pre One:基于双向LSTM和两阶段方法的触发词识别

Next One:Exploiting Argument Information to Improve Biomedical Event Trigger Identification via Recurrent Neural Networks and Supervised Attention Mechanisms