location: Current position: Home >> Scientific Research >> Paper Publications

Integrating Language Model and Reading Control Gate in BLSTM-CRF for Biomedical Named Entity Recognition

Hits:

Indexed by:Journal Papers

Date of Publication:2020-05-01

Journal:IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS

Included Journals:SCIE

Volume:17

Issue:3

Page Number:841-846

ISSN No.:1545-5963

Key Words:Logic gates; Computer architecture; Biological system modeling; Computational modeling; Semantics; Microprocessors; Syntactics; Biomedical named entity recognition; Language model; LSTM-CRF; reading control gate

Abstract:Biomedical named entity recognition (Bio-NER) is an important preliminary step for many biomedical text mining tasks. The current mainstream methods for NER are based on the neural networks to avoid the complex hand-designed features derived from various linguistic analyses. However, these methods ignore some potential sentence-level semantic information and general features of semantic and syntactic. Therefore, we propose a novel Long Short Term Memory (LSTM) Networks model integrating language model and sentence-level reading control gate (LS-BLSTM-CRF) for Bio-NER. In our model, a sentence-level reading control gate (SC) is inserted into the networks to integrate the implicit meaning of an entire sentence and the language model is integrated to our model to learn richer potential features. Besides, character-level embeddings are introduced as the input to deal with out-of-vocabulary words. The experimental results conducted on the BioCreative II GM corpus show that our method can achieve an F-score of 89.94 percent, which outperforms all state-of-the-art systems and is 1.33 percent higher than the best performing neural networks.

Pre One:Exploiting dependency information to improve biomedical event detection via gated polar attention mechanism

Next One:Extracting drug-drug interactions from texts with BioBERT and multiple entity-aware attentions