location: Current position: Home >> Scientific Research >> Paper Publications

An attention-based effective neural model for drug-drug interactions extraction

Hits:

Indexed by:期刊论文

Date of Publication:2017-10-10

Journal:BMC BIOINFORMATICS

Included Journals:Scopus、SCIE、EI、PubMed

Volume:18

Issue:1

Page Number:445

ISSN No.:1471-2105

Key Words:Attention; Recurrent neural network; Long short-term memory; Drug-drug interactions; Text mining

Abstract:Background: Drug-drug interactions (DDIs) often bring unexpected side effects. The clinical recognition of DDIs is a crucial issue for both patient safety and healthcare cost control. However, although text-mining-based systems explore various methods to classify DDIs, the classification performance with regard to DDIs in long and complex sentences is still unsatisfactory.
   Methods: In this study, we propose an effective model that classifies DDIs from the literature by combining an attention mechanism and a recurrent neural network with long short-term memory (LSTM) units. In our approach, first, a candidate-drug-oriented input attention acting on word-embedding vectors automatically learns which words are more influential for a given drug pair. Next, the inputs merging the position- and POS-embedding vectors are passed to a bidirectional LSTM layer whose outputs at the last time step represent the high-level semantic information of the whole sentence. Finally, a softmax layer performs DDI classification.
   Results: Experimental results from the DDIExtraction 2013 corpus show that our system performs the best with respect to detection and classification (84.0% and 77.3%, respectively) compared with other state-of-the-art methods. In particular, for the Medline-2013 dataset with long and complex sentences, our F-score far exceeds those of top-ranking systems by 12.6%.
   Conclusions: Our approach effectively improves the performance of DDI classification tasks. Experimental analysis demonstrates that our model performs better with respect to recognizing not only close-range but also long-range patterns among words, especially for long, complex and compound sentences.

Pre One:Leveraging Biomedical Resources in Bi-LSTM for Drug-Drug Interaction Extraction

Next One:Deep transfer learning for modality classification of medical images