location: Current position: Home >> Scientific Research >> Paper Publications

An effective neural model extracting document level chemical-induced disease relations from biomedical literature.

Hits:

Indexed by:期刊论文

Date of Publication:2018-01-01

Journal:Journal of biomedical informatics

Included Journals:PubMed、SCIE、SSCI

Volume:83

Page Number:1-9

ISSN No.:1532-0480

Key Words:Chemical-induced diseases; Document level; Long short-term memory; Convolutional neural network; Text mining

Abstract:Since identifying relations between chemicals and diseases (CDR) are important for biomedical research and healthcare, the challenge proposed by BioCreative V requires automatically mining causal relationships between chemicals and diseases which may span sentence boundaries. Although most systems explore feature engineering and knowledge bases to recognize document level CDR relations, feature learning automatically is limited only in a sentence. In this work, we proposed an effective model that automatically learns document level semantic representations to extract chemical-induced disease (CID) relations from articles by combining advantages of convolutional neural network and recurrent neural network. First, to purposefully collect contexts, candidate entities existing in multiple sentences of an article were masked to make the model have ability to discern candidate entities and general terms. Next, considering the contiguity and temporality among associated sentences as well as the topic of an article, a hierarchical network architecture was designed at the document level to capture semantic information of different types of text segments in an article. Finally, a softmax classifier performed the CID recognition. Experimental results on the CDR corpus show that the proposed model achieves a good overall performance compared with other state-of-the-art methods. Although only using two types of embedding vectors, our approach can perform well for recognizing not only intra-sentential but also inter-sentential CID relations. Copyright © 2018. Published by Elsevier Inc.

Pre One:SemaTyP: a knowledge graph based literature mining method for drug discovery.

Next One:Exploring Answer Information for Question Classification in Community Question Answering