个人信息Personal Information
教授
博士生导师
硕士生导师
性别:女
毕业院校:大连理工大学
学位:博士
所在单位:计算机科学与技术学院
学科:计算机应用技术. 计算机软件与理论
办公地点:创新大厦A930
电子邮箱:lils@dlut.edu.cn
An approach to improve kernel-based Protein-Protein Interaction extraction by learning from large-scale network data
点击次数:
论文类型:期刊论文
发表时间:2015-07-15
发表刊物:METHODS
收录刊物:SCIE、PubMed
卷号:83
页面范围:44-50
ISSN号:1046-2023
关键字:Protein-Protein Interaction; Word representation; Distributed representation; Brown clusters
摘要:Protein-Protein Interaction extraction (PPIe) from biomedical literatures is an important task in biomedical text mining and has achieved desirable results on the annotated datasets. However, the traditional machine learning methods on PPIe suffer badly from vocabulary gap and data sparseness, which weakens classification performance. In this work, an approach capturing external information from the web-based data is introduced to address these problems and boost the existing methods. The approach involves three kinds of word representation techniques: distributed representation, vector clustering and Brown clusters. Experimental results show that our method outperforms the state-of-the-art methods on five publicly available corpora. Our code and data are available at: http://chaoslog.com/improving-kernel-based-protein-protein-interaction-extraction-by-unsupervised-word-representation-codes-and-data.html. (C) 2015 Elsevier Inc. All rights reserved.