个人信息Personal Information
教授
博士生导师
硕士生导师
性别:女
毕业院校:大连理工大学
学位:博士
所在单位:计算机科学与技术学院
学科:计算机应用技术. 计算机软件与理论
办公地点:创新大厦A930
电子邮箱:lils@dlut.edu.cn
Integrating Active Learning Strategy to the Ensemble Kernel-based Method for Protein-Protein Interaction Extraction
点击次数:
论文类型:期刊论文
发表时间:2013-01-01
发表刊物:CHINESE JOURNAL OF ELECTRONICS
收录刊物:SCIE、EI、Scopus
卷号:22
期号:1
页面范围:41-45
ISSN号:1022-4653
关键字:Protein-protein interaction (PPI); Combined kernel; Active learning; SVM
摘要:This paper presents an ensemble kernel-based active learning method for PPI (Protein-protein interaction) extraction. This ensemble kernel is composed of feature-based kernel and structure-based kernel. Experimental results show that the F-scores of PPI extraction using ensemble kernel model on AIMED (Abstracts in medline), IEPA (the Interaction extraction performance assessment corpus) and BCPPI (Biocreative PPI dataset) corpora are 64.50%, 69.74% and 60.38% respectively. As the passive learning methods need large labeled data sets and it is expensive to label data manually, we integrate active learning strategy into the ensemble kernel model. The uncertainty-based sampling strategy is used in the active learning method. Two experiments for active learning are conducted on AIMED, IEPA, BCPPI corpus. The experimental results integrating the active learning strategy show that the F-scores on AIMED, IEPA and BCPPI corpora are better than those using the passive learning, and meantime reduce the labeling data.