王健

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:计算机科学与技术学院

学科:计算机应用技术

办公地点:创新园大厦B811

联系方式:0411-84706009-2811

电子邮箱:wangjian@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Combining Labeled and Unlabeled Data For Biomedical Event Extraction

点击次数:

论文类型:会议论文

发表时间:2012-01-01

收录刊物:CPCI-S、SCIE

关键字:bio-event extraction; unlabeled data; data sparseness

摘要:In biomedical event extraction domain, there is a small amount of labeled data along with a large pool of unlabeled data. Many supervised learning algorithms for bio-event extraction have been affected by the data sparseness. In this paper, we present a new solution to perform biomedical event extraction from scientific documents, applying a semi-supervised approach to extract features from unlabeled data using labeled data features as a reference. This strategy is evaluated via experiments in which the data from the BioNLP2011 and PubMed are applied. To the best of our knowledge, it is the first time that the combination of labeled and unlabeled data are used for biomedical event extraction and our experimental results demonstrate the state-of-the-art performance in this task.