Hits:
Indexed by:会议论文
Date of Publication:2012-01-01
Included Journals:CPCI-S、SCIE
Key Words:bio-event extraction; unlabeled data; data sparseness
Abstract:In biomedical event extraction domain, there is a small amount of labeled data along with a large pool of unlabeled data. Many supervised learning algorithms for bio-event extraction have been affected by the data sparseness. In this paper, we present a new solution to perform biomedical event extraction from scientific documents, applying a semi-supervised approach to extract features from unlabeled data using labeled data features as a reference. This strategy is evaluated via experiments in which the data from the BioNLP2011 and PubMed are applied. To the best of our knowledge, it is the first time that the combination of labeled and unlabeled data are used for biomedical event extraction and our experimental results demonstrate the state-of-the-art performance in this task.