个人信息Personal Information
教授
博士生导师
硕士生导师
性别:女
毕业院校:大连理工大学
学位:博士
所在单位:计算机科学与技术学院
学科:计算机应用技术. 计算机软件与理论
办公地点:创新大厦A930
电子邮箱:lils@dlut.edu.cn
A Two-Phase Bio-NER System Based on Integrated Classifiers and Multiagent Strategy
点击次数:
论文类型:期刊论文
发表时间:2013-07-01
发表刊物:IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS
收录刊物:SCIE、EI、PubMed、Scopus
卷号:10
期号:4
页面范围:897-904
ISSN号:1545-5963
关键字:Named entity recognition and classification; two-layer stacking method; multiagent; bioinformatics
摘要:Biomedical named entity recognition (Bio-NER) is a fundamental step in biomedical text mining. This paper presents a two-phase Bio-NER model targeting at JNLPBA task. Our two-phase method divides the task into two subtasks: named entity detection (NED) and named entity classification (NEC). The NED subtask is accomplished based on the two-layer stacking method in the first phase, where named entities (NEs) are distinguished from nonnamed-entities (NNEs) in biomedical literatures without identifying their types. Then six classifiers are constructed by four toolkits (CRF++, YamCha, maximum entropy, Mallet) with different training methods and integrated based on the two-layer stacking method. In the second phase for the NEC subtask, the multiagent strategy is introduced to determine the correct entity type for entities identified in the first phase. The experiment results show that the presented approach can achieve an F-score of 76.06 percent, which outperforms most of the state-of-the-art systems.