• 更多栏目

    赵哲焕

    • 副教授       硕士生导师
    • 性别:男
    • 毕业院校:大连理工大学
    • 学位:博士
    • 所在单位:软件学院、国际信息与软件学院
    • 学科:软件工程
    • 办公地点:大连理工大学,开发区校区,综合楼317
    • 电子邮箱:z.zhao@dlut.edu.cn

    访问量:

    开通时间:..

    最后更新时间:..

    A Hybrid Protein-Protein Interaction Triple Extraction Method for Biomedical Literature

    点击次数:

    论文类型:会议论文

    发表时间:2017-01-01

    收录刊物:SCIE、CPCI-S、Scopus

    卷号:2017-January

    页面范围:1515-1521

    关键字:protein protein interaction triple extraction; interaction word extraction; protein named entity recognition

    摘要:Protein-protein interaction extraction research can be widely applied to the field of life science research. However, most of the machine learning based methods focus on binary PPI relation extraction, which loses rich relationship type information that is critical to the PPIs study. The rule based open information extraction methods can extract the PPI triple (i.e. "protein1, interaction word, protein2"), but suffers from low recall rate problem. In this paper, we propose a hybrid protein-protein interaction triple extraction method. In this method, firstly, machine learning techniques are used to recognize protein entities and extract relational protein pairs. Then, the syntactic patterns and a dictionary are employed to find out corresponding interaction words that represent the relationships between two proteins. This method obtains an F-score of 40.18% on the AImed corpus, which is much higher than the result achieved by the rule based Stanford open information extraction method.