黄德根Huang Degen

(教授)

 博士生导师  硕士生导师
学位:博士
性别:男
毕业院校:大连理工大学
所在单位:计算机科学与技术学院
电子邮箱:huangdg@dlut.edu.cn

论文成果

A general protein-protein interaction extraction architecture based on word representation and feature selection

发表时间:2019-03-09 点击次数:

论文名称:A general protein-protein interaction extraction architecture based on word representation and feature selection
论文类型:期刊论文
发表刊物:INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS
收录刊物:SCIE、Scopus
卷号:14
期号:3
页面范围:276-291
ISSN号:1748-5673
关键字:instance representation; word representation; protein-protein interaction; relation extraction; biomedical text mining
摘要:Previous researches have shown that supervised Protein-Protein Interaction Extraction (PPIE) can get high accuracies with elaborately selected features and kernels. However, most features and kernels rest upon domain knowledge and natural language analysis, which makes the supervised model expensive, heavy and brittle. Moreover, commonly used representation techniques, such as one-hot encoding and Vector Space Model, fail to capture the semantic similarity between words. To reduce the manual labour and take advantage of semantic representation, we put forward a general instance representation architecture for PPIE, which integrates word representation, vector composition and feature selection. Our method obtains F-scores of 69.7, 78.8, 72.3, 72.0 and 83.7 on AIMed, BioInfer, HPRD50, IEPA and LLL respectively.
发表时间:2016-01-01