xKR8pMVyFJv40vI0x65iDDyQ2pUhyeLJUwJDZDkwwfDOTPanmlGasa9VrNSt

黄德根Huang Degen

教授

 博士生导师  硕士生导师
学位:博士
性别:男
毕业院校:大连理工大学
所在单位:计算机科学与技术学院
Email :

论文成果

Improving Kernel-Based Protein-Protein Interaction Extraction by Unsupervised Word Representation

发布时间:2019-03-11 点击次数:

论文类型:会议论文
收录刊物:Scopus、SCIE、CPCI-S
页面范围:379-384
关键字:Protein-Protein Interaction; word representation; distributed representation; Brown clusters
摘要:As an important branch of biomedical information extraction, Protein-Protein Interaction extraction (PPIe) from biomedical literatures has been widely researched, and machine learning methods have achieved great success for this task. However, the word feature generally adopted in the existing methods suffers badly from vocabulary gap and data sparseness, weakening the classification performance. In this paper, the unsupervised word representation approach is introduced to address these problems. Three word representation methods are adopted to improve the performance of PPIe: distributed representation, vector clustering and Brown clusters representation. Experimental results show that our method outperforms the state-of-the-art methods on five publicly available corpora.