黄德根Huang Degen

教授

 博士生导师  硕士生导师
学位:博士
性别:男
毕业院校:大连理工大学
所在单位:计算机科学与技术学院
Email :

论文成果

Training Word Embeddings for Deep Learning in Biomedical Text Mining Tasks

发布时间:2019-03-11 点击次数:

论文类型:会议论文
收录刊物:Scopus、CPCI-S、EI
页面范围:625-628
关键字:biomedical word embeddings; word representation; deep learning; biomedical text mining
摘要:Most word embedding methods are proposed with general purpose which take a word as a basic unit and learn embeddings according to words' external contexts. However, in biomedical text mining, there are many biomedical entities and syntactic chunks which contain rich domain information, and the semantic meaning of a word is also strongly related to those information. Hence, we present a biomedical domain-specific word embedding model by incorporating stem, chunk and entity to train word embeddings. We also present two deep learning architectures respectively for two biomedical text mining tasks, by which we evaluate our word embeddings and compare them with other models. Experimental results show that our biomedical domain-specific word embeddings overall outperform other general-purpose word embeddings in these deep learning methods for biomedical text mining tasks.