王健

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:计算机科学与技术学院

学科:计算机应用技术

办公地点:创新园大厦B811

联系方式:0411-84706009-2811

电子邮箱:wangjian@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking

点击次数:

论文类型:期刊论文

发表时间:2019-01-01

发表刊物:JOURNAL OF INTELLIGENT & FUZZY SYSTEMS

收录刊物:SCIE

卷号:37

期号:4

页面范围:4797-4802

ISSN号:1064-1246

关键字:Intelligent; text summarization; graph-based ranking; similarity calculation

摘要:With the rapid development of clinical and laboratory medicine, the field of bioinformatics boasts of extensive clinical records and research literature. Retrieving effective information from this huge data has become a challenging task. Hence, Intelligent text summarization, which enables users to find and understand relevant source texts more quickly and effortlessly, becomes a very significant and valuable field of research. In this study, we propose an improved TextRank algorithm with weight calculation based on sentence graph to solve this problem. For the experimental dataset obtained from Pubmed, we represent terms as vectors by using Skip-gram model. We design three methods which utilize word embeddings to calculate weights between sentences. Then we build an undirected graph with sentences as nodes. At last, we use the improved TextRank algorithm to calculate the importance of sentences and further generated summarizations base on its ranking. The experimental results and analysis on the datasets demonstrate the effectiveness of the proposed model.