郭崇慧 (Guo Chonghui)

Personal Information

Professor

Doctoral Supervisor

Master's Supervisor

Main position: Director of the Institute of Systems Engineering

Other positions: Director of the Dalian Key Laboratory of Data Science and Knowledge Management

Gender: Male

Alma mater: Dalian University of Technology

Degree: Doctorate

Affiliation: Institute of Systems Engineering

Discipline: Management Science and Engineering; Systems Engineering

Office: Room D337, School of Economics and Management

Contact: 0411-84708007

Email: dlutguo@dlut.edu.cn

Publications

Textual Topic Evolution Analysis Based on Term Co-occurrence: A Case Study on the Government Work Report of the State Council (1954-2017)

Paper type: Conference paper

Publication date: 2017-01-01

Indexed in: EI, CPCI-S

Keywords: the Government Work Report of the State Council; word co-occurrence; text mining; topic evolution

Abstract: The government work report of the State Council is a comprehensive policy text. This paper uses text mining to carry out a multi-granularity, multi-level quantitative analysis of the government work reports, which is of practical and instructive value in helping relevant personnel understand the evolution of domain knowledge in a short time. First, the text is preprocessed using a Chinese word segmentation tool combined with three kinds of dictionaries built by the authors: a domain word dictionary, a domain synonym dictionary, and a domain stopword dictionary. Then, based on the co-occurrence information of words in the government work reports, we conduct topic modeling on the corpus consisting of all the government work reports and on each single government work report, respectively. We find 12 latent topics for the corpus, such as "Economic reform", "Agriculture", "Government construction", and "Defense military". Based on the 12 topics, we conduct topic modeling on every single government work report, with which topic evolution analysis is carried out over the whole period covered by the reports.
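
The preprocessing and co-occurrence steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `SYNONYMS` and `STOPWORDS` tables are small hypothetical stand-ins for the domain dictionaries, the input is assumed to be already segmented into tokens, and co-occurrence is counted at the document level, which is only one possible choice of window.

```python
from collections import Counter
from itertools import combinations

# Hypothetical stand-ins for the domain synonym and stopword dictionaries.
# The real dictionaries were built by the authors and are not reproduced here.
SYNONYMS = {"farming": "agriculture"}
STOPWORDS = {"the", "of", "and"}


def preprocess(tokens):
    """Map synonyms to a canonical term, then drop stopwords."""
    normalized = (SYNONYMS.get(t, t) for t in tokens)
    return [t for t in normalized if t not in STOPWORDS]


def cooccurrence_counts(documents):
    """Count, for each unordered term pair, how many documents contain both terms."""
    counts = Counter()
    for tokens in documents:
        # Deduplicate and sort so each pair has one canonical ordering.
        terms = sorted(set(preprocess(tokens)))
        counts.update(combinations(terms, 2))
    return counts


# Toy, already-segmented documents (assumed input, not data from the paper).
docs = [
    ["the", "reform", "of", "agriculture"],
    ["farming", "and", "reform"],
    ["defense", "and", "reform"],
]
counts = cooccurrence_counts(docs)
```

The resulting pair counts could then serve as input to a topic model or a term network; which model the paper uses on top of the co-occurrence information is not stated in the abstract.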