党延忠

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:男

毕业院校:大连理工大学

学位:博士

所在单位:系统工程研究所

学科:管理科学与工程. 系统工程

电子邮箱:yzhdang@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Autocorrection of noise text based on modularity optimization

点击次数:

论文类型:会议论文

发表时间:2007-09-05

收录刊物:EI

摘要:This paper brings forward an autocorrection algorithm for noise texts based on modularity optimization. By noise texts we mean those documents in text corpus being distributed to a wrong category. Firstly, the document-similarity network is constructed, in which each node represents a document. If two nodes are similar in content, they are connected with a weighted edge, and their similarity is the weight. Secondly, the categories constitute the corresponding community structure in the network. Modularity has been introduced as a measure to evaluate the quality of community structures. In this paper modularity is used to evaluate the quality of categorise. Finally, noise texts are autocorrected by optimizing the modularity. The experimental results indicate that this algorithm can effectively revise the noise texts. This algorithm can also be used in the preprocessing of text classification or taxonomy building. © 2007 IEEE.