的个人主页 http://faculty.dlut.edu.cn/1989011035/zh_CN/index.htm
点击次数:
论文类型:会议论文
发表时间:2008-10-12
收录刊物:EI、CPCI-S、Scopus
页面范围:5006-5009
关键字:document representation; graph structure; Chinese document; clustering
摘要:In this paper, we propose a graph-structure-based method to represent knowledge for Chinese document clustering. First, we introduce a new knowledge representation method called Graph Space Model (GSM) to convert each document to a graph structure, and then we adopt Maximum Common Subgraph (MCS) to compute the similarities between any two graph structures, which can be further used for document clustering. The results show that the GSM approach can outperform VSM method in representing capability of Chinese documents.