扫描手机二维码

欢迎您的访问
您是第 位访客

开通时间:..

最后更新时间:..

  • 吴江宁 ( 教授 )

    的个人主页 http://faculty.dlut.edu.cn/1989011035/zh_CN/index.htm

  •   教授   硕士生导师
论文成果 当前位置: 中文主页 >> 科学研究 >> 论文成果
Search Results Clustering in Chinese Context Based on a New Suffix Tree

点击次数:
论文类型:会议论文
发表时间:2008-07-08
收录刊物:EI、CPCI-S、Scopus
页面范围:110-115
摘要:Searching for information by search engines has been gaining popularity in recent years. However, results returned by most Chinese Web search engines usually reach up to thousands or even millions documents, so search results clustering is of critical need for on-line grouping of similar documents to improve user experience while searching collections of Web pages and facilitate browsing Chinese Web pages in a more compact and thematic form. This paper presents a new Suffix Tree Clustering (STC) algorithm for Web search results clustering, which is more suitable for Chinese context. It is built in terms of Chinese words, of which meaningless phrases are ignored by an efficient strategy we proposed Meanwhile the Chinese synonymy is introduced into the suffix free to improve the quality of the clusters. Experiments show that the proposed novel STC algorithm has a better performance in precision and speed than original STC.

 

辽ICP备05001357号 地址:中国·辽宁省大连市甘井子区凌工路2号 邮编:116024
版权所有:大连理工大学