夏昊翔
Paper type: Journal article
Publication date: 2006-12-01
Journal: JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING
Indexed in: SCIE, Scopus
Volume: 15
Issue: 4
Pages: 474-492
ISSN: 1004-3756
Keywords: Ant-based clustering; text clustering; ant movement rule; semantic similarity measure
Abstract: Ant-based text clustering is a promising technique that has attracted considerable research attention. This paper attempts to improve the standard ant-based text-clustering algorithm in two respects. First, an ontology-based semantic similarity measure is used in conjunction with the traditional vector-space-model-based measure to assess the similarity between documents more accurately. Second, the ant behavior model is modified to improve algorithmic performance. In particular, the ant movement rule is adjusted so that a laden ant is directed toward a dense area of items of the same type as the item it is carrying, and an unladen ant is directed toward an area containing an item that is dissimilar to the surrounding items within its Moore neighborhood. Using WordNet as the base ontology for assessing the semantic similarity between documents, the proposed algorithm is tested on a sample set of documents excerpted from the Reuters-21578 corpus. The experimental results partly indicate that the proposed algorithm performs better than the standard ant-based text-clustering algorithm and the k-means algorithm.
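The idea of using a semantic measure together with a vector-space-model measure can be illustrated with a minimal Python sketch. The abstract does not say how the two measures are combined, so the weighted sum and its weight `alpha` are assumptions made only for illustration. `TOY_SYNONYMS` and `toy_semantic_sim` are hypothetical stand-ins for a WordNet-based measure, not the paper's method.

```python
import math
from collections import Counter

# Hypothetical synonym table standing in for an ontology such as WordNet.
TOY_SYNONYMS = {"car": "automobile", "auto": "automobile"}


def cosine_similarity(doc_a, doc_b):
    """Vector-space-model similarity computed on raw term counts."""
    tf_a, tf_b = Counter(doc_a), Counter(doc_b)
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def toy_semantic_sim(doc_a, doc_b, synonyms=TOY_SYNONYMS):
    """Jaccard overlap after mapping terms to shared concepts (illustrative only)."""
    concepts_a = {synonyms.get(t, t) for t in doc_a}
    concepts_b = {synonyms.get(t, t) for t in doc_b}
    union = concepts_a | concepts_b
    return len(concepts_a & concepts_b) / len(union) if union else 0.0


def combined_similarity(doc_a, doc_b, semantic_sim=toy_semantic_sim, alpha=0.5):
    """Assumed combination: weighted sum of the VSM and semantic scores."""
    vsm = cosine_similarity(doc_a, doc_b)
    sem = semantic_sim(doc_a, doc_b)
    return alpha * vsm + (1 - alpha) * sem


if __name__ == "__main__":
    a = ["car", "price"]
    b = ["auto", "price"]
    print(cosine_similarity(a, b), toy_semantic_sim(a, b), combined_similarity(a, b))
```

In this toy case the two documents share only one literal term, so the vector-space score is moderate, while the concept-level score treats "car" and "auto" as the same item. The combined score falls between the two, which is the kind of correction a semantic measure is meant to provide.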