Jiang He (江贺)

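Personal Information

Professor

Doctoral Supervisor

Master's Supervisor

Main position: Vice Dean, School of Future Technology / School of Artificial Intelligence

Gender: Male

Alma mater: University of Science and Technology of China

Degree: Ph.D.

Affiliation: School of Software; International School of Information and Software

Contact: jianghe@dlut.edu.cn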

Publications

Data set homeomorphism transformation based meta-clustering

Paper type: Conference paper

Publication date: 2007-05-27

Indexed in: EI, CPCI-S

Volume: 4489

Issue: PART 3

Pages: 661-+

Keywords: clustering analysis; meta-clustering; data set homeomorphism transformation

Abstract: Clustering analysis is an important data mining technique with a variety of applications. In this paper, the data set is treated dynamically, and a Data Set Homeomorphism Transformation Based Meta-Clustering algorithm (DSHTBMC) is proposed. DSHTBMC decomposes the clustering task into multiple stages. It first constructs a series of homeomorphic data sets ranging from high regularity to low, and then iteratively clusters each data set in the series based on the clustering result of the preceding one. Because data sets of high regularity are easier to cluster, and the clustering result of each data set can be used to induce high-quality clusters in the next, the hardness of the problem is reduced. Two strategies for data set homeomorphism transformation, Displacement and Noising, are proposed. With the classical hierarchical divisive method Bisecting k-means as DSHTBMC's subordinate clustering algorithm, two new clustering algorithms, HD-DSHTBMC-D and HD-DSHTBMC-N, are obtained. Experimental results indicate that the new algorithms are remarkably better than Bisecting k-means in terms of clustering quality.
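The staged idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not specify how the Displacement or Noising transformations work, so `displaced_series` (snapping points to a grid of width `cell` and blending back toward the original data) is a hypothetical stand-in. Plain k-means (Lloyd iterations) replaces Bisecting k-means as the subordinate algorithm to keep the example short, and the round-robin starting labelling is likewise an assumption. What the sketch does show is the chaining: each data set in the series is clustered starting from the labels obtained on the preceding, more regular one.

```python
def centroids_from_labels(points, labels, k):
    """Mean of each cluster; an empty cluster falls back to a data point."""
    dims = len(points[0])
    sums = [[0.0] * dims for _ in range(k)]
    counts = [0] * k
    for p, c in zip(points, labels):
        counts[c] += 1
        for d in range(dims):
            sums[c][d] += p[d]
    return [
        [s / counts[c] for s in sums[c]] if counts[c] else list(points[c % len(points)])
        for c in range(k)
    ]


def assign(points, centroids):
    """Label each point with its nearest centroid (squared Euclidean distance)."""
    return [
        min(range(len(centroids)),
            key=lambda c: sum((x - y) ** 2 for x, y in zip(p, centroids[c])))
        for p in points
    ]


def kmeans_from_labels(points, labels, k, max_iter=50):
    """Lloyd iterations seeded by an existing labelling (assumed subordinate step)."""
    for _ in range(max_iter):
        new_labels = assign(points, centroids_from_labels(points, labels, k))
        if new_labels == labels:
            break
        labels = new_labels
    return labels


def displaced_series(points, stages, cell=4.0):
    """Hypothetical transformation: blend each point between its grid-snapped
    position (high regularity) and its true position (low regularity).
    Point order is preserved, so labels carry over between data sets."""
    series = []
    for t in range(stages):
        a = t / (stages - 1) if stages > 1 else 1.0  # 0.0 = regular, 1.0 = original
        series.append([
            [(1 - a) * (round(x / cell) * cell) + a * x for x in p]
            for p in points
        ])
    return series


def meta_cluster(points, k, stages=5):
    """Cluster each data set in the series, starting from the previous result."""
    labels = [i % k for i in range(len(points))]  # arbitrary round-robin start
    for stage_points in displaced_series(points, stages):
        labels = kmeans_from_labels(stage_points, labels, k)
    return labels
```

Calling `meta_cluster(points, k=2, stages=3)` on a list of coordinate lists returns one cluster index per point. Because every data set in the series keeps the points in the same order, the labels from one stage can seed the next directly.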