个人信息Personal Information
副教授
硕士生导师
任职 : AI+教育研究所所长
性别:女
毕业院校:大连理工大学
学位:博士
所在单位:软件学院、国际信息与软件学院
学科:软件工程. 人工智能
电子邮箱:hongyu@dlut.edu.cn
Constraint Based Subspace Clustering for High Dimensional Uncertain Data
点击次数:
论文类型:会议论文
发表时间:2016-01-01
收录刊物:CPCI-S
卷号:9652
页面范围:271-282
摘要:Both uncertain data and high-dimensional data pose huge challenges to traditional clustering algorithms. It is even more challenging for clustering high dimensional uncertain data and there are few such algorithms. In this paper, based on the classical FINDIT subspace clustering algorithm for high dimensional data, we propose a constraint based semi-supervised subspace clustering algorithm for high dimensional uncertain data, UFINDIT. We extend both the distance functions and dimension voting rules of FINDIT to deal with high dimensional uncertain data. Since the soundness criteria of FINDIT fails for uncertain data, we introduce constraints to solve the problem. We also use the constraints to improve FINDIT in eliminating parameters' effect on the process of merging medoids. Furthermore, we propose some methods such as sampling to get an more efficient algorithm. Experimental results on synthetic and real data sets show that our proposed UFINDIT algorithm outperforms the existing subspace clustering algorithm for uncertain data.