教授 博士生导师 硕士生导师
性别: 男
毕业院校: 中国科技大学
学位: 博士
所在单位: 软件学院、国际信息与软件学院
学科: 计算机应用技术. 软件工程
电子邮箱: xczhang@dlut.edu.cn
开通时间: ..
最后更新时间: ..
点击次数:
论文类型: 期刊论文
发表时间: 2016-08-01
发表刊物: ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA
收录刊物: SCIE
卷号: 11
期号: 1
ISSN号: 1556-4681
关键字: 2 Spectral clustering; Nystrom extension; incremental sampling; clusterability analysis; loss analysis
摘要: Sampling is the key aspect for Nystrom extension based spectral clustering. Traditional sampling schemes select the set of landmark points on a whole and focus on how to lower the matrix approximation error. However, the matrix approximation error does not have direct impact on the clustering performance. In this article, we propose a sampling framework from an incremental perspective, i.e., the landmark points are selected one by one, and each next point to be sampled is determined by previously selected landmark points. Incremental sampling builds explicit relationships among landmark points; thus, they work together well and provide a theoretical guarantee on the clustering performance. We provide two novel analysis methods and propose two schemes for selecting-the-next-one of the framework. The first scheme is based on clusterability analysis, which provides a better guarantee on clustering performance than schemes based on matrix approximation error analysis. The second scheme is based on loss analysis, which provides maximized predictive ability of the landmark points on the (implicit) labels of the unsampled points. Experimental results on a wide range of benchmark datasets demonstrate the superiorities of our proposed incremental sampling schemes over existing sampling schemes.