郭崇慧

个人信息Personal Information

教授

博士生导师

硕士生导师

主要任职:Director of Institute of Systems Engineering

其他任职:大连市数据科学与知识管理重点实验室主任

性别:男

毕业院校:大连理工大学

学位:博士

所在单位:系统工程研究所

学科:管理科学与工程. 系统工程

办公地点:经济管理学院D337室

联系方式:0411-84708007

电子邮箱:dlutguo@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Non-unique cluster numbers determination methods based on stability in spectral clustering

点击次数:

论文类型:期刊论文

发表时间:2013-08-01

发表刊物:KNOWLEDGE AND INFORMATION SYSTEMS

收录刊物:SCIE、Scopus

卷号:36

期号:2

页面范围:439-458

ISSN号:0219-1377

关键字:Spectral clustering method; Cluster numbers; Multiway normalized cut criterion; Stability

摘要:Recently, a large amount of work has been devoted to the study of spectral clustering-a simple yet powerful method for finding structure in a data set using spectral properties of an associated pairwise similarity matrix. Most of the existing spectral clustering algorithms estimate only one cluster number or estimate non-unique cluster numbers based on eigengap criterion. However, the number of clusters not always exists one, and eigengap criterion lacks theoretical justification. In this paper, we propose non-unique cluster numbers determination methods based on stability in spectral clustering (NCNDBS). We first utilize the multiway normalized cut spectral clustering algorithm to cluster data set for a candidate cluster number . Then the ratio value of the multiway normalized cut criterion of the obtained clusters and the sum of the leading eigenvalues (descending sort) of the stochastic transition matrix is chosen as a standard to decide whether the is a reasonable cluster number. At last, by varying the scaling parameter in the Gaussian function, we judge whether the reasonable cluster number is also a stability one. By three stages, we can determine non-unique cluster numbers of a data set. The Lumpability theorem concluded by Meil and Xu provides a theoretical base for our methods. NCNDBS can estimate non-unique cluster numbers of the data set successfully by illustrative experiments.