郭崇慧

个人信息Personal Information

教授

博士生导师

硕士生导师

主要任职:Director of Institute of Systems Engineering

其他任职:大连市数据科学与知识管理重点实验室主任

性别:男

毕业院校:大连理工大学

学位:博士

所在单位:系统工程研究所

学科:管理科学与工程. 系统工程

办公地点:经济管理学院D337室

联系方式:0411-84708007

电子邮箱:dlutguo@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Fast affinity propagation clustering based on incomplete similarity matrix

点击次数:

论文类型:期刊论文

发表时间:2017-06-01

发表刊物:KNOWLEDGE AND INFORMATION SYSTEMS

收录刊物:SCIE

卷号:51

期号:3

页面范围:941-963

ISSN号:0219-1377

关键字:Exemplar-based clustering; Affinity propagation; Incomplete similarity matrix; Fast algorithm

摘要:Affinity propagation (AP) is a recently proposed clustering algorithm, which has been successful used in a lot of practical problems. Although effective in finding meaningful clustering solutions, a key disadvantage of AP is its efficiency, which has become the bottleneck when applying AP for large-scale problems. In the literature, most of the methods proposed to improve the efficiency of AP are based on implementing the message-passing on a sparse similarity matrix, while neither the decline in effectiveness nor the improvement in efficiency is theoretically analyzed. In this paper, we propose a two-stage fast affinity propagation (FastAP) algorithm. Different from previous work, the scale of the similarity matrix is first compressed by selecting only potential exemplars, then further reduced by sparseness according to k nearest neighbors. More importantly, we provide theoretical analysis, based on which the improvement of efficiency in our method is controllable with guaranteed clustering performance. In experiments, two synthetic data sets, seven publicly available data sets, and two real-world streaming data sets are used to evaluate the proposed method. The results demonstrate that FastAP can achieve comparable clustering performances with the original AP algorithm, while the computational efficiency has been improved with a several-fold speed-up on small data sets and a dozens-of-fold on larger-scale data sets.