卢伟

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:男

毕业院校:大连理工大学

学位:博士

所在单位:控制科学与工程学院

学科:控制理论与控制工程. 模式识别与智能系统. 检测技术与自动化装置

办公地点:创新园大厦B707

电子邮箱:luwei@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Fuzzy C-Means clustering of incomplete data based on probabilistic information granules of missing values

点击次数:

论文类型:期刊论文

发表时间:2016-05-01

发表刊物:KNOWLEDGE-BASED SYSTEMS

收录刊物:SCIE、EI、Scopus

卷号:99

页面范围:51-70

ISSN号:0950-7051

关键字:Fuzzy clustering; Incomplete data; Missing value; Probabilistic information granules; Alternating optimization

摘要:Missing values are a common phenomenon when dealing with real-world data sets. Analysis of incomplete data sets has become an active area of research. In this paper, we focus on the problem of clustering incomplete data, which is intended to introduce some prior distribution information of the missing values into the algorithm of fuzzy clustering. First, non-parametric hypothesis testing is employed to describe the missing values adhering to a certain Gaussian distribution as probabilistic information granules based on the nearest neighbors of incomplete data. Second, we propose a novel clustering model, in which probabilistic information granules of missing values are incorporated into the Fuzzy C-Means clustering of incomplete data by involving the maximum likelihood criterion. Third, the clustering model is optimized by using a tri-level alternating optimization utilizing the method of Lagrange multipliers. The convergence and the time complexity of the clustering algorithm are also discussed. The experiments reported both on synthetic and real-world data sets demonstrate that the proposed approach can effectively realize clustering of incomplete data. (C) 2016 Elsevier B.V. All rights reserved.