李丹

个人信息Personal Information

副教授

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:控制科学与工程学院

办公地点:大连理工大学创新园大厦A716

电子邮箱:ldan@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

A fuzzy c-means clustering algorithm based on pseudo-nearest-neighbor intervals for incomplete data

点击次数:

论文类型:期刊论文

发表时间:2015-03-15

发表刊物:Journal of Computational Information Systems

收录刊物:EI、Scopus

卷号:11

期号:6

页面范围:2139-2146

ISSN号:15539105

摘要:Missing data handling is a challenging issue often dealt with in data mining and pattern classification. In this paper, a fuzzy c-means clustering algorithm based on pseudo-nearest-neighbor intervals for incomplete data is given. The data are first completed using the pseudo-nearest-neighbor intervals approach, then the data set can be clustered based on the fuzzy c-means algorithm for interval-valued data. The proposed algorithm estimates the missing attribute values without normalization, thus captures the essence of pattern similarities in the original untouched data set. Additionally, the pseudo-nearest-neighbor intervals representation takes account of implicit uncertainly of missing attribute values, and considers the angle between incomplete data and other data as well. Results on several incomplete data sets demonstrate the effectiveness of the proposed algorithm. Copyright ? 2015 Binary Information Press.