Hits:
Indexed by:期刊论文
Date of Publication:2017-05-10
Journal:NEUROCOMPUTING
Included Journals:SCIE、EI、Scopus
Volume:237
Page Number:316-331
ISSN No.:0925-2312
Key Words:Incomplete data; Nearest neighbor interval; Kernel Fuzzy C-Means; Interval kernel distance
Abstract:In the clustering of incomplete data, the processing of missing attribute values and the optimization procedure of clustering are always of concern. In this paper, a novel clustering method is proposed to cope with incomplete data. Owing to the uncertainty of missing values, we first estimate these values in the form of intervals using the nearest neighbor method, which utilizes information about the distribution of data and transforms incomplete data set into an interval-valued one. Then, a kernel method is introduced to increase the separability between data by implicitly mapping them into a higher dimensional feature space, in which a kernel-induced distance is used to replace the Euclidean distance so that the data can be processed in the original data space. We realize the kernel clustering of incomplete data set by means of a gradient-based alternating optimization of interval data clustering based on the interval kernel distance. Finally, the experimental results demonstrate that the proposed approach is superior in terms of its clustering performance.