Current position: Home >> Scientific Research >> Paper Publications

Interval kernel Fuzzy C-Means clustering of incomplete data

Release Time:2019-03-12  Hits:

Indexed by: Journal Article

Date of Publication: 2017-05-10

Journal: NEUROCOMPUTING

Included Journals: Scopus、EI、SCIE

Volume: 237

Page Number: 316-331

ISSN: 0925-2312

Key Words: Incomplete data; Nearest neighbor interval; Kernel Fuzzy C-Means; Interval kernel distance

Abstract: In the clustering of incomplete data, the processing of missing attribute values and the optimization procedure of clustering are always of concern. In this paper, a novel clustering method is proposed to cope with incomplete data. Owing to the uncertainty of missing values, we first estimate these values in the form of intervals using the nearest neighbor method, which utilizes information about the distribution of data and transforms incomplete data set into an interval-valued one. Then, a kernel method is introduced to increase the separability between data by implicitly mapping them into a higher dimensional feature space, in which a kernel-induced distance is used to replace the Euclidean distance so that the data can be processed in the original data space. We realize the kernel clustering of incomplete data set by means of a gradient-based alternating optimization of interval data clustering based on the interval kernel distance. Finally, the experimental results demonstrate that the proposed approach is superior in terms of its clustering performance.

Prev One:Fuzzy C-Means clustering based on dual expression between cluster prototypes and reconstructed data

Next One:Research on Central Heating System Control Strategy Based on Genetic Algorithm