Interval kernel Fuzzy C-Means clustering of incomplete data

Indexed by:Journal article

Date of Publication:2017-05-10

Journal:NEUROCOMPUTING

Included Journals:SCIE, EI, Scopus

Volume:237

Page Number:316-331

ISSN No.:0925-2312

Key Words:Incomplete data; Nearest neighbor interval; Kernel Fuzzy C-Means; Interval kernel distance

Abstract:In the clustering of incomplete data, two issues are of central concern: how missing attribute values are handled and how the clustering is optimized. In this paper, a novel clustering method is proposed to cope with incomplete data. Because missing values are uncertain, we first estimate them as intervals using the nearest neighbor method, which exploits information about the distribution of the data and transforms the incomplete data set into an interval-valued one. A kernel method is then introduced to increase the separability of the data by implicitly mapping them into a higher-dimensional feature space; a kernel-induced distance replaces the Euclidean distance, so that the data can still be processed in the original data space. Kernel clustering of the incomplete data set is realized by a gradient-based alternating optimization for interval data clustering, built on the interval kernel distance. Finally, experimental results demonstrate that the proposed approach is superior in terms of clustering performance.
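
The two building blocks named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation, and it rests on several assumptions that the abstract does not state: missing values are marked as NaN, neighbors are ranked by a partial distance over the attributes both samples share, the interval for a missing attribute is the min and max of that attribute over the `q` nearest neighbors, and the kernel is Gaussian. The function names and the parameters `q` and `sigma` are hypothetical. The interval kernel distance and the gradient-based alternating optimization are not reproduced here.

```python
import numpy as np


def partial_distance(a, b):
    """Distance over attributes observed in both samples (NaN marks missing)."""
    shared = ~np.isnan(a) & ~np.isnan(b)
    if not shared.any():
        return np.inf  # nothing to compare on
    # Rescale so distances over different numbers of shared attributes are comparable.
    scale = a.size / shared.sum()
    return np.sqrt(scale * np.sum((a[shared] - b[shared]) ** 2))


def nearest_neighbor_intervals(X, q=3):
    """Turn an incomplete data set into interval-valued data (assumed scheme).

    Returns (lower, upper). Observed entries become degenerate intervals
    (lower == upper); each missing entry gets [min, max] of that attribute
    over the q nearest neighbors that have it observed.
    """
    X = np.asarray(X, dtype=float)
    lower, upper = X.copy(), X.copy()
    n = X.shape[0]
    for i in range(n):
        missing = np.isnan(X[i])
        if not missing.any():
            continue
        dists = np.array(
            [np.inf if k == i else partial_distance(X[i], X[k]) for k in range(n)]
        )
        order = np.argsort(dists)
        for j in np.flatnonzero(missing):
            # Closest neighbors first, keeping only those with attribute j observed.
            cands = [
                k for k in order if np.isfinite(dists[k]) and not np.isnan(X[k, j])
            ][:q]
            if cands:  # otherwise the entry stays NaN
                lower[i, j] = X[cands, j].min()
                upper[i, j] = X[cands, j].max()
    return lower, upper


def gaussian_kernel_distance_sq(x, v, sigma=1.0):
    """Squared kernel-induced distance ||phi(x) - phi(v)||^2 for a Gaussian kernel.

    Since K(x, x) = 1 for the Gaussian kernel, this reduces to 2 * (1 - K(x, v)).
    """
    x, v = np.asarray(x, dtype=float), np.asarray(v, dtype=float)
    k = np.exp(-np.sum((x - v) ** 2) / (2.0 * sigma ** 2))
    return 2.0 * (1.0 - k)


if __name__ == "__main__":
    X = [[1.0, 2.0], [1.1, np.nan], [0.8, 2.2], [5.0, 9.0], [1.15, 1.8]]
    lo, hi = nearest_neighbor_intervals(X, q=2)
    print(lo[1, 1], hi[1, 1])  # interval estimated for the missing entry
    print(gaussian_kernel_distance_sq([0.0, 0.0], [1.0, 1.0]))
```

The kernel-induced distance is evaluated directly on the original vectors, which is what lets the data be processed in the original space while the comparison takes place implicitly in the feature space.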