location: Current position: Home >> Scientific Research >> Paper Publications

Fuzzy c-means clustering of partially missing data sets based on statistical representation

Hits:

Indexed by:会议论文

Date of Publication:2010-01-01

Included Journals:EI、Scopus

Volume:1

Page Number:460-464

Abstract:The fuzzy c-means algorithm is a useful technique for clustering real s-dimensional data, but it can not be directly used for partially missing data sets. In this paper, the problem of missing data handling for fuzzy clustering is considered, and a statistical representation of missing attributes is proposed. The approach reduces the statistical analysis of missing attributes to the subsets of the dataset with similar data of incomplete data to impute the missing attributes, thus is helpful in enhancing the learning of missing attributes and the performance of fuzzy clustering based on the recovered data. Comparisons and analysis of the clustering results of the incomplete IRIS data demonstrate that the proposed statistical representation can estimate missing attributes rationally and improve the fuzzy cmeans clustering of incomplete data. ?2010 IEEE.

Pre One:多频率方波激励阻容解耦软测量的数值模拟

Next One:Collaborative optimization of clustering by fuzzy c-means and weight determination by reliefF