• 更多栏目

    顾宏

    • 教授     博士生导师   硕士生导师
    • 性别:男
    • 毕业院校:浙江大学
    • 学位:博士
    • 所在单位:控制科学与工程学院
    • 学科:模式识别与智能系统
    • 办公地点:创新园大厦B0715
    • 电子邮箱:guhong@dlut.edu.cn

    访问量:

    开通时间:..

    最后更新时间:..

    A hybrid genetic algorithm-fuzzy c-means approach for incomplete data clustering based on nearest-neighbor intervals

    点击次数:

    论文类型:期刊论文

    发表时间:2013-10-01

    发表刊物:SOFT COMPUTING

    收录刊物:SCIE、EI、Scopus

    卷号:17

    期号:10

    页面范围:1787-1796

    ISSN号:1432-7643

    关键字:Fuzzy clustering; Hybrid approach; Incomplete data; Nearest-neighbor interval

    摘要:Incomplete data are often encountered in data sets used in clustering problems, and inappropriate treatment of incomplete data can significantly degrade the clustering performance. In view of the uncertainty of missing attributes, we put forward an interval representation of missing attributes based on nearest-neighbor information, named nearest-neighbor interval, and a hybrid approach utilizing genetic algorithm and fuzzy c-means is presented for incomplete data clustering. The overall algorithm is within the genetic algorithm framework, which searches for appropriate imputations of missing attributes in corresponding nearest-neighbor intervals to recover the incomplete data set, and hybridizes fuzzy c-means to perform clustering analysis and provide fitness metric for genetic optimization simultaneously. Several experimental results on a set of real-life data sets are presented to demonstrate the better clustering performance of our hybrid approach over the compared methods.