大连理工大学  登录  English 
张宪超
点赞:

教授   博士生导师   硕士生导师

主要任职: 国防(先进)科学技术发展研究院副院长

性别: 男

毕业院校: 中国科技大学

学位: 博士

在职信息:在职

所在单位: 软件学院

学科: 计算机应用技术 软件工程

电子邮箱:

手机版

访问量:

开通时间 : ..

最后更新时间: ..

当前位置: 中文主页 >> 科学研究 >> 论文成果
Self-adapted mixture distance measure for clustering uncertain data

点击量:

论文类型: 期刊论文

第一作者: Liu, Han

通讯作者: Liu, H (reprint author), Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China.; Liu, H (reprint author), Dalian Univ Technol, Sch Software, Dalian 116020, Peoples R China.

合写作者: Zhang, Xianchao,Zhang, Xiaotong,Cui, Yi

发表时间: 2017-06-15

发表刊物: KNOWLEDGE-BASED SYSTEMS

收录刊物: SCIE、EI、Scopus

卷号: 126

页面范围: 33-47

ISSN号: 0950-7051

关键字: Clustering; Uncertain data; Induced kernel distance; Jensen-Shannon divergence; Self-adapted mixture distance measure

摘要: Distance measure plays an important role in clustering uncertain data. However, existing distance measures for clustering uncertain data suffer from some issues. Geometric distance measure can not identify the difference between uncertain objects with different distributions heavily overlapping in locations. Probability distribution distance measure can not distinguish the difference between different pairs of completely separated uncertain objects. In this paper, we propose a self-adapted mixture distance measure for clustering uncertain data which considers the geometric distance and the probability distribution distance simultaneously, thus overcoming the issues in previous distance measures. The proposed distance measure consists of three parts: (1) The induced kernel distance: it can be used to measure the geometric distance between uncertain objects. (2) The Jensen-Shannon divergence: it can be used to measure the probability distribution distance between uncertain objects. (3) The self-adapted weight parameter: it can be used to adjust the importance degree of the induced kernel distance and the Jensen Shannon divergence according to the location overlapping information of the dataset. The proposed distance measure is symmetric, finite and parameter adaptive. Furthermore, we integrate the self-adapted mixture distance measure into the partition-based and density-based algorithms for clustering uncertain data. Extensive experimental results on synthetic datasets, real benchmark datasets and real world uncertain datasets show that our proposed distance measure outperforms the existing distance measures for clustering uncertain data. (C) 2017 Elsevier B.V. All rights reserved.

辽ICP备05001357号 地址:中国·辽宁省大连市甘井子区凌工路2号 邮编:116024
版权所有:大连理工大学