location: Current position: Home >> Scientific Research >> Paper Publications

High-order possibilistic c-means algorithms based on tensor decompositions for big data in IoT

Hits:

Indexed by:期刊论文

Date of Publication:2018-01-01

Journal:INFORMATION FUSION

Included Journals:Scopus、SCIE、EI、ESI高被引论文、ESI热点论文

Volume:39

Page Number:72-80

ISSN No.:1566-2535

Key Words:Big data; IoT; Possibilistic c-means clustering; Canonical polyadic decomposition; Tensor-train network

Abstract:Internet of Things (IoT) connects the physical world and the cyber world to offer intelligent services by data mining for big data. Each big data sample typically involves a large number of attributes, posing a remarkable challenge on the high-order possibilistic c-means algorithm (HOPCM). Specially, HOPCM requires high-performance servers with a large-scale memory and a powerful computing unit, to cluster big samples, limiting its applicability in IoT systems with low-end devices such as portable computing units and embedded devises which have only limited memory space and computing power. In this paper, we propose two high-order possibilistic c-means algorithms based on the canonical polyadic decomposition (CP-HOPCM) and the tensor-train network (TT-HOPCM) for clustering big data. In detail, we use the canonical polyadic decomposition and the tensor-train network to compress the attributes of each big data sample. To evaluate the performance of our algorithms, we conduct the experiments on two representative big data datasets, i.e., NUS-WIDE-14 and SNAE2, by comparison with the conventional high order possibilistic c-means algorithm in terms of attributes reduction, execution time, memory usage and clustering accuracy. Results imply that CP-HOPCM and TT-HOPCM are potential for big data clustering in IoT systems with low-end devices since they can achieve a high compression rate for heterogeneous samples to save the memory space significantly without a significant clustering accuracy drop. (C) 2017 Published by Elsevier B.V.

Pre One:Incomplete multi-view clustering via deep semantic mapping

Next One:Dependable Deep Computation Model for Feature Learning on Big Data in Cyber-Physical Systems