Current position: Home >> Scientific Research >> Paper Publications

BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering

Release Time:2019-03-11  Hits:

Indexed by: Conference Paper

Date of Publication: 2015-08-24

Included Journals: Scopus、CPCI-S、EI

Page Number: 1755-1760

Key Words: RDF; data partitioning; hybrid hierarchical clustering

Abstract: Data partitioning is a fundamental step to achieve effective storage and query of RDF big data. This paper presents a balance RDF data partitioning algorithm based on hybrid hierarchical clustering (BRDPHHC), which combines AP and K-means clustering. BRDPHHC's functionality includes three aspects: (i) a pre-processing step combining nodes compression and nodes remove to reduce the scale of raw data points, (ii) AP clustering algorithm is used to coarsen the RDF graph step by step and produce data blocks, and (iii) K-means algorithm is used for data partitioning finally. Experiments on benchmark datasets demonstrate the effectiveness of the proposed scheme.

Prev One:A Hybrid Method for Incomplete Data Imputation

Next One:A Trust-based User Assignment Scheme in Ad hoc Social Networks