location: Current position: Home >> Scientific Research >> Paper Publications

Distributed Feature Selection for Efficient Economic Big Data Analysis

Hits:

Indexed by:Journal Papers

Date of Publication:2018-04-01

Journal:IEEE TRANSACTIONS ON BIG DATA

Included Journals:EI

Volume:4

Issue:2

Page Number:164-176

ISSN No.:2332-7790

Key Words:Feature selection; big data; subtractive clustering; collaborative theory; economy; urbanization

Abstract:With the rapidly increasing popularity of economic activities, a large amount of economic data is being collected. Although such data offers super opportunities for economic analysis, its low-quality, high-dimensionality and huge-volume pose great challenges on efficient analysis of economic big data. The existing methods have primarily analyzed economic data from the perspective of econometrics, which involves limited indicators and demands prior knowledge of economists. When embracing large varieties of economic factors, these methods tend to yield unsatisfactory performance. To address the challenges, this paper presents a new framework for efficient analysis of high-dimensional economic big data based on innovative distributed feature selection. Specifically, the framework combines the methods of economic feature selection and econometric model construction to reveal the hidden patterns for economic development. The functionality rests on three pillars: (i) novel data pre-processing techniques to prepare high-quality economic data, (ii) an innovative distributed feature identification solution to locate important and representative economic indicators from multidimensional data sets, and (iii) new econometric models to capture the hidden patterns for economic development. The experimental results on the economic data collected in Dalian, China, demonstrate that our proposed framework and methods have superior performance in analyzing enormous economic data.

Pre One:Local Similarity Imputation Based on Fast Clustering for Incomplete Data in Cyber-Physical Systems

Next One:Incomplete multi-view clustering via deep semantic mapping