location: Current position: Home >> Scientific Research >> Paper Publications

Data imputation for gas flow data in steel industry based on non-equal-length granules correlation coefficient

Hits:

Indexed by:期刊论文

Date of Publication:2016-11-01

Journal:INFORMATION SCIENCES

Included Journals:SCIE、EI、Scopus

Volume:367

Page Number:311-323

ISSN No.:0020-0255

Key Words:Byproduct gas of steel industry; Data imputation; Non-equal-length granules correlation coefficient; Estimation of distribution algorithm

Abstract:In the field of data-driven based modeling and optimization, the completeness and the accuracy of data samples are the foundations for further research tasks. Since the byproduct gas system of steel industry is rather complicated and its data-acquisition process might be frequently affected by the unexpected operational factors, the data-missing phenomenon usually occurs, which might lead to the failure of model establishment or inaccurate information discovery. In this study, a data imputation method based on the manufacturing characteristics is proposed for resolving the data-missing problem in steel industry. A novel correlation analysis, named by non-equal-length granules correlation coefficient (NGCC), is reported, and the corresponding model based on Estimation of Distribution Algorithm (EDA) is established to study the correlation of the similar procedures. To verify the performance of the proposed method, this study considers three typical features of the gas flow data with different missing ratios. The experiment results indicate that it is greatly effective for the missing data imputation of byproduct gas, and exhibits better performance on the accuracy compared to the other methods. (C) 2016 Elsevier Inc. All rights reserved.

Pre One:基于输入延迟支持向量机的氮气管网压力预测

Next One:Prediction intervals for industrial data with incomplete input using kernel-based dynamic Bayesian networks