A Hierarchical Missing Value Imputation Method by Correlation-Based K-Nearest Neighbors

Indexed by:Conference paper

Date of Publication:2020-01-01

Included Journals:EI

Volume:1037

Page Number:486-496

Key Words:Missing value imputation; K-nearest neighbors; Correlation analysis; Incomplete record division

Abstract:Missing values are common in real-world datasets, and many methods have been proposed to handle them. Among these methods, KNN imputation has attracted considerable attention because it is simple to implement, easy to understand, and relatively accurate. However, it ignores the influence of correlations between attributes on the similarity of records. In this paper, we take these correlations into account when selecting the nearest neighbors, and impute the incomplete records successively according to the number of missing values in each record. During imputation, the correlation coefficients are first calculated from the complete records and then updated using the union of the complete and imputed records. As data utilization improves, the estimated correlations between attributes become more accurate, which makes the selected nearest neighbors more appropriate. Experimental results demonstrate that the improved method is more effective in missing value imputation.
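
The procedure outlined in the abstract can be sketched roughly as follows. This is a minimal illustration only, not the paper's implementation: the function name `correlation_knn_impute`, the use of absolute Pearson correlations as distance weights, the per-record (rather than per-group) update of the correlations, and the mean of the k neighbors as the imputed value are all assumptions. It also assumes the data contain at least two complete records.

```python
import numpy as np

def correlation_knn_impute(X, k=3):
    """Hypothetical sketch: fill NaNs, handling records with fewer gaps first."""
    X = np.array(X, dtype=float)
    filled = X.copy()
    n_missing = np.isnan(X).sum(axis=1)

    # The reference pool starts with the complete records only.
    pool = [i for i in range(len(X)) if n_missing[i] == 0]
    # Hierarchical order: records with fewer missing values are imputed first.
    incomplete = sorted((i for i in range(len(X)) if n_missing[i] > 0),
                        key=lambda i: n_missing[i])

    for i in incomplete:
        ref = filled[pool]
        # Attribute correlations, recomputed from complete + already imputed records.
        with np.errstate(invalid="ignore", divide="ignore"):
            corr = np.nan_to_num(np.corrcoef(ref, rowvar=False))
        observed = ~np.isnan(X[i])
        for j in np.where(~observed)[0]:
            # Assumed weighting: |correlation| between target j and each observed attribute.
            w = np.abs(corr[j, observed])
            if w.sum() == 0:
                w = np.ones_like(w)  # fall back to plain Euclidean distance
            diff = ref[:, observed] - X[i, observed]
            dist = np.sqrt((w * diff ** 2).sum(axis=1))
            nearest = np.argsort(dist)[:k]
            # Assumed estimator: mean of the neighbors' values for attribute j.
            filled[i, j] = ref[nearest, j].mean()
        # The imputed record joins the pool used for later records.
        pool.append(i)
    return filled
```

Calling `correlation_knn_impute(data, k=3)` on a 2-D array with `NaN` entries returns a copy in which the observed values are unchanged and every gap is filled. Updating the pool after each record is a simplification; updating once per group of records with the same number of missing values would be an equally plausible reading of the abstract.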
