location: Current position: Home >> Scientific Research >> Paper Publications

Ensemble Unsupervised Feature Selection Based on Permutation and R-value

Hits:

Indexed by:会议论文

Date of Publication:2015-01-01

Included Journals:CPCI-S

Page Number:795-800

Key Words:unsupervised feature selection; ensemble technique; clustering; permutation

Abstract:Selecting the informative features from the high dimensional data can improve the performance of the classification and get a deep understanding of the problems. A non-problem related feature contains little information and has little influence on the data distribution. By permuting the feature and calculating the data distribution difference, how much information the feature contains could be measured. In this paper, we propose an unsupervised feature selection method (EUFSPR), which combines the ensemble technique, clustering, permutation and data distribution evaluation techniques to measure the feature importance. Clustering is adopted to get the sample groups and the data distribution is evaluated by the overlapping areas. Eight gene expression microarray datasets are utilized to demonstrate the effectiveness of the proposed method over the unsupervised feature selection methods and supervised feature selection methods.

Pre One:A weighted relative difference accumulation algorithm for dynamic metabolomics data: long-term elevated bile acids are risk factors for hepatocellular carcinoma

Next One:Chaos and bifurcations in chaotic maps with parameter q: Numerical and analytical studies