林晓惠

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:计算机科学与技术学院

电子邮箱:datas@dlut.edu.cn

扫描关注

论文成果

当前位置: 算法设计与分析 >> 科学研究 >> 论文成果

A new feature selection method based on symmetrical uncertainty and interaction gain

点击次数:

论文类型:期刊论文

发表时间:2019-12-01

发表刊物:COMPUTATIONAL BIOLOGY AND CHEMISTRY

收录刊物:PubMed、EI、SCIE

卷号:83

页面范围:107149

ISSN号:1476-9271

关键字:Feature selection; Biological data analysis; Interaction gain

摘要:Defining important information from complex biological data is of great significance in biological study. It is known that the physiological and pathological changes in an organism are usually influenced by molecule interactions. Analyzing biological data by fusing the evaluation of the individual molecules and molecule interactions could induce a more accurate and comprehensive understanding of the organism. This study proposes an Interaction Gain - Recursive Feature Elimination (IG-RFE) method which evaluates the feature importance by combining the relevance between feature and class label and the interaction among features. Symmetrical uncertainty is adopted to measure the relevance between feature and the class label. The average normalized interaction gain of feature f, every other features and the class label is calculated to reflect the interaction of feature f with other features in the feature set F. Based on the combination of symmetrical uncertainty and normalized interaction gain, less important features are removed iteratively. To show the performance of IG-RFE, it was compared with seven efficient feature selection methods, MIFS, mRMR, CMIM, ReliefF, FCBF, PGVNS and SVM-RFE, on eleven public datasets. The experiment results showed the superiority of IG-RFE in accuracy, sensitivity, specificity and stability. Hence, integrating feature individual discriminative ability and the interaction among features could better evaluate feature importance in biological data analysis.