林晓惠

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:计算机科学与技术学院

电子邮箱:datas@dlut.edu.cn

扫描关注

论文成果

当前位置: 算法设计与分析 >> 科学研究 >> 论文成果

A new feature selection algorithm based on relevance, redundancy and complementarity

点击次数:

论文类型:期刊论文

发表时间:2020-04-01

发表刊物:COMPUTERS IN BIOLOGY AND MEDICINE

收录刊物:EI、SCIE

卷号:119

ISSN号:0010-4825

关键字:Biological data analysis; Feature selection; Feature relevance; Feature redundancy; Feature complementarity

摘要:Defining important information from biological data is critical for the study of disease diagnosis, drug efficacy and individualized treatment. Hence, the feature selection technique is widely applied. Many feature selection methods measure features based on relevance, redundancy and complementarity. Feature complementarity means that two features' cooperation can provide more information than the simple summation of their individual information. In this paper, we studied the feature selection technique and proposed a new feature selection algorithm based on relevance, redundancy and complementarity (FS-RRC). On selecting the feature subset, FS-RRC not only evaluates the feature relevance with the class label and the redundancy among the features but also evaluates the feature complementarity. If complementary features exist for a selected relevant feature, FS-RRC retains the feature with the largest complementarity to the selected feature subset. To show the performance of FS-RRC, it was compared with eleven efficient feature selection methods, MIFS, mRMR, CMIM, ReliefF, FCBF, PGVNS, MCRMCR, MCRMICR, RCDFS, SAFE and SVM-RFE on two synthetic datasets and fifteen public biological datasets. The experimental results showed the superiority of FS-RRC in accuracy, sensitivity, specificity, stability and time complexity. Hence, integrating feature individual discriminative ability, redundancy and complementarity can define more powerful feature subset for biological data analysis, and feature complementarity can help to study the biomedical phenomena more accurately.