![]() |
个人信息Personal Information
教授
博士生导师
硕士生导师
主要任职:软件学院(大连理工大学-立命馆大学国际信息与软件学院)党委书记
性别:女
毕业院校:吉林大学
学位:博士
所在单位:软件学院、国际信息与软件学院
学科:软件工程. 计算数学. 计算机应用技术
办公地点:大连理工大学开发区校区信息楼309室
联系方式:nalei@dlut.edu.cn
电子邮箱:nalei@dlut.edu.cn
基于众包学习的交互式特征选择方法
点击次数:
发表时间:2020-01-01
发表刊物:Scientia Sinica Informationis
卷号:50
期号:6
页面范围:794-812
ISSN号:1674-7267
关键字:"ensemble feature selection; learning-from-crowds; visual analysis; interactive visualization; ranking visualization"
CN号:11-5846/TP
摘要:Ensemble feature selection algorithms aggregate the results of multiple feature selection methods in order to select an effective subset of features. However, typically, ensemble algorithms treat each feature selection method equally and do not consider performance differences. Consequently, features selected by a relatively smaller number of methods may not be included. To address this problem, we propose an interactive feature selection method that can more effectively aggregate the results of multiple feature selection methods and iteratively improve the selected features by integrating expert knowledge. The proposed method includes a learning-from-crowds-based ensemble feature selection algorithm and a visual analysis system. The algorithm models the performance of multiple feature selection methods, calculates their reliabilities, and aggregates results. To integrate expert knowledge, the visual analysis system provides a set of ranking schemes to assist experts in understanding the results of an individual feature selection method and the roles played by the features in classification tasks. A numerical experiment conducted on four real-world datasets shows that the proposed algorithm can improve classification accuracy by 0.63%-2.85% compared to state-of-the-art ensemble feature selection algorithms. In addition, we conducted case studies on text and image data to demonstrate that the proposed visual analysis system can further improve classification accuracy by 0.28%-5.24%.
备注:新增回溯数据