杨光飞

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:男

毕业院校:早稻田大学

学位:博士

所在单位:系统工程研究所

学科:管理科学与工程

联系方式:邮件:gfyang@dlut.edu.cn 电话:0411-84707917

电子邮箱:gfyang@dlut.edu.cn

扫描关注

论文成果

当前位置: 杨光飞 >> 科学研究 >> 论文成果

A Study of Interestingness Measures for Associative Classification on Imbalanced Data

点击次数:

论文类型:会议论文

发表时间:2015-05-19

收录刊物:EI、CPCI-S、Scopus

卷号:9441

页面范围:141-151

关键字:Associative classification; Imbalanced data; Interestingness measure; Stable clusters; Ranking computation

摘要:Associative Classification (AC) is a well known tool in knowledge discovery and it has been proved to extract competitive classifiers. However, imbalanced data has posed a challenge for most classifier learning algorithms including AC methods. Because in the AC process, Interestingness Measure (IM) plays an important role to generate interesting rules and build good classifiers, it is very important to select IMs for improving AC's performance in the context of imbalanced data. In this paper, we aim at improving AC's performance on imbalanced data through studying IMs. To achieve this, there are two main tasks to be settled. The first one is to find which measures have similar behaviors on imbalanced data. The second is to select appropriate measures. We evaluate each measure's performance by AUC which is usually used for evaluation of imbalanced data classification. Firstly, based on the performances, we propose a frequent correlated patterns mining method to extract stable clusters in which the IMs have similar behaviors. Secondly, we find 26 proper measures for imbalanced data after the IM ranking computation method and divide them into two groups with one especially for extremely imbalanced data and the other suitable for slightly imbalanced data.