金博

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:男

毕业院校:大连理工大学

学位:博士

所在单位:创新创业学院

学科:计算机应用技术

办公地点:创客空间607

电子邮箱:jinbo@dlut.edu.cn

扫描关注

论文成果

当前位置: 金博 >> 科学研究 >> 论文成果

Efficient Methods for Multi-label Classification

点击次数:

论文类型:会议论文

发表时间:2015-05-19

收录刊物:EI、CPCI-S、Scopus

卷号:9077

页面范围:164-175

关键字:Classification; Clustering; Dimension reduction

摘要:As a generalized form of multi-class classification, multilabel classification allows each sample to be associated with multiple labels. This task becomes challenging when the number of labels bulks up, which demands a high efficiency. Many approaches have been proposed to address this problem, among which one of the main ideas is to select a subset of labels which can approximately span the original label space, and training is performed only on the selected set of labels. However, these proposed sampling algorithms either require nondeterministic number of sampling trials or are time consuming. In this paper, we propose two label selection methods for multi-label classification (i) clustering based sampling (CBS) that uses deterministic number of sampling trials; and (ii) frequency based sampling (FBS) utilizing only label frequency statistics which makes it more efficient. Moreover, neither of these two algorithms needs to perform singular value decomposition (SVD) on label matrix which is used in previously mentioned approaches. Experiments are performed on several real world multi-label data sets with the number of labels ranging from hundreds to thousands, and it is shown that the proposed approaches achieve the state-of-the-art performance among label space reduction based multi-label classification algorithms.