孟军

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:计算机科学与技术学院

学科:计算机应用技术. 计算机软件与理论

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Data editing based self-training algorithm

点击次数:

论文类型:期刊论文

发表时间:2009-06-01

发表刊物:Journal of Computational Information Systems

收录刊物:EI、Scopus

卷号:5

期号:3

页面范围:1373-1378

ISSN号:15539105

摘要:Self-Training algorithm is a semi-supervised classification algorithm which through repeated training with the labeled data to get a enlarged labeled data set and improve the classification accuracy meanwhile. Since the initial labeled data set in Self-Training algorithm may be small, a considerable number of data are mislabeled in the training process is unavoidable. A nearest neighbor rule based data editing technique is introduced, which extends traditional self-training algorithm by new methods of identifying and removing the mislabeled data, so that it can reduce the mislabeled data and improve the classification accuracy. The data sets used in experiments are all from the UCI machine repository. The classification effect is improved in different levels through contrast. The experimental results show that the introduction of the data editing technique is beneficial for improving the classification effect of Self-Training. 1553-9105/ Copyright ? 2009 Binary Information Press.