Current position: Home >> Scientific Research >> Paper Publications

Data editing based self-training algorithm

Release Time:2019-03-11  Hits:

Indexed by: Journal Article

Date of Publication: 2009-06-01

Journal: Journal of Computational Information Systems

Included Journals: Scopus、EI

Volume: 5

Issue: 3

Page Number: 1373-1378

ISSN: 15539105

Abstract: Self-Training algorithm is a semi-supervised classification algorithm which through repeated training with the labeled data to get a enlarged labeled data set and improve the classification accuracy meanwhile. Since the initial labeled data set in Self-Training algorithm may be small, a considerable number of data are mislabeled in the training process is unavoidable. A nearest neighbor rule based data editing technique is introduced, which extends traditional self-training algorithm by new methods of identifying and removing the mislabeled data, so that it can reduce the mislabeled data and improve the classification accuracy. The data sets used in experiments are all from the UCI machine repository. The classification effect is improved in different levels through contrast. The experimental results show that the introduction of the data editing technique is beneficial for improving the classification effect of Self-Training. 1553-9105/ Copyright ? 2009 Binary Information Press.

Prev One:Real-time collision detection and response techniques for deformable objects based on hybrid bounding volume hierarchy

Next One:计算机专业课程研究型教学模式的探讨