location: Current position: Home >> Scientific Research >> Paper Publications

Decision tree for credit scoring and discovery of significant features: an empirical analysis based on Chinese microfinance for farmers

Hits:

Indexed by:期刊论文

Date of Publication:2018-01-01

Journal:FILOMAT

Volume:32

Issue:5

Page Number:1513-1521

ISSN No.:0354-5180

Key Words:Decision trees; C5.0; CHAID; Data mining; Classification; Credit scoring

Abstract:For the tens of thousands of farmers' loan financing, it's imperative to find which features are the key indicators affecting the credit scoring of rural households. In this paper, C5.0, CHAID and C&RT three models are used to screen the key indicators affecting farmers' credit scoring, and 2044 farmers' microfinance data from 28 provinces in China are applied in the empirical study. The empirical results show the classification accuracy of C5.0 is better than CHAID and C&RT in both the training set and test set, thus finally use the feature subset selected by C5.0. Six key features screened from 44 attributes by C5.0, which have significant influence on credit scoring of farmers, namely, education level, net income each year/per capita GDP, education cost of children each year, Residence type, residential year, relationship with cosigners.

Pre One:Credit default prediction using a support vector machine and a probabilistic neural network

Next One:基于幂风险谱和蒙特卡洛模拟的贷款优化配置模型