location: Current position: Home >> Scientific Research >> Paper Publications

Multi-view ensemble learning based on distance-to-model and adaptive clustering for imbalanced credit risk assessment in P2P lending

Hits:

Indexed by:Journal Papers

Date of Publication:2020-07-01

Journal:INFORMATION SCIENCES

Included Journals:SCIE、SSCI

Volume:525

Page Number:182-204

ISSN No.:0020-0255

Key Words:Credit risk assessment; Peer-to-peer lending; Multi-view ensemble learning; Adaptive clustering; Distance-to-model

Abstract:Credit risk assessment is a crucial task in the peer-to-peer (P2P) lending industry. In recent years, ensemble learning methods have been verified to perform better in default prediction than individual classifiers and statistical techniques. Real-world loan datasets are imbalanced; however, most studies focus on enhancing overall prediction accuracy rather than improving the identification ability of real default loans. Moreover, some of the features that are significantly correlated with default rates are not attached importance in the model construction of previous studies. To fill these gaps, we propose a distance-to-model and adaptive clustering-based multi-view ensemble (DM-ACME) learning method for predicting default risk in P2P lending. In this method, multi-view learning and an adaptive clustering method are explored to produce an ensemble of diverse ensembles constituted by gradient boosting decision trees. A novel combination strategy called distance-to-model and a soft probability fashion are embedded for model integration. To verify the effectiveness of the proposed ensemble approach, comprehensive analysis on DM-ACME, comparative experiments with several state-of-the-art methods, and feature importance evaluation are conducted with the data provided by Lending Club. Experimental results demonstrate the superiority of the proposed method as well as indicate the importance of some features in loan default prediction. (C) 2020 Elsevier Inc. All rights reserved.

Pre One:Crowd counting considering network flow constraints in videos

Next One:Simulation of Individual Knowledge System and Its Application