Peer-to-peer lending (P2P lending) has proliferated in recent years thanks to Fintech and big data advancements. However, P2P lending platforms are not tightly governed by relevant laws yet, as their development speed has far exceeded that of regulations. Therefore, P2P lending operations are still subject to risks. This paper proposes prediction models to mitigate the risks of default and asymmetric information on P2P lending platforms. Specifically, we designed sophisticated procedures to pre-process mass data extracted from Lending Club in 2018 Q3–2019 Q2. After that, three statistical models, namely, Logistic Regression, Bayesian Classifier, and Linear Discriminant Analysis (LDA), and five AI models, namely, Decision Tree, Random Forest, LightGBM, Artificial Neural Network (ANN), and Convolutional Neural Network (CNN), were utilized for data analysis. The loan statuses of Lending Club’s customers were rationally classified. To evaluate the models, we adopted the confusion matrix series of metrics, AUC-ROC curve, Kolmogorov–Smirnov chart (KS), and Student’s t-test. Empirical studies show that LightGBM produces the best performance and is 2.91% more accurate than the other models, resulting in a revenue improvement of nearly USD 24 million for Lending Club. Student’s t-test proves that the differences between models are statistically significant.
This paper introduces a multi-objective genetic algorithm (MOGA) in regard to the portfolio optimization issue in different risk measures, such as mean-variance, semi-variance, mean-variance-skewness, mean-absolutedeviation and lower-partial-moment to optimize risk of portfolio. This study introduces a PONSGA model by appling the non-dominated sorting genetic algorithm (NSGA-II) to maximize both the expected return and the skewness of portfolio as well as to simultaneously minimize different portfolio risks. The experimental results demonstrated that the PONSGA approach is significantly superior to the GA in all performances, examined such as the coefficient of variation, Sharpe index, Sortino index and portfolio performance index. The statistical significance tests also showed that the NSGA-II models outperformed the GA models in different risk measures.2000 Mathematics Subject Classification. 49 and 90.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.