Data Analysis of Loan Defaulters Prediction Model Using Machine Learning Techniques
DOI:
https://doi.org/10.17010/ijcs/2024/v9/i4/174567Keywords:
AOC_ROC
, Machine Learning, Precision, Recall, SMOTE, Prediction model, Recommendation.Paper Submission Date
, July 5, 2024, Paper sent back for Revision, July 15, Paper Acceptance Date, July 17, Paper Published Online, August 5, 2024.Abstract
In recent times, the defaulters of loans have become a serious concern for the banking sector. To overcome this problem, the proposed prediction model has been recommended to predict the defaulters using different Machine Learning algorithms. This paper builds a loan defaulters prediction model based on real-world user loan data. The SMOTE method is adopted to cope with the problem of imbalance class in the dataset, and then a series of Machine Learning algorithms is used to find the best model; as a result, the Naive Bayes model performed better compared to other models based on the output of Recall, Precision, and AOC_ROC accuracy.
Downloads
Published
How to Cite
Issue
Section
References
A. A. A. Albastaki, “Loan Default Prediction System,†Thesis, Rochester Inst. Technol., 2022. [Online]. Available: https://repository.rit.edu/cgi/viewcontent.cgi?article=12544&context=theses
J. Chen, “Default: What it means, what happens when you default, and examples,†investopedia.com. [Online]. Available: https://www.investopedia.com/terms/d/default2.asp
J. Lin, “Research on loan default prediction based on logistic regression, randomforest, xgboost and adaboost,†SHS Web Conferences, vol. 181, Art. no. 02008, Jan. 2024, doi: 10.1051/shsconf/202418102008.
M. Alsaleem and S. Hasoon, “Predicting bank loan risks using machine learning algorithms,†AL-Rafidain, J. Comp. Sciences Math., vol. 14, no. 1, pp. 159–168, Jun. 2020. doi: 10.33899/csmj.2020.164686.
A. Kacheria, N. Shivakumar, S. Sawkar, and A. Gupta, “Loan sanctioning prediction system,†Int. J. Soft Comput. Eng., vol. 6, no. 4, pp. 50–53, Sep. 2016.
A. Uwais, “Loan default prediction using Spark Machine Learning algorithms,†2021. [Online]. Available: https://ceur-ws.org/Vol-3105/paper30.pdf
L. Zhu, L., D. Qiu, D. Ergu, C. Ying, and K. Liu, “A study on predicting loan default based on the random forest algorithm,†Procedia Comp. Sci., vol. 162, Jan. 2019. [Online[. Available: https://www.researchgate.net/publication/338286615_A_study_on_predicting_loan_default_based_on_the_random_forest_algorithm
C. N. Kumar, D. Keerthana, M. Kavitha, and M. Kalyani, "Customer loan eligibility prediction using Machine Learning algorithms in banking sector," 2022 7th Int. Conf. Communication Electronics Syst., Coimbatore, India, 2022, pp. 1007–1012, doi: 10.1109/ICCES54183.2022.9835725.
V. Singh, A. Yadav, R. Awasthi, and N. Partheeban, "Prediction of modernized loan approval system based on Machine Learning approach," 2021 Int. Conf. Intell. Technologies, Hubli, India, 2021, pp. 1–4, doi: 10.1109/CONIT51480.2021.9498475.
P. N. Hegde, Deepa, C. Shetty, Rashmi N., Dhananjaya B., and Prathvakshini, "Predictive analysis of loan data using Machine Learning," 2022 Int. Conf. Artif. Intell. Data Eng., Karkala, India, 2022, pp. 272–276, doi: 10.1109/AIDE57180.2022.10060781.
“What is Binning?,†2024. [Online]. Available: https://docs.tibco.com/pub/spotfire/7.0.1/doc/html/bin/bin_what_is_binning.htm
A. Sethi, “One hot encoding vs. label encoding in Machine Learning,†2024. [Online]. Available: https://www.analyticsvidhya.com/blog/2020/03/one-hot-encoding-vs-label-encoding-using-scikit-learn/
P. Tumuluru, L. R. Burra, M. Loukya, S. Bhavana, H. M. H. CSaiBaba, and N. Sunanda, "Comparative analysis of customer loan approval prediction using Machine Learning algorithms," 2022 2nd Int. Conf. Artif. Intell. Smart Energy, Coimbatore, India, pp. 349–353, 2022, doi: 10.1109/ICAIS53314.2022.9742800.
L. Victor and M. Raheem, “Loan default prediction using Genetic Algorithm: A study within peer-to-peer lending communities,†Int. J. Innovative Sci. Res. Technol., vol. 6, no. 3, pp. 1195–1205, 2021. [Online]. Available: https://ijisrt.com/assets/upload/files/IJISRT21MAR687.pdf
V. Kotu and B. Deshpande, Data Sci. Concepts Pract. 2nd ed., 2019. Cambridge: Morgan Kaufmann.
N. Darapaneni, A. Kumar, A. Dixet, M. Suriyanarayanan, S. Srivastava, and A. R. Paduri, "Loan prediction software for financial institutions," 2022 Interdisciplinary Res. Technol. Manage., Kolkata, India, pp. 1–8, doi: 10.1109/IRTM54583.2022.9791797.
D. Nautiyal, “ML|Underfitting and Overfitting.†[Online] Available: https://www.geeksforgeeks.org/underfitting-and-overfitting-in-machine-learning/
J. Korstanje, “Towards Data Science.†[Online] Available: https://towardsdatascience.com/smotefdce2f605729#:~:text=SMOTE%20is%20a%20machine%20learning,with%20this%20type%20o f%20data
L. Zhu, D. Qiu, D. Ergu, C. Ying, and K. Liu, “A study on predicting loan default based on the random forest algorithm,†Procedia Comp. Sci., vol. 162, pp. 503–513, 2019, doi: 10.1016/j.procs.2019.12.017.