An Ensemble Classifier-Based Approach for Feature Selection and Prediction of Loan Approval
DOI:
https://doi.org/10.17010/ijf/2025/v19i4/174947Keywords:
loan approval
, machine learning, feature ranking, LightGBM, XGBoost, CatBoost, AdaBoostJEL Classification Codes
, A1, C45, G21, G24, G32, N2Paper Submission Date
, July 5, 2024, Paper sent back for Revision, February 25, 2025, Paper Acceptance Date, March 10, Paper Published Online, April 15, 2025Abstract
Purpose : The current research addressed the challenges confronted by banks and financial institutions in processing loan requests. With an increase in the number of loans being requested, the risk of non-payment also increases. Thus, an accurate and reliable method was needed that can evaluate the eligibility of the applicant and lessen limit losses. Therefore, this study aimed to implement machine learning (ML) methods to anticipate loan approvals and diagnose potential defaulters.
Methodology : An ensemble-based ML method that uses grid search-based optimized classifiers was proposed. Four different classifiers, namely LightGBM, XGBoost, CatBoost, and AdaBoost, were used in this study. At first, 12 features were utilized to train these models, and the models established their order importance. These features were arranged in descending order of importance, and feature sensitivity analysis was done to find the most optimal set of features for the particular classifier. The performance of the model was compared based on accuracy and F1 score.
Findings : The results depicted that all three ensemble classifiers except Adaboost achieved an accuracy of ~98% trained using only six best features. Furthermore, Catboost outperformed all other models in the prediction of loan approvals with an accuracy and F1 score of 98.68% and 92.632%, respectively. This reflected the importance of selecting optimal features and ensemble learning, which improves the model performance and mitigates the computational complexity.
Practical Implications : The findings recommended that utilizing ML algorithms can improve the loan approval method by reducing the risk of operating inefficiencies and defaults. They also highlighted the necessity of selecting the optimal features to make reliable ML models. The future scope of this research lies in exploring additional data sources, such as transactional and behavioral data, to improve the model’s effectiveness.
Originality : The traditional methods for loan approval rely on subjective assessments and manual judgments. This study provides a data-driven method that can be automated with an improved decision-making process. Furthermore, this study used feature ranking methodology and ensemble classifiers that contribute to different aspects of loan analysis, offering a scalable and efficient alternative to banks and financial institutions.
Downloads
Published
How to Cite
Issue
Section
References
Abdelmoula, A. K. (2015). Bank credit risk analysis with k-nearest-neighbor classifier: Case of Tunisian banks. Accounting and Management Information Systems, 14(1), 79–106. https://online-cig.ase.ro/jcig/art/14_1_4.pdf
Al Mamun, M., Farjana, A., & Mamun, M. (2022). Predicting bank loan eligibility using machine learning models and comparison analysis. In Proceedings of the 7th North American International Conference on Industrial Engineering and Operations Management. IEOM Society International. https://doi.org/10.46254/NA07.20220328
Arun, K., Ishan, G., & Sanmeet, K. (2016). Loan approval prediction based on machine learning approach. IOSR Journal of Computer Engineering, 18(3), 79–81. https://doi.org/10.9790/0661-1803017981
Beade, Ã., RodrÃguez, M., & Santos, J. (2024). Multiperiod bankruptcy prediction models with interpretable single models. Computational https://doi.org/10.1007/s10614-023-10479-z Economics, 64(3), 1357–1390.
Behera, A. P., Rautaray, S. S., Pandey, M., & Gourisaria, M. K. (2021). Predictive loan approval model using logistic regression. In S. C. Satapathy, V. Bhateja, M. N. Favorskaya, & T. Adilakshmi (eds.), Smart computing techniques and applications. Smart innovation, systems and technologies (Vol. 225, pp. 715–722). Springer. https://doi.org/10.1007/978-981-16-0878-0_69
Budhedeo, S. H., & Pandya, N. P. (2018). Financial performance of public sector banks in India: A post-reform analysis. Indian Journal of Finance, 12(10), 7–20. https://doi.org/10.17010/ijf/2018/v12i10/132491
Chandra, S., & Awasthi, R. (2019). Insolvency risk: Issues and challenges for public sector commercial banks in India. Indian Journal of Finance, 13(12), 19–33. https://doi.org/10.17010/ijf/2019/v13i12/149266
Chen, C. M., Tso, G. K., & He, K. (2024). Quantum-optimized cost-based feature selection and credit scoring for mobile micro-financing. Computational Economics, 63(2), 919–950. https://doi.org/10.1007/s10614-023-10365-8
Cozarenco, A., & Szafarz, A. (2018). Gender biases in bank lending: Lessons from microcredit in France. Journal of Business Ethics, 147, 631–650. https://doi.org/10.1007/s10551-015-2948-y
Gupta, P., & Jaiswal, K. K. (2020). Analysis of financial performance of selected public and private sector banks. Indian Journal https://doi.org/10.17010/ijf/2020/v14i1/149856 of Finance, 14(1), 45–57.
Hancock, J. T., & Khoshgoftaar, T. M. (2020). CatBoost for big data: An interdisciplinary review. Journal of Big Data, 7(1), Article no. 94. https://doi.org/10.1186/s40537-020-00369-8
Kaur, S., & Pasricha, J. S. (2019). Problems faced by bank employees in implementing financial inclusion schemes. Indian Journal https://doi.org/10.17010/ijf/2019/v13i12/149267 of Finance, 13(12), 34–49.
Kumar, J., & Singh, R. (2017). Fiscal decentralization and urban local finance in India: A case study of Haryana. Arthashastra Indian Journal of Economics & Research, 6(6), 38–50. https://doi.org/10.17010/aijer/2017/v6i6/120116
Kumari, S., Nayak, D. S., & Swarnkar, T. (2023). Loan eligibility prediction using machine learning: A comparative approach. Global Journal of Modeling and Intelligent Computing, 3(1), 48–54. https://www.researchgate.net/publication/372656643
Liu, Z., Du, G., Zhou, S., Lu, H., & Ji, H. (2022). Analysis of internet financial risks based on deep learning and BP neural network. Computational Economics, 59(4), 1481–1499. https://doi.org/10.1007/s10614-021-10229-z
Madane, N., & Nanda, S. (2019). Loan prediction analysis decision tree. Journal of the Gujrat Research Society, 21(14s). http://www.gujaratresearchsociety.in/index.php/JGRS/article/view/1351
Mohapatra, S., Sahoo, D., & Kesharwani, A. (2015). Outsourcing of information technology: An empirical study in the Indian banking industry. Indian Journal of Finance, 9(7), 7–21. https://doi.org/10.17010/ijf/2015/v9i7/72350
Nag, A. K., & Shah, J. (2022). An empirical study on the impact of Gen Z investors' financial literacy to invest in the Indian stock market. Indian Journal of Finance, 16(10), 43–59. https://doi.org/10.17010/ijf/2022/v16i10/172387
Nagaraju, T. (2014). An analysis of profitability and marketability efficiencies of Indian public and private banks. Indian Journal https://doi.org/10.17010/ijf/2014/v8i1/71981 banks of Finance, 8(1), 15–28.
Narayanaswamy, T., & Muthulakshmi, A. P. (2016). Productivity and cost efficiency of commercial in India. Indian https://doi.org/10.17010/ijf/2016/v10i1/85839 Journal of Finance, 10(1), 8–27.
Pandey, N., Gupta, R., Uniyal, S., & Kumar, V. (2021). Loan approval prediction using machine learning algorithms approach. International Journal of Innovative Research in Technology, 8(1), 898–902. https://ijirt.org/Article?manuscript=151769
Park, M. S., Son, H., Hyun, C., & Hwang, H. J. (2021). Explainability of machine learning models for bankruptcy prediction. https://doi.org/10.1109/ACCESS.2021.3110270 IEEE Access, 9, 124887–124899.
Prakash, J., & Kankar, P. K. (2020). Health prediction of hydraulic cooling circuits using a deep neural network with an ensemble feature ranking technique. Measurement, 151, Article ID 107225. https://doi.org/10.1016/j.measurement.2019.107225
Ranawat, N. S., & Chakraborty, A. (2023). The impact of third-party financial products on the consumer loan services market in the banking sector: An analysis of sales progress and consumer behavior. Asia-Pacific https://doi.org/10.1007/s10690-023-09419-6 Financial Markets, 31, 367–387.
Ranawat, N. S., Prakash, J., Miglani, A., & Kankar, P. K. (2023). Performance evaluation of LSTM and Bi-LSTM using non-convolutional features for blockage detection in a centrifugal pump. Engineering Applications of Artificial Intelligence, 122, Article ID 106092. https://doi.org/10.1016/j.engappai.2023.106092
Ranawat, N. S., Miglani, A., & Kankar, P. K. (2023). Blockage detection in centrifugal pump using butterfly optimization-based feature selection. Mapan - Journal of Metrology Society of India, 38(2), 499–509. https://doi.org/10.1007/s12647-022-00616-8
Rodney, A., & Mincey, K. (2020). Understanding the relationship between student loan debt and stress among female students at an HBCU. Journal of African American Studies, 24, 269–275. https://doi.org/10.1007/s12111-020-09468-5
Shahraki, A., Abbasi, M., & Haugen, Ø. (2020). Boosting algorithms for network intrusion detection: A comparative evaluation of Real AdaBoost, Gentle AdaBoost and Modest AdaBoost. Engineering Applications of Artificial Intelligence, 94, Article ID 103770. https://doi.org/10.1016/j.engappai.2020.103770
Shukla, S. (2016). Performance of the Indian banking industry: A comparison of public and private sector banks. Indian Journal https://doi.org/10.17010/ijf/2016/v10i1/85843 of Finance, 10(1), 41–55.
Supriya, P., Pavani, M., Saisushma, N., Kumari, N. V., & Vikas, K. (2019). Loan prediction by using machine learning models. International Journal of Engineering and Techniques, 5(2), 144–148. http://www.ijetjournal.org/volume5/issue2/IJET-V5I2P28.pdf
Tejaswini, J., Kavya, T. M., Ramya, R. D., Triveni, P. S., & Maddumala, V. R. (2020). Accurate loan approval prediction based on machine learning approach. Journal of Engineering Science, 11(4), 523–532. https://jespublication.com/upload/2020-110471.pdf
Viswanatha, V., Ramachandra, A. C., Vishwas, K. N., & Adithya, G. (2023). Prediction of loan approval in banks using machine learning approach. International Journal of Engineering and Management Research, 13(4), 7–19. https://doi.org/10.31033/ijemr.13.4.2
Wang, D., Hong, D., & Wu, Q. (2023). Prediction of loan rate for mortgage data: Deep learning versus robust regression. Computational Economics, 61(3), 1137–1150. https://doi.org/10.1007/s10614-022-10239-5
Wilson, F. (2016). Making loan decisions in banks: Straight from the gut? Journal of Business Ethics, 137, 53–63. https://doi.org/10.1007/s10551-014-2515-y
Wu, Z. (2022). Using machine learning approach to evaluate the excessive financialization risks of trading enterprises. Computational Economics, 59(4), 1607–1625. https://doi.org/10.1007/s10614-020-10090-6