Evaluating Trade-offs Between Error Rates in Machine Learning Credit Scoring Models

Zipporah Chepkemoi; Lilian Wanzare D.; Sylvester Mcoyowo

doi:10.32628/CSEIT25111678

Authors

Zipporah Chepkemoi Department of Computer Science, Maseno University/Kisumu, Kenya Author
Lilian Wanzare D. Department of Computer Science, Maseno University/Kisumu, Kenya Author
Sylvester Mcoyowo Department of Computer Science, Maseno University/Kisumu, Kenya Author

DOI:

https://doi.org/10.32628/CSEIT25111678

Keywords:

Credit Scoring Models, Machine learning, Error rate tuning, financial decision making

Abstract

Several studies have explored the application of machine learning in credit scoring, however there is limited research focusing on the implication of the trade-offs between the false positive and false negative rates in these models. Trade-off occurs when one error is prioritized over the other and have an impact on both the lenders and borrowers. An increased false positive rate will misclassify more potential customers as high-risk, causing financial losses for lenders and negatively impacting credit applicants, while an increased false negative rate leads to missed opportunities for approving creditworthy applicants. This study assesses the trade-offs in the following machine learning models for credit scoring: logistic regression, multilayer perceptron, support vector machine and random forest trained on the German Credit and the Kenyan Uwezo Fund datasets and further tests a number of methods to control it so as to arrive at the trade-off that works best for the intended users. The outcome of the study showed that assigning equal cost to both errors, balancing class distributions through resampling techniques and adjusting thresholds affected the trade-offs between false positive rates and false negative rates in the models therefore managing the trade-offs towards an optimal point. The results of the study highlights the potential impacts on fairness and decision-making in credit scoring.

Downloads

Download data is not yet available.

References

G. Rebecca, “How Does Credit Scoring Work?,” https://staging.comparebanks.co.uk/guides/how-does-credit-scoring-work/, Jul. 30, 2024.

A. Hakberdiev, “ABOUT THE CREDIT MODEL SYSTEM.,” Multidiscip. Multidimens. J., vol. 3, no. 4, pp. 150–152, Apr. 2024.

O. Bello, “Machine learning algorithms for credit risk assessment: an economic and financial analysis.,” Int. J. Manag., vol. 10, no. 1, pp. 109–133, 2023.

U. Aslam, H. Tariq Aziz, A. Sohail, and N. Batcha, “An empirical study on loan default prediction models,” J. Comput. Theor. Nanosci., vol. 16, no. 8, pp. 3483-3488., Aug. 2019. DOI: https://doi.org/10.1166/jctn.2019.8312

Z. Al-Slehat, S. Almanaseer, B. Al Sharif , Y. Al-Haraisa, S. Aloshaibat, and M. Almahasneh, “Creditworthiness Criteria According to the 5Cs Model and Credit Decision: The Moderating Role of Intellectual Capital.,” Int. Rev. Manag. Mark., vol. 14, no. 6, pp. 274–287, Oct. 2024. DOI: https://doi.org/10.32479/irmm.17257

T. Sarkar, M. Rakhra, V. Sharma, and A. Singh, “An Empirical Comparison of Machine Learning Techniques for Bank Loan Approval Prediction,” in In 2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE), May 2024, pp. 137–143. DOI: https://doi.org/10.1109/IC3SE62002.2024.10593355

S. Ronad, “Automated Decision-Making in Green Banking and Sustainable Credit Scoring”.

R. Bhandary and B. K. Ghosh, “Credit Card Default Prediction: An Empirical Analysis on Predictive Performance Using Statistical and Machine Learning Methods,” J. Risk Financ. Manag., vol. 18, no. 1, p. 23, 2025. DOI: https://doi.org/10.3390/jrfm18010023

Y. E. Gür, M. Toğaçar, and B. Solak, “Integration of CNN Models and Machine Learning Methods in Credit Score Classification: 2D Image Transformation and Feature Extraction,” Comput. Econ., pp. 1–45, 2025. DOI: https://doi.org/10.1007/s10614-025-10893-5

S. Gupta, “Advanced Credit Scoring Models Using Dremio And Google Cloud ML: Developing Machine Learning Algorithms That Incorporate Alternative Data Sources To Enhance Credit Scoring Accuracy”.

Y. Chen, R. Calabrese, and B. Martin-Barragan, “Interpretable machine learning for imbalanced credit scoring datasets,” Eur. J. Oper. Res., vol. 312, no. 1, pp. 357–372, 2024. DOI: https://doi.org/10.1016/j.ejor.2023.06.036

E. MARUTI, “Effect of microfinance practices on credit accessibility by consumer based small scale business in Kisumu city,” DissMaseno Univ., 2022.

S. Ronad, “Automated Decision-Making in Green Banking and Sustainable Credit Scoring”.

X. Li and Y. Zhong, “An overview of personal credit scoring: techniques and future work.,” 2012. DOI: https://doi.org/10.4236/ijis.2012.224024

E. Oye, A. Mattews, P. Peace, and M. Andrews, “Real-Time Credit Risk Monitoring with AI-Generated Insights,” 2025.

A. Bhattacharya, S. K. Biswas, A. Mandal, and A. K. Das, “Ensembling of Performance Metrics in Credit Risk Assessment Using Machine Learning Analytics,” presented at the International Conference on Computing and Machine Learning, Springer, 2024, pp. 135–155. DOI: https://doi.org/10.1007/978-981-97-6588-1_11

Y. Fang, “Research on machine learning in credit risk management: current developments and future trends,” presented at the International Conference on Artificial Intelligence and Machine Learning Research (CAIMLR 2024), SPIE, 2025, pp. 264–272. DOI: https://doi.org/10.1117/12.3058170

S. Andrae, “Fairness and bias in machine learning models for credit decisions,” in Machine learning and modeling techniques in financial data science, IGI Global Scientific Publishing, 2025, pp. 1–24. DOI: https://doi.org/10.4018/979-8-3693-8186-1.ch001

M. Vasconcelos and L. Cavique, “Mitigating false negatives in imbalanced datasets: An ensemble approach,” Expert Syst. Appl., vol. 262, p. 125674, 2025. DOI: https://doi.org/10.1016/j.eswa.2024.125674

Y. Wang and J. L. Priestley, “Binary classification on past due of service accounts using logistic regression and decision tree,” 2017.

M. Ala’raj and M. Abbod, “A systematic credit scoring model based on heterogeneous classifier ensembles,” presented at the 2015 international symposium on innovations in intelligent systems and applications (INISTA), IEEE, 2015, pp. 1–7. DOI: https://doi.org/10.1109/INISTA.2015.7276736

M. Carvalho, A. J. Pinho, and S. Brás, “Resampling approaches to handle class imbalance: a review from a data perspective,” J. Big Data, vol. 12, no. 1, p. 71, 2025. DOI: https://doi.org/10.1186/s40537-025-01119-4

B. Yousefimehr et al., “Data Balancing Strategies: A Survey of Resampling and Augmentation Methods,” ArXiv Prepr. ArXiv250513518, 2025.

V. Meursault, D. Moulton, L. Santucci, and N. Schor, “One threshold doesn’t fit all: Tailoring machine learning predictions of consumer default for lower‐income areas,” J. Policy Anal. Manage., vol. 44, no. 3, pp. 792–815, 2025. DOI: https://doi.org/10.1002/pam.22662

A. Markov, Z. Seleznyova, and V. Lapshin, “Credit scoring methods: Latest trends and points to consider,” J. Finance Data Sci., vol. 8, pp. 180–201, 2022. DOI: https://doi.org/10.1016/j.jfds.2022.07.002

J. N. Semendawai, D. Stiawan, and I. Pahendra, “Shellcode Classification with Machine Learning Based on Binary Classification.,” J. Indones. Sos. Teknol., vol. 6, no. 2, 2025. DOI: https://doi.org/10.59141/jist.v6i2.3233

Evaluating Trade-offs Between Error Rates in Machine Learning Credit Scoring Models

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

How to Cite

IssueDate

RightSideBlock

Latest publications