Integrated data-driven credit default prediction in Uganda using machine learning models
Keywords:
SVM, XGBoost, Credit defaulter, Prediction model, Financial dataAbstract
The prediction of credit facility defaulters is quite a challenge in Uganda, particularly for those without a formal banking history. Existing prediction models cater for prediction using conventional banking records which is not sufficient. The use of integrated data to cater for the unbanked population is required to further enhance financial inclusivity and stability in Uganda’s financial landscape. This study therefore aims at filling this gap by using machine learning techniques on a rich blend of financial data, including mobile money and Fintech (Financial Technology) services, as well as traditional banking records. Several machine learning algorithms used for loan default prediction were compared, such as Random Forest, Logistic Regression, Support Vector Machines (SVM), and Extreme Gradient Boosting (XGBoost). Random Forest Model showed 96.66% accuracy, 79.65% recall, 96.52% precision and 0.85 AUC. XGBoost model was found to have an accuracy of 95.23%; recall, 73.32%; precision, 94.11%; and Area Under the Curve (AUC) of 0.81. However, Random Forest performed best by all metrics with XGBoost following slightly. Logistic Regression showed 89.53% accuracy but had a very low recall at 43.24% and precision at 66.59%. SVM performed averagely with 93.21% accuracy and 62.80% recall all falling below that of XGBoost and Random Forest. Thus, the study revealed the significance of machine learning models like Random Forest and XGBoost for credit scoring prediction. Overall, it will improve the ability of institutions and policymakers to identify potential default borrowers so as to mitigate loan default rates and ensure economic growth in underserved communities through more accurate and inclusive credit evaluation tools.
Published
How to Cite
Issue
Section
Copyright (c) 2025 George Muddu, Shefiu Olusegun Ganiyu, Adekunle Olugbenga Ejidokun, Yusuf Abass Aleshinloye (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Chinedu L. Udeze, Idongesit E. Eteng, Ayei E. Ibor, Application of Machine Learning and Resampling Techniques to Credit Card Fraud Detection , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 3, August 2022
- Sherifdeen O. Bolarinwa, Eli Danladi, Andrew Ichoja, Muhammad Y. Onimisia, Christopher U. Achem, Synergistic Study of Reduced Graphene Oxide as Interfacial Buffer Layer in HTL-free Perovskite Solar Cells with Carbon Electrode , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 3, August 2022
- Olumide S. Adesina, Adedayo F. Adedotuun, Kayode S. Adekeye, Ogbu F. Imaga, Adeleke J. Adeyiga, Toluwalase J. Akingbade, On logistic regression versus support vectors machine using vaccination dataset , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 1, February 2024
- Raphael Ozighor Enihe, Rajesh Prasad, Francisca Nonyelum Ogwueleka, Fatimah Binta Abdullahi, The effect of imbalance data mitigation techniques on cardiovascular disease prediction , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- L. G. Salaudeen, D. GABI, M. Garba, H. U. Suru, Deep convolutional neural network based synthetic minority over sampling technique: a forfending model for fraudulent credit card transactions in financial institution , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 2, May 2024
- Akila Dabara Kayit, Mohd Tahir Ismail, Novel way to predict stock movements using multiple models and comprehensive analysis: leveraging voting meta-ensemble techniques , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- Umaru Hassan, Mohd Tahir Ismail, Improving forecasting accuracy using quantile regression neural network combined with unrestricted mixed data sampling , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Gabriel James, Anietie Ekong, Etimbuk Abraham, Enobong Oduobuk, Peace Okafor, Analysis of support vector machine and random forest models for predicting the scalability of a broadband network , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- F. U. Salifu, O. A. Oladipo, E. O. Ebock, B. Nava, Deep neural network model for vertical total electron content prediction at a single low latitude station , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- O. J. Ibidoja, F. P. Shan, Mukhtar, J. Sulaiman, M. K. M. Ali, Robust M-estimators and Machine Learning Algorithms for Improving the Predictive Accuracy of Seaweed Contaminated Big Data , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 1, February 2023
You may also start an advanced similarity search for this article.

