Addressing class imbalance in lassa fever epidemic data, using machine learning: a case study with SMOTE and random forest
Keywords:
Lassa fever, Machine learning, SMOTE, Random forest, Class imbalanceAbstract
Class imbalance in epidemiological datasets, particularly for rare outcomes like Lassa Fever fatalities, complicates predictive modeling. This study addresses the issue by employing SMOTE to rebalance the dataset and Random Forest for classification while identifying significant predictors such as age, symptom severity, and residence. SMOTE successfully balanced the dataset (minority class recall improved from 0.60 to 1.00 in Random Forest), mitigating the bias toward majority classes. Without SMOTE, models including Random Forest, XGBoost, and LightGBM achieved high accuracy (> 99%) but demonstrated poor minority recall (?0.75), confirming the challenge of imbalanced data. Post-SMOTE balancing, these models achieved 100% accuracy, precision, recall, and F1-scores across major classes. Notably, the hybrid ensemble model further enhanced outcomes, achieving an F1-score of 0.80 for the rarest class. These results underscore the superiority of SMOTE in improving classification for underrepresented outcomes compared to reliance on Random Forest alone, demonstrating its value in developing equitable predictive tools for outbreak management.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Osowomuabe Njama-Abang, Denis U. Ashishie, Paul T. Bukie

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Kazeem A. Tijani, Chinwendu E. Madubueze, Reuben I. Gweryina, Typhoid fever dynamical model with cost-effective optimalcontrol , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Retraction Notice: Fractional-order modeling of visceral leishmaniasis disease transmission dynamics: strategies in eastern Sudan , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 2, May 2026 (In Progress)
- Maduabuchi Gabriel Orakwelu, Olumuyiwa Otegbeye, Hermane Mambili-Mamboundou, A class of single-step hybrid block methods with equally spaced points for general third-order ordinary differential equations , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Solomon A. Ayuba, I. Akeyede, A. S. Olagunju, Stability and Sensitivity Analysis of Dengue-Malaria Co-Infection Model in Endemic Stage , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 2, May 2021
- O. G. Obadina, Adedayo Funmi Adedotuun, O. A. Odusanya, Ridge Estimation's Effectiveness for Multiple Linear Regression with Multicollinearity: An Investigation Using Monte-Carlo Simulations , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- Afis Saliu, Semiu Oladipupo Oladejo, On Lemniscate of Bernoulli of q-Janowski type , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 4, November 2022
- Umaru Hassan, Mohd Tahir Ismail, Improving forecasting accuracy using quantile regression neural network combined with unrestricted mixed data sampling , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Ghada A. Ahmed, Fractional-order modeling of visceral leishmaniasis disease transmission dynamics : strategies in eastern Sudan , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Abdullahi Moyosore, Haslina Ahmad, Muhammad Alif Muhammad Latif, Mostafa Yousefzadeh Borzehandani, Mohd Basyaruddin AbdulRahman, Emilia Abdelmalek, Carbon (IV) oxide adsorption efficiency of functionalized HKUST-1, IRMF-1, and UiO-66 metal organic frameworks , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 1, February 2024
- Fathelrhman EL Guma, Ossama M. Badawy, Mohammed Berir, Mohamed A. Abdoon, Numerical Analysis of Fractional-Order Dynamic Dengue Disease Epidemic in Sudan , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 2, May 2023
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Paul Tawo Bukie, Idongesit E. Eteng, Eyo E. Essien, Development of internet of things-based petroleum pipeline topology leak monitoring and detection system using sensors , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Catherine N. Ogbizi-Ugbe, Osowomuabe Njama-Abang, Samuel Oladimeji, Idongetsit E. Eteng, Edim A. Emanuel, Synergistic intelligence: a novel hybrid model for precision agriculture using k-means, naive Bayes, and knowledge graphs , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026

