Addressing class imbalance in lassa fever epidemic data, using machine learning: a case study with SMOTE and random forest
Keywords:
Lassa fever, Machine learning, SMOTE, Random forest, Class imbalanceAbstract
Class imbalance in epidemiological datasets, particularly for rare outcomes like Lassa Fever fatalities, complicates predictive modeling. This study addresses the issue by employing SMOTE to rebalance the dataset and Random Forest for classification while identifying significant predictors such as age, symptom severity, and residence. SMOTE successfully balanced the dataset (minority class recall improved from 0.60 to 1.00 in Random Forest), mitigating the bias toward majority classes. Without SMOTE, models including Random Forest, XGBoost, and LightGBM achieved high accuracy (> 99%) but demonstrated poor minority recall (?0.75), confirming the challenge of imbalanced data. Post-SMOTE balancing, these models achieved 100% accuracy, precision, recall, and F1-scores across major classes. Notably, the hybrid ensemble model further enhanced outcomes, achieving an F1-score of 0.80 for the rarest class. These results underscore the superiority of SMOTE in improving classification for underrepresented outcomes compared to reliance on Random Forest alone, demonstrating its value in developing equitable predictive tools for outbreak management.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Osowomuabe Njama-Abang, Denis U. Ashishie, Paul T. Bukie

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Omowumi F. Lawal, Tunde T. Yusuf, Afeez Abidemi, On mathematical modelling of optimal control of typhoid fever with efficiency analysis , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 4, November 2024
- Shaymaa Mohammed Ahmed, Majid Khan Majahar Ali, Raja Aqib Shamim, Integrating robust feature selection with deep learning for ultra-high-dimensional survival analysis in renal cell carcinoma , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- S. N. Enemuo, O. N. Akande, M. O. Lawrence, I. C. Saidu, Optimized aspect level sentiment analysis of tweet data using deep learning and rule-based techniques , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Felix Yakubu Eguda, Andrawus James, Sunday Babuba, The Solution of a Mathematical Model for Dengue Fever Transmission Using Differential Transformation Method , Journal of the Nigerian Society of Physical Sciences: Volume 1, Issue 3, August 2019
- Gurpreet Tuteja, Tapshi Singh, Comments on “The Solution of a Mathematical Model for Dengue Fever Transmission Using Differential Transformation Method: J. Nig. Soc. Phys. Sci. 1 (2019) 82-87” , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 2, May 2021
- J. M. Orverem, Y. Haruna, B. M. Abdulhamid, M. Y. Adamu, The Use of Differential Forms to Linearize a Class of Geodesic Equations , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 4, November 2022
- Mokhtar Ali, Abdelkerim Souahlia, Abdelhalim Rabehi, Mawloud Guermoui, Ali Teta, Imad Eddine Tibermacine, Abdelaziz Rabehi, Mohamed Benghanem , A robust deep learning approach for photovoltaic power forecasting based on feature selection and variational mode decomposition , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- M. O. Ogunniran, A Class of Block Multi-derivative Numerical Techniques for Singular Advection Equations , Journal of the Nigerian Society of Physical Sciences: Volume 1, Issue 2, May 2019
- Saheed Ajao, Isaac Olopade, Titilayo Akinwumi, Sunday Adewale, Adelani Adesanya, Understanding the Transmission Dynamics and Control of HIV Infection: A Mathematical Model Approach , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 2, May 2023
- Kazeem A. Tijani, Chinwendu E. Madubueze, Reuben I. Gweryina, Typhoid fever dynamical model with cost-effective optimalcontrol , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Paul Tawo Bukie, Idongesit E. Eteng, Eyo E. Essien, Development of internet of things-based petroleum pipeline topology leak monitoring and detection system using sensors , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Catherine N. Ogbizi-Ugbe, Osowomuabe Njama-Abang, Samuel Oladimeji, Idongetsit E. Eteng, Edim A. Emanuel, Synergistic intelligence: a novel hybrid model for precision agriculture using k-means, naive Bayes, and knowledge graphs , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026

