Addressing class imbalance in lassa fever epidemic data, using machine learning: a case study with SMOTE and random forest
Keywords:
Lassa fever, Machine learning, SMOTE, Random forest, Class imbalanceAbstract
Class imbalance in epidemiological datasets, particularly for rare outcomes like Lassa Fever fatalities, complicates predictive modeling. This study addresses the issue by employing SMOTE to rebalance the dataset and Random Forest for classification while identifying significant predictors such as age, symptom severity, and residence. SMOTE successfully balanced the dataset (minority class recall improved from 0.60 to 1.00 in Random Forest), mitigating the bias toward majority classes. Without SMOTE, models including Random Forest, XGBoost, and LightGBM achieved high accuracy (> 99%) but demonstrated poor minority recall (?0.75), confirming the challenge of imbalanced data. Post-SMOTE balancing, these models achieved 100% accuracy, precision, recall, and F1-scores across major classes. Notably, the hybrid ensemble model further enhanced outcomes, achieving an F1-score of 0.80 for the rarest class. These results underscore the superiority of SMOTE in improving classification for underrepresented outcomes compared to reliance on Random Forest alone, demonstrating its value in developing equitable predictive tools for outbreak management.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Osowomuabe Njama-Abang, Denis U. Ashishie, Paul T. Bukie

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Muhammad Dahiru Liman, Salamatu Ibrahim Osanga, Esther Samuel Alu, Sa'adu Zakariya, Regularization Effects in Deep Learning Architecture , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 2, May 2024
- Hamza Abubakar, Abdu Sagir Masanawa, Surajo Yusuf, G. I. Boaku, Optimal representation to High Order Random Boolean kSatisability via Election Algorithm as Heuristic Search Approach in Hopeld Neural Networks , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 3, August 2021
- Atiek Iriany, Wigbertus Ngabu, Henny Pramoedyo, Amarifai, Geographically weighted regression random forest for modeling soil particles , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 2, May 2026 (In Progress)
- Umaru C. Obini, Chukwu Jeremiah, Sylvester A. Igwe, Development of a machine learning based fileless malware filter system for cyber-security , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 4, November 2024
- P. O. Odion, M. N. Musa, S. U. Shuaibu, Age Prediction from Sclera Images using Deep Learning , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 3, August 2022
- Emmanuel C. Ukekwe, Adaora A. Obayi, Akpa Johnson, Daniel A. Musa, Jonathan C. Agbo, Optimizing data and voice service delivery for mobile phones based on clients' demand and location using affinity propagation machine learning , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Nour Hamad Abu Afouna, Majid Khan Majahar Ali, Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- Constantin Falk, Tarek El Ghayed , Ron van de Sand, Jörg Reiff-Stephan, A Data-Driven Approach Towards the Application of Reinforcement Learning Based HVAC Control , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 1, February 2023
- Nneka Ernestina Richard-Nnabu, Chinagolum Ituma, Henry Friday Nweke, Convolutional neural networks method for folded naira currency denominations recognition and analysis , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 4, November 2024
- Chuchu Liang, Majid Khan Majahar Ali, Lili Wu, A novel multi-class classification method for arrhythmias using Hankel dynamic mode decomposition and long short-term memory networks , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Paul Tawo Bukie, Idongesit E. Eteng, Eyo E. Essien, Development of internet of things-based petroleum pipeline topology leak monitoring and detection system using sensors , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Catherine N. Ogbizi-Ugbe, Osowomuabe Njama-Abang, Samuel Oladimeji, Idongetsit E. Eteng, Edim A. Emanuel, Synergistic intelligence: a novel hybrid model for precision agriculture using k-means, naive Bayes, and knowledge graphs , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026

