Addressing class imbalance in lassa fever epidemic data, using machine learning: a case study with SMOTE and random forest
Keywords:
Lassa fever, Machine learning, SMOTE, Random forest, Class imbalanceAbstract
Class imbalance in epidemiological datasets, particularly for rare outcomes like Lassa Fever fatalities, complicates predictive modeling. This study addresses the issue by employing SMOTE to rebalance the dataset and Random Forest for classification while identifying significant predictors such as age, symptom severity, and residence. SMOTE successfully balanced the dataset (minority class recall improved from 0.60 to 1.00 in Random Forest), mitigating the bias toward majority classes. Without SMOTE, models including Random Forest, XGBoost, and LightGBM achieved high accuracy (> 99%) but demonstrated poor minority recall (?0.75), confirming the challenge of imbalanced data. Post-SMOTE balancing, these models achieved 100% accuracy, precision, recall, and F1-scores across major classes. Notably, the hybrid ensemble model further enhanced outcomes, achieving an F1-score of 0.80 for the rarest class. These results underscore the superiority of SMOTE in improving classification for underrepresented outcomes compared to reliance on Random Forest alone, demonstrating its value in developing equitable predictive tools for outbreak management.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Osowomuabe Njama-Abang, Denis U. Ashishie, Paul T. Bukie

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Silifat Adaramaja Abdulraheem, Salisu Aliyu, Fatima Binta Abdullahi, Hyper-parameter tuning for support vector machine using an improved cat swarm optimization algorithm , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Emmanuel Gbenga Dada, Aishatu Ibrahim Birma, Abdulkarim Abbas Gora, Ensemble machine learning algorithm for cost-effective and timely detection of diabetes in Maiduguri, Borno State , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 4, November 2024
- Olumide S. Adesina, Adedayo F. Adedotuun, Kayode S. Adekeye, Ogbu F. Imaga, Adeleke J. Adeyiga, Toluwalase J. Akingbade, On logistic regression versus support vectors machine using vaccination dataset , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 1, February 2024
- Catherine N. Ogbizi-Ugbe, Osowomuabe Njama-Abang, Samuel Oladimeji, Idongetsit E. Eteng, Edim A. Emanuel, Synergistic intelligence: a novel hybrid model for precision agriculture using k-means, naive Bayes, and knowledge graphs , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026
- Oluwaseun IGE, Keng Hoon Gan, Ensemble feature selection using weighted concatenated voting for text classification , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 1, February 2024
- Akila Dabara Kayit, Mohd Tahir Ismail, Novel way to predict stock movements using multiple models and comprehensive analysis: leveraging voting meta-ensemble techniques , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- S. I. Ele, U. R. Alo, H. F. Nweke, A. H. Okemiri, E. O. Uche-Nwachi, Deep convolutional neural network (DCNN)-based model for pneumonia detection using chest x-ray images , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Nahid Salma, Majid Khan Majahar Ali, Raja Aqib Shamim, Machine learning-based feature selection for ultra-high-dimensional survival data: a computational approach , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- O. E. Ojo, A. Gelbukh, H. Calvo, O. O. Adebanji, Performance Study of N-grams in the Analysis of Sentiments , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- Unyime Ufok Ibekwe, Uche M. Mbanaso, Nwojo Agwu Nnanna, Umar Adam Ibrahim, A machine learning sentiment classification of factors that shape trust in smart contracts , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Paul Tawo Bukie, Idongesit E. Eteng, Eyo E. Essien, Development of internet of things-based petroleum pipeline topology leak monitoring and detection system using sensors , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Catherine N. Ogbizi-Ugbe, Osowomuabe Njama-Abang, Samuel Oladimeji, Idongetsit E. Eteng, Edim A. Emanuel, Synergistic intelligence: a novel hybrid model for precision agriculture using k-means, naive Bayes, and knowledge graphs , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026

