Effective tweets classification for disaster crisis based on ensemble of classifiers
Keywords:
Disaster Crisis Management, Social Media Analytics, Twitter, Machine Learning Classifiers, Ensemble Methods, Feature ExtractionAbstract
In the field of disaster management, social media analytics has gained significant recognition. Social media platforms, particularly Twitter, have become an invaluable source for disseminating information during disasters, offering real-time updates on events, crisis reports, and casualty information. However, the deluge of information on social media can also be overwhelming, with a substantial amount of irrelevant content. To address this challenge, researchers leverage machine learning (ML) classifiers to automatically categorize disaster-related tweets. However, ML classifiers, while being effective, also face issues such as overfitting and class imbalance. This study proposes an ensemble-based approach that integrates a variety of linguistic and word embedding features, including Parts-Of-Speech (POS), hashtags, Term Frequency-Inverse Document Frequency (TF-IDF), GloVe, Word2Vec, and BERT. A range of supervised learning algorithms like Decision Trees, Logistic Regression, Support Vector Machines, and Random Forests, were evaluated individually and as part of ensemble methods like AdaBoost, Bagging, and Random Subspace. The results show that combining TF-IDF with word embeddings and using the AdaBoost ensemble model yields superior performance, achieving a classification accuracy of 98.92%. This represents a notable improvement over the conventional standalone classifiers and highlights the advantage of ensemble methods in enhancing model robustness and minimizing overfitting. The proposed approach demonstrates not only high predictive capacity but also scalability for real-time tweet filtering during emergencies. In addition to demonstrating the efficacy of ensemble methods in disaster tweet classification, this study also provides valuable insights for improving social media-based crisis response. It also establishes a foundation for future research, particularly in multi-lingual and multi-disaster scenarios.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Christopher Ifeanyi Eke, Kholoud Maswadi, Musa Phiri, Mulenga Mwege, Mohammad Imran, Dekera Kenneth Kwaghtyo, Akeremale Olusola Collins

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Silifat Adaramaja Abdulraheem, Salisu Aliyu, Fatima Binta Abdullahi, Hyper-parameter tuning for support vector machine using an improved cat swarm optimization algorithm , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Emmanuel Gbenga Dada, Aishatu Ibrahim Birma, Abdulkarim Abbas Gora, Ensemble machine learning algorithm for cost-effective and timely detection of diabetes in Maiduguri, Borno State , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 4, November 2024
- L. G. Salaudeen, D. GABI, M. Garba, H. U. Suru, Deep convolutional neural network based synthetic minority over sampling technique: a forfending model for fraudulent credit card transactions in financial institution , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 2, May 2024
- Osowomuabe Njama-Abang, Denis U. Ashishie, Paul T. Bukie, Addressing class imbalance in lassa fever epidemic data, using machine learning: a case study with SMOTE and random forest , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- A. K. Usman, Y. A. Hassan, A. A. Bery, A. S. Akingboye, M. D. Dick, B. M. Ahmed, R. O. Aderoju, Hybrid deep belief network and fuzzy clustering approach for geothermal prospectivity mapping in northeastern Nigeria using magnetic and landsat data , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026
- Philemon Uten Emmoh, Christopher Ifeanyi Eke, Timothy Moses, A feature selection and scoring scheme for dimensionality reduction in a machine learning task , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- O. E. Ojo, A. Gelbukh, H. Calvo, O. O. Adebanji, Performance Study of N-grams in the Analysis of Sentiments , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- Gabriel James, Anietie Ekong, Etimbuk Abraham, Enobong Oduobuk, Peace Okafor, Analysis of support vector machine and random forest models for predicting the scalability of a broadband network , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- Gabriel James, Ime Umoren, Anietie Ekong, Saviour Inyang, Oscar Aloysius, Analysis of support vector machine and random forest models for classification of the impact of technostress in covid and post-covid era , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- Unyime Ufok Ibekwe, Uche M. Mbanaso, Nwojo Agwu Nnanna, Umar Adam Ibrahim, A machine learning sentiment classification of factors that shape trust in smart contracts , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
You may also start an advanced similarity search for this article.

