Robust M-estimators and Machine Learning Algorithms for Improving the Predictive Accuracy of Seaweed Contaminated Big Data
Keywords:
Robust method, Hybrid model, Machine learning, Outliers, Big dataAbstract
A common problem in regression analysis using ordinary least squares (OLS) is the effect of outliers or contaminated data on the estimates of the parameters. A robust method that is not sensitive to outliers and can handle contaminated data is needed. In this study, the objective is to determine the significant parameters that determine the moisture content of the seaweed after drying and develop a hybrid model to reduce the outliers. The data were collected with sensors from the v-Groove Hybrid Solar Drier (v-GHSD) at Semporna, South-Eastern Coast of Sabah, Malaysia. After the second order interaction, we have 435 drying parameters, each parameter has 1914 observations. First, we used four machine learning algorithms, such as random forest, support vector machine, bagging and boosting to determine the significant parameters by selecting 15, 25, 35 and 45 parameters. Second, we developed the hybrid model using robust methods such as M. Bi-Square, M. Hampel and M. Huber. The results show that there is a significant improvement in the reduction of the number of outliers and better prediction using hybrid model for the contaminated seaweed big data. For the highest variable importance of 45 significant drying parameters of seaweed, the hybrid model bagging M Bi-square performs better because it has the lowest percentage of outliers of 4.08 %.
Published
How to Cite
Issue
Section
Copyright (c) 2023 O. J. Ibidoja, F. P. Shan, Mukhtar, J. Sulaiman, M. K. M. Ali

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Nour Hamad Abu Afouna, Majid Khan Majahar Ali, Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- Shaymaa Mohammed Ahmed, Majid Khan Majahar Ali, Raja Aqib Shamim, Integrating robust feature selection with deep learning for ultra-high-dimensional survival analysis in renal cell carcinoma , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Akila Dabara Kayit, Mohd Tahir Ismail, Novel way to predict stock movements using multiple models and comprehensive analysis: leveraging voting meta-ensemble techniques , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- Christian N. Nwaeme, Adewale F. Lukman, Robust hybrid algorithms for regularization and variable selection in QSAR studies , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Oluwaseun IGE, Keng Hoon Gan, Ensemble feature selection using weighted concatenated voting for text classification , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 1, February 2024
- Majid Khan Bin Majahar Ali, Shahida Shahnawaz, An inverse physics-informed neural network (I-PINN) framework for parameter estimation in mixed convection and melting effects , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 2, May 2026
- A. K. Usman, Y. A. Hassan, A. A. Bery, A. S. Akingboye, M. D. Dick, B. M. Ahmed, R. O. Aderoju, Hybrid deep belief network and fuzzy clustering approach for geothermal prospectivity mapping in northeastern Nigeria using magnetic and landsat data , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026
- Nahid Salma, Majid Khan Majahar Ali, Raja Aqib Shamim, Machine learning-based feature selection for ultra-high-dimensional survival data: a computational approach , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Umaru Hassan, Mohd Tahir Ismail, Improving forecasting accuracy using quantile regression neural network combined with unrestricted mixed data sampling , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 4, November 2023
- Gabriel James, Ifeoma Ohaeri, David Egete, John Odey, Samuel Oyong, Enefiok Etuk, Imeh Umoren, Ubong Etuk, Aloysius Akpanobong, Anietie Ekong, Saviour Inyang, Chikodili Orazulume, A fuzzy-optimized multi-level random forest (FOMRF) model for the classification of the impact of technostress , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Xiaojie Zhou, Majid Khan Majahar Ali, Farah Aini Abdullah, Lili Wu, Ying Tian, Tao Li, Kaihui Li, Air quality prediction enhanced by a CNN-LSTM-Attention model optimized with an advanced dung beetle algorithm , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Shaymaa Mohammed Ahmed, Majid Khan Majahar Ali, Raja Aqib Shamim, Integrating robust feature selection with deep learning for ultra-high-dimensional survival analysis in renal cell carcinoma , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Paavithashnee Ravi Kumar, Majid Khan Majahar Ali, Olayemi Joshua Ibidoja, Identifying heterogeneity for increasing the prediction accuracy of machine learning models , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- Ibrahim Adamu Mohammed, Majid Khan Majahar Ali, Sani Rabiu, Raja Aqib Shamim, Shahida Shahnawaz, Development and validation of hybrid drying kinetics models with finite element method integration for black paper in a v-groove solar dryer , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Raja Aqib Shamim, Majid Khan Majahar Ali, Optimizing discrete dutch auctions with time considerations: a strategic approach for lognormal valuation distributions , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- Nahid Salma, Majid Khan Majahar Ali, Raja Aqib Shamim, Machine learning-based feature selection for ultra-high-dimensional survival data: a computational approach , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Shaymaa Mohammed Ahmed, Majid Khan Majahar Ali, Arshad Hameed Hasan, Evaluating feature selection methods in a hybrid Weibull Freund-Cox proportional hazards model for renal cell carcinoma , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Chuchu Liang, Majid Khan Majahar Ali, Lili Wu, A novel multi-class classification method for arrhythmias using Hankel dynamic mode decomposition and long short-term memory networks , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Raja Aqib Shamim, Majid Khan Majahar Ali, Mohamed Farouk Haashir bin Hamdullah, Computational optimization of auctioneer revenue in modified discrete Dutch auctions with cara risk preferences , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026
- xiaojie zhou, Majid Khan Majahar Ali, Farah Aini Abdullah, Lili Wu, Ying Tian, Tao Li, Kaihui Li, Implementing a dung beetle optimization algorithm enhanced with multi-strategy fusion techniques , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025

