Identifying heterogeneity for increasing the prediction accuracy of machine learning models
Keywords:
Machine learning, Agriculture, Variable Selection, seaweed, heterogeneityAbstract
In recent years, the significance of machine learning in agriculture has surged, particularly in post-harvest monitoring for sustainable aquaculture. Challenges like heterogeneity, irrelevant variables and multicollinearity hinder the implementation of smart monitoring systems. However, this study focuses on investigating heterogeneity among drying parameters that determine the moisture content removal during seaweed drying due to its limited attention, particularly within the field of agriculture. Additionally, a heterogeneity model within machine learning algorithms is proposed to enhance accuracy in predicting seaweed moisture content removal, both before and after the removal of heterogeneity parameters and also after the inclusion of single-eliminated heterogeneity parameters. The dataset consists of 1914 observations with 29 independent variables, but this study narrows down to five: Temperature (T1, T4, T7), Humidity (H5), and Solar Radiation (PY). These variables are interacted up to second-order interactions, resulting in 55 variables. Variance inflation factor and boxplots are employed to identify heterogeneity parameters. Two predictive machine learning models, namely random forest and elastic net are then utilized to identify the 15 and 20 highest important parameters for seaweed moisture content removal. Evaluation metrics (MSE, SSE, MAPE, and R-squared) are used to assess model performance. Results demonstrate that the random forest model outperforms the elastic net model in terms of higher accuracy and lower error, both before and after removing heterogeneity parameters, and even after reintroducing single-eliminated heterogeneity parameters. Notably, the random forest model exhibits higher accuracy before excluding heterogeneity parameters.
Published
How to Cite
Issue
Section
Copyright (c) 2024 Paavithashnee Ravi Kumar, Majid Khan Majahar Ali, Olayemi Joshua Ibidoja

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- A. Abdulrahim, M. D Shehu, E Yisa, Z. A. Ishaq, Mathematical Models and Comparative Analysis for Rice and Soya Bean Irrigation Crop Water Needs: A Case Study of Bida Basin Niger State, Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- E. C. Duru, M. C. Anyanwu , T. N. Nnamani , C. N. Nwosu, G. C. E. Mbah, Semi-analytical and numerical simulation of a coinfection model of Malaria and Zika virus disease , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- J. A. Adebisi, O. M. Babatunde, Green Information and Communication Technologies Implementation in Textile Industry Using Multicriteria Method , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 2, May 2022
- A. E. Ibor, D. O. Egete, A. O. Otiko, D. U. Ashishie, Detecting network intrusions in cyber-physical systems using deep autoencoder-based dimensionality reduction approach anddeep neural networks , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- O. M. Ogunlaran, M. A. Kehinde, M. A. Akanbi, E. I. AKINOLA, A Chebyshev polynomial based block integrator for the direct numerical solution of fourth order ordinary differential equations , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 2, May 2024
- A. E. Ajetunmobi, A. O. Musthapha, I. C. Okeyode, A. M. Gbadebo, D. Al-Azmi, T. W. David, Assessing the need for radiation protection measures in artisanal and small scale mining of tantalite in Oke-Ogun, Oyo State, Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 3, August 2022
- F. O. Aweda, J. A. Akinpelu, T. K. Samson, M. Sanni, B. S. Olatinwo, Modeling and Forecasting Selected Meteorological Parameters for the Environmental Awareness in Sub-Sahel West Africa Stations , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 3, August 2022
- M. E. Khan, C. E. Elum, A. O. Ijeomah, P. J. Ameji, I. G. Osigbemhe, E. E. Etim, J. V. Anyam, A. Abel, C. T. Agber, Isolation, Characterization, Antimicrobial and Theoretical Investigation of Some Bioactive Compounds Obtained from the Bulbs of Calotropisprocera , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 3, August 2023
- A. Murali, K. Muthunagai, Some theorems on fixed points in bi-complex valued metric spaces with an application to integral equations , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 2, May 2024
- Kehinde Sanni, Adeshola Dauda Adediran, Aliu Olaniyi Tajudeen, Numerical investigation of nonlinear radiative flux of non-Newtonian MHD fluid induced by nonlinear driven multi-physical curved mechanism with variable magnetic field , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 3, August 2023
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- O. J. Ibidoja, F. P. Shan, Mukhtar, J. Sulaiman, M. K. M. Ali, Robust M-estimators and Machine Learning Algorithms for Improving the Predictive Accuracy of Seaweed Contaminated Big Data , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 1, February 2023
- Xiaojie Zhou, Majid Khan Majahar Ali, Farah Aini Abdullah, Lili Wu, Ying Tian, Tao Li, Kaihui Li, Air quality prediction enhanced by a CNN-LSTM-Attention model optimized with an advanced dung beetle algorithm , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Shaymaa Mohammed Ahmed, Majid Khan Majahar Ali, Raja Aqib Shamim, Integrating robust feature selection with deep learning for ultra-high-dimensional survival analysis in renal cell carcinoma , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Nahid Salma, Majid Khan Majahar Ali, Raja Aqib Shamim, Machine learning-based feature selection for ultra-high-dimensional survival data: a computational approach , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Raja Aqib Shamim, Majid Khan Majahar Ali, Optimizing discrete dutch auctions with time considerations: a strategic approach for lognormal valuation distributions , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- Shaymaa Mohammed Ahmed, Majid Khan Majahar Ali, Arshad Hameed Hasan, Evaluating feature selection methods in a hybrid Weibull Freund-Cox proportional hazards model for renal cell carcinoma , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Chuchu Liang, Majid Khan Majahar Ali, Lili Wu, A novel multi-class classification method for arrhythmias using Hankel dynamic mode decomposition and long short-term memory networks , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Ibrahim Adamu Mohammed, Majid Khan Majahar Ali, Sani Rabiu, Raja Aqib Shamim, Shahida Shahnawaz, Development and validation of hybrid drying kinetics models with finite element method integration for black paper in a v-groove solar dryer , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- xiaojie zhou, Majid Khan Majahar Ali, Farah Aini Abdullah, Lili Wu, Ying Tian, Tao Li, Kaihui Li, Implementing a dung beetle optimization algorithm enhanced with multi-strategy fusion techniques , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Raja Aqib Shamim, Majid Khan Majahar Ali, Mohamed Farouk Haashir bin Hamdullah, Computational optimization of auctioneer revenue in modified discrete Dutch auctions with cara risk preferences , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026 (In Progress)

