Robust hybrid algorithms for regularization and variable selection in QSAR studies
Keywords:
High dimension, QSAR, Multicollinearity, Outliers, Sparse Least trimmed squares, Random forestAbstract
This study introduces a robust hybrid sparse learning approach for regularization and variable selection. This approach comprises two distinct steps. In the initial step, we segment the original dataset into separate training and test sets and standardize the training data using its mean and standard deviation. We then employ either the LASSO or sparse LTS algorithm to analyze the training set, facilitating the selection of variables with non-zero coefficients as essential features for the new dataset. Secondly, the new dataset is divided into training and test sets. The training set is further divided into k folds and evaluated using a combination of Random Forest, Ridge, Lasso, and Support Vector Regression machine learning algorithms. We introduce novel hybrid methods and juxtapose their performance against existing techniques. To validate the efficacy of our proposed methods, we conduct a comprehensive simulation study and apply them to a real-life QSAR analysis. The findings unequivocally demonstrate the superior performance of our proposed estimator, with particular distinction accorded to SLTS+LASSO. In summary, the twostep robust hybrid sparse learning approach offers an effective regularization and variable selection applicable to a wide spectrum of real-world problems.
Published
How to Cite
Issue
Section
Copyright (c) 2023 Adewale F. Lukman, Christian N. Nwaeme

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- David Opeoluwa Oyewola, Emmanuel Gbenga Dada, Juliana Ngozi ndunagu, Terrang Abubakar Umar, Akinwunmi S.A, COVID-19 Risk Factors, Economic Factors, and Epidemiological Factors nexus on Economic Impact: Machine Learning and Structural Equation Modelling Approaches , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- Oluwayemisi Oyeronke Alaba, B. M. Golam Kibria, The Efficiency of the K-L Estimator for the Seemingly Unrelated Regression Model: Simulation and Application , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 3, August 2023
- Omodele Olubi, Ebeneze Oniya, Taoreed Owolabi, Development of Predictive Model for Radon-222 Estimation in the Atmosphere using Stepwise Regression and Grid Search Based-Random Forest Regression , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 2, May 2021
- L. G. Salaudeen, D. GABI, M. Garba, H. U. Suru, Deep convolutional neural network based synthetic minority over sampling technique: a forfending model for fraudulent credit card transactions in financial institution , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 2, May 2024
- Raphael Ozighor Enihe, Rajesh Prasad, Francisca Nonyelum Ogwueleka, Fatimah Binta Abdullahi, The effect of imbalance data mitigation techniques on cardiovascular disease prediction , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Christopher Ifeanyi Eke, Kholoud Maswadi, Musa Phiri, Mulenga Mwege, Mohammad Imran, Dekera Kenneth Kwaghtyo, Akeremale Olusola Collins, Effective tweets classification for disaster crisis based on ensemble of classifiers , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- V Umarani, A Julian, J Deepa, Sentiment Analysis using various Machine Learning and Deep Learning Techniques , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- Catherine N. Ogbizi-Ugbe, Osowomuabe Njama-Abang, Samuel Oladimeji, Idongetsit E. Eteng, Edim A. Emanuel, Synergistic intelligence: a novel hybrid model for precision agriculture using k-means, naive Bayes, and knowledge graphs , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026
- Emmanuel Gbenga Dada, Aishatu Ibrahim Birma, Abdulkarim Abbas Gora, Ensemble machine learning algorithm for cost-effective and timely detection of diabetes in Maiduguri, Borno State , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 4, November 2024
- Emmanuel P. Agbo, Golden C. Offorson, Abubakar S. Yusuf, John O. Bassey, Moses A. Okono, Ugochukwu Nkajoe, Patrick O. Ushie, Innovative trend analysis of precipitation changes over Nigeria: A case study of locations across the Niger and Benue Rivers , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Segun L. Jegede, Adewale F. Lukman, Kayode Ayinde, Kehinde A. Odeniyi, Jackknife Kibria-Lukman M-Estimator: Simulation and Application , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 2, May 2022

