Robust hybrid algorithms for regularization and variable selection in QSAR studies
Keywords:
High dimension, QSAR, Multicollinearity, Outliers, Sparse Least trimmed squares, Random forestAbstract
This study introduces a robust hybrid sparse learning approach for regularization and variable selection. This approach comprises two distinct steps. In the initial step, we segment the original dataset into separate training and test sets and standardize the training data using its mean and standard deviation. We then employ either the LASSO or sparse LTS algorithm to analyze the training set, facilitating the selection of variables with non-zero coefficients as essential features for the new dataset. Secondly, the new dataset is divided into training and test sets. The training set is further divided into k folds and evaluated using a combination of Random Forest, Ridge, Lasso, and Support Vector Regression machine learning algorithms. We introduce novel hybrid methods and juxtapose their performance against existing techniques. To validate the efficacy of our proposed methods, we conduct a comprehensive simulation study and apply them to a real-life QSAR analysis. The findings unequivocally demonstrate the superior performance of our proposed estimator, with particular distinction accorded to SLTS+LASSO. In summary, the twostep robust hybrid sparse learning approach offers an effective regularization and variable selection applicable to a wide spectrum of real-world problems.
Published
How to Cite
Issue
Section
Copyright (c) 2023 Adewale F. Lukman, Christian N. Nwaeme

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Muteeu A. Olopade, Anthony B. Adegboyega, Kayode I. Ogungbemi, Adeyinka D. Adewoyin, Investigation of the behaviour of tunable chalcogenide-Bismuth based perovskite BiTl (SxSe1-x)3(X = 0, 0.33, 0.67, 1): first principles calculations , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- N. Kure, H. I. Daniel, C. G. Afuwai, E. J. Adoyi, I. A. Bello, The Delineation of Groundwater and Geotechnical Parameters within Marmara Area of Chikun Local Government of Kaduna State, Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 1, Issue 1, February 2019
- E. O. Echeweozo, C. I. Nworie, A. O. Ojobeagu, P. B. Otah, I. J. Okoro, Health risk assessment due to environmental radioactivity and heavy metal contamination at the central solid waste dumpsite in Ebonyi State, Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- K. O. Eberendu , J. I. Iheanyichukwu, O. M. Mac-kalunta, C. I. Nwankwo, I. E. Otuokere, J. C. Nnaji, Zn(II) and Fe(II) complexes of 2,4-dinitro-N-[(Z)-[(E)-3-(2-nitrophenyl)prop-2-enylidene] amino] aniline: synthesis, characterization and In Silico SARS-CoV-2 inhibition studies , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- James Andrawus, Kayode Isaac Omotoso, Agada Apeh Andrew, Felix Yakubu Eguda, Sunday Babuba, Kabiru Garba Ibrahim, Mathematical model analysis on the significance of surveillance and awareness on the transmission dynamics of diphtheria , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- B. N. Hikon, G. G. Yebpella, L. Jafiya, S. Ayuba, Preliminary Investigation of Microplastic as a Vector for Heavy Metals in Bye-ma Salt Mine, Wukari, Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 3, August 2021
- Olugbenga Oludayo Oluwasina, Analysis of Adenanthera pavonine L. (Febaceae) Pod and Seed as Potential Pyrolysis Feedstock for Energy production , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 2, May 2022
- Adewunmi O. Adeyemi, Ismail A. Adeleke, Eno E. E. Akarawak, Modeling Extreme Stochastic Variations using the Maximum Order Statistics of Convoluted Distributions , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 1, February 2023
- Idongesit E. Eteng, Udeze L. Chinedu, Ayei E. Ibor, A stacked ensemble approach with resampling techniques for highly effective fraud detection in imbalanced datasets , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- Mahesh Kumar Singh, Pushpa Choudhary, Arun Kumar Singh, Pushpendra Singh, LWRNPIP: Design of a light weight restrictive non-fungible token based on practically unclonable functions via image signature patterns , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Segun L. Jegede, Adewale F. Lukman, Kayode Ayinde, Kehinde A. Odeniyi, Jackknife Kibria-Lukman M-Estimator: Simulation and Application , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 2, May 2022

