Machine learning-based feature selection for ultra-high-dimensional survival data: a computational approach
Keywords:
Ultra-high dimension, Machine Learning, Feature Selection, Renal Cell Carcinoma, Survival DataAbstract
Ultra-high-dimensional (UHD) survival data presents significant computational challenges in biomedical research, particularly in Renal Cell Carcinoma (RCC), where genomic complexity complicates risk assessment. Effective feature selection is crucial for identifying key biomarkers that improve RCC diagnosis, prognosis, and treatment. This study evaluates machine learning (ML)-based feature selection methods to address limitations in scalability, feature redundancy, and predictive accuracy in UHD RCC survival data. Gene expression data from 4,224 differentially expressed genes across 74 individuals was analyzed using LASSO, EN, Adaptive LASSO, Group LASSO, SIS, ISIS, SCAD, and SVM. Models were assessed using Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and R² values. SCAD demonstrated the best predictive performance (MSE: 529.00, RMSE: 23.00, R²: 0.69), surpassing ISIS (R²: 0.61), SIS (R²: 0.60), and EN (R²: 0.57). LASSO and Adaptive LASSO underperformed. SCAD identified 14 key genes—NCAM1, ATP1B3, NAT8, MT2A, GTF2F2, X4197, GUCY2C, SLC3A1, CRYZ, DES, MT1L, NFYB, PRKAR2B, and CLIP1—as potential RCC biomarkers. Gene interaction network analysis confirmed their role in RCC progression. Despite SCAD’s strong performance, it left 31% of data variability unexplained, suggesting hybrid ML models that integrate ensemble learning, two-component regression structures, and deep learning-based feature selection could further enhance gene selection and predictive accuracy. This research supports SDG 3 (Good Health and Well-being) and SDG 9 (Industry, Innovation, and Infrastructure) by advancing precision medicine, early RCC detection, and biomedical data-driven innovations for improved clinical decision-making.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Nahid Salma, Majid Khan Majahar Ali, Raja Aqib Shamim

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Diva Marchandra Mulansari, Maulana Malik, Sindy Devila, Ibrahim Mohammed Sulaiman, Dian Lestari, Fevi Novkaniza, Fida Fathiyah Addini, A hybrid IFR-IDY conjugate gradient algorithm for unconstrained optimization and its application in portfolio selection , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026
- Olayinka Oluwaseun Oluwasina, Mochamad Zakki Fahmi, Olugbenga Oludayo Oluwasina, Enhancing cellulose fiber properties from chromolaena odorata and anana comosus through novel pulping chemical mixtures , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 2, May 2024
- E. O. Echeweozo, C. I. Nworie, A. O. Ojobeagu, P. B. Otah, I. J. Okoro, Health risk assessment due to environmental radioactivity and heavy metal contamination at the central solid waste dumpsite in Ebonyi State, Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Aladodo Sarafadeen Shehu, Ibrahim Bolaji Balogun, Ibrahim Yakubu Tudunwada, Variation, distribution and trends of aerosol optical properties in Africa during 2000-2022 , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- L. O. Animasahun, B. A. Taleatu, S. A. Adewinbi, H. S. Bolarinwa, A. Y. Fasasi, Synthesis of SnO2/CuO/SnO2 Multi-layered Structure for Photoabsorption: Compositional and Some Interfacial Structural Studies , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 2, May 2021
- M. A. Salawu, J. A. Gbolahan, A. B. Alabi, Assessment of Radiation Shielding Properties of Polymer-Lead (II) Oxide Composites , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- Rhoda Bernard Gusikit, Solomon Nehemiah Yusuf, Hyeladi Usman Dibal, Victor Bulus Diyelmak, Ahmed Isah Haruna, Comparative analysis of lithium enrichment mechanisms in aquifers in the Benue Trough , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Ayomide Labulo, Elijah Temitope Adesuji, Charles Ojiefoh Oseghale, Elias Emeka Elemike, Adamu Usman, Akinola Kehinde Akinola, Enock Olugbenga Dare, Effect of benzophenone on the physicochemical properties of N-CNTs synthesized from 1-ferrocenylmethyl (2-methylimidazole) catalyst , Journal of the Nigerian Society of Physical Sciences: Volume 2, Issue 4, November 2020
- S. A. Adesokan, A. A. Giwa, I. A. Bello, Removal of Trimethoprim from Water using Carbonized Wood Waste as Adsorbents , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- S. E. Shaibu, E. J. Inam, E. A. Moses, U. A. Ofon, O. K. Fatunla, C. O. Obadimu, N. D. Ibuotenang, N. O. Offiong, V. F. Ekpo, T. J. Adeoye, E. L. Udokang, D. P. Fapojuwo, Prospects of nanosorption and photocatalysis in remediation of oil spills , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 1, February 2023
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- O. J. Ibidoja, F. P. Shan, Mukhtar, J. Sulaiman, M. K. M. Ali, Robust M-estimators and Machine Learning Algorithms for Improving the Predictive Accuracy of Seaweed Contaminated Big Data , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 1, February 2023
- Xiaojie Zhou, Majid Khan Majahar Ali, Farah Aini Abdullah, Lili Wu, Ying Tian, Tao Li, Kaihui Li, Air quality prediction enhanced by a CNN-LSTM-Attention model optimized with an advanced dung beetle algorithm , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Paavithashnee Ravi Kumar, Majid Khan Majahar Ali, Olayemi Joshua Ibidoja, Identifying heterogeneity for increasing the prediction accuracy of machine learning models , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- Shaymaa Mohammed Ahmed, Majid Khan Majahar Ali, Raja Aqib Shamim, Integrating robust feature selection with deep learning for ultra-high-dimensional survival analysis in renal cell carcinoma , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Ibrahim Adamu Mohammed, Majid Khan Majahar Ali, Sani Rabiu, Raja Aqib Shamim, Shahida Shahnawaz, Development and validation of hybrid drying kinetics models with finite element method integration for black paper in a v-groove solar dryer , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 4, November 2025
- Raja Aqib Shamim, Majid Khan Majahar Ali, Optimizing discrete dutch auctions with time considerations: a strategic approach for lognormal valuation distributions , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- Shaymaa Mohammed Ahmed, Majid Khan Majahar Ali, Arshad Hameed Hasan, Evaluating feature selection methods in a hybrid Weibull Freund-Cox proportional hazards model for renal cell carcinoma , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- Raja Aqib Shamim, Majid Khan Majahar Ali, Mohamed Farouk Haashir bin Hamdullah, Computational optimization of auctioneer revenue in modified discrete Dutch auctions with cara risk preferences , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026
- Chuchu Liang, Majid Khan Majahar Ali, Lili Wu, A novel multi-class classification method for arrhythmias using Hankel dynamic mode decomposition and long short-term memory networks , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- xiaojie zhou, Majid Khan Majahar Ali, Farah Aini Abdullah, Lili Wu, Ying Tian, Tao Li, Kaihui Li, Implementing a dung beetle optimization algorithm enhanced with multi-strategy fusion techniques , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025

