A feature selection and scoring scheme for dimensionality reduction in a machine learning task
Keywords:
Algorithm, Dataset, Dimensionality reduction, Feature selectionAbstract
Selection of important features is very vital in machine learning tasks involving high-dimensional dataset with large features. It helps in reducing the dimensionality of a dataset and improving model performance. Most of the feature selection techniques have restriction in the kind of dataset to be used. This study proposed a feature selection technique that is based on statistical lift measure to select important features from a dataset. The proposed technique is a generic approach that can be used in any binary classification dataset. The technique successfully determined the most important feature subset and outperformed the existing techniques. The proposed technique was tested on lungs cancer dataset and happiness classification dataset. The effectiveness of the proposed technique in selecting important features subset was evaluated and compared with other existing techniques, namely Chi-Square, Pearson Correlation and Information Gain. Both the proposed and the existing techniques were evaluated on five machine learning models using four standard evaluation metrics such as accuracy, precision, recall and F1-score. The experimental results of the proposed technique on lung cancer dataset shows that logistic regression, decision tree, adaboost, gradient boost and random forest produced a predictive accuracy of 0.919%, 0.935%, 0.919%, 0.935% and 0.935% respectively, and that of happiness classification dataset produced a predictive accuracy of 0.758%, 0.689%, 0.724%, 0.655% and 0.689% on random forest, k-nearest neighbor, decision tree, gradient boost and cat boost respectively, which outperformed the existing techniques.
Published
How to Cite
Issue
Section
Copyright (c) 2024 Philemon Uten Emmoh, Christopher Ifeanyi Eke, Timothy Moses

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- Bolarinwa Bolaji, B. I. Omede, U. B. Odionyenma, P. B. Ojih, Abdullahi A. Ibrahim, Modelling the transmission dynamics of Omicron variant of COVID-19 in densely populated city of Lagos in Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 5, Issue 2, May 2023
- Monika Saini, Ashish Kumar, Vijay Singh Maan, Deepak Sinwar, Efficient and Intelligent Decision Support System for Smart Irrigation , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 4, November 2022
- Sherifdeen O. Bolarinwa, Eli Danladi, Andrew Ichoja, Muhammad Y. Onimisia, Christopher U. Achem, Synergistic Study of Reduced Graphene Oxide as Interfacial Buffer Layer in HTL-free Perovskite Solar Cells with Carbon Electrode , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 3, August 2022
- Olayiwola Babarinsa, Olalekan Ihinkalu, Veronica Cyril-Okeme, Hailiza Kamarulhaili, Arif Mandangan, Azfi Zaidi Mohammad Sofi, Akeem B. Disu, Application of hourglass matrix in Goldreich-Goldwasser-Halevi encryption scheme , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 4, November 2022
- Gabriel James, Ime Umoren, Anietie Ekong, Saviour Inyang, Oscar Aloysius, Analysis of support vector machine and random forest models for classification of the impact of technostress in covid and post-covid era , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- Emmanuel C. Ukekwe, Adaora A. Obayi, Akpa Johnson, Daniel A. Musa, Jonathan C. Agbo, Optimizing data and voice service delivery for mobile phones based on clients' demand and location using affinity propagation machine learning , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- B. T. Iorhuna, T. T. Awuhe, I. C. Azuaga, E Isaac, F. Shuaibu, B. Yohanna, Synthesis, Characterization and Antimicrobial Activities of Copper-Tea Leaves (Camellia Sinensis) Extract Nanoparticles.: None , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 4, November 2022
- Chinedu L. Udeze, Idongesit E. Eteng, Ayei E. Ibor, Application of Machine Learning and Resampling Techniques to Credit Card Fraud Detection , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 3, August 2022
- C. A. Oyelami, W. Akande, T. O. Kolawole, A Preliminary Geotechnical Assessment of Residual Tropical Soils around Osogbo Metropolis as Materials for Road Subgrade , Journal of the Nigerian Society of Physical Sciences: Volume 4, Issue 2, May 2022
- T. S. Fagbemigun, M. O. Olorunfemi, S. A. Wahab, Modeling of Self Potential (SP) Anomalies over a Polarized Rod with Finite Depth Extents , Journal of the Nigerian Society of Physical Sciences: Volume 1, Issue 2, May 2019
You may also start an advanced similarity search for this article.

