CropGAN: A conditional GAN framework for synthetic tabular data augmentation in crop recommendation systems
Keywords:
Generative adversarial network, Synthetic tabular data, Data imbalance, Precision agriculture, Crop recommendationAbstract
Crop recommendation systems play a crucial role in precision agriculture by enabling informed decisions about which crops to cultivate in a specific location. However, their performance is often hindered by the inherent scarcity, imbalance, and regional bias of tabular agricultural datasets. These limitations reduce the reliability and generalizability of crop recommendation models, especially in data-scarce regions. Existing synthetic data generation methods, such as the Synthetic Minority Oversampling Technique (SMOTE) and variational autoencoders (VAEs), struggle to handle high-dimensional, structured agricultural data with categorical variables. Moreover, existing generative adversarial networks (GANs) are primarily image-focused. This study proposes a re-engineered GAN, termed the Crop Recommendation GAN (CropGAN), to generate tabular, multi-crop recommendation data using a class-conditioning mechanism. The framework was designed to handle complex, non-linear datasets with both numerical and categorical variables, addressing dataset size, imbalance, and limited diversity in multi-crop datasets. CropGAN was trained on a dataset of 5,000 samples with 10 crop classes and evaluated against SMOTE and VAE using statistical data-quality and classification-performance metrics. In terms of data-quality assessment, SMOTE best preserved the original data distributions, while CropGAN introduced greater diversity in the synthetic datasets. In terms of classification performance, models trained on CropGAN-generated samples consistently achieved the highest performance, with the support vector machine (SVM) yielding the best accuracy of 99.4%. This result suggests that adversarial learning can be adapted for tabular agricultural datasets to improve the performance of crop recommendation systems in data-scarce regions.
Published
How to Cite
Issue
Section
Copyright (c) 2026 Dekera Kenneth Kwaghtyo, Christopher Ifeanyi Eke, Timothy Moses (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Similar Articles
- L. G. Salaudeen, D. GABI, M. Garba, H. U. Suru, Deep convolutional neural network based synthetic minority over sampling technique: a forfending model for fraudulent credit card transactions in financial institution , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 2, May 2024
- Onyeke Idoko Charles, John Kolo Alhassan, Mohammed Danlami Abdulmalik, Kehinde Dele Tolorunse, A hybrid process-based and neural network post-processing model for cowpea yield prediction under climate variability in North Central Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 2, May 2026
- Raphael Ozighor Enihe, Rajesh Prasad, Francisca Nonyelum Ogwueleka, Fatimah Binta Abdullahi, The effect of imbalance data mitigation techniques on cardiovascular disease prediction , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
- Catherine N. Ogbizi-Ugbe, Osowomuabe Njama-Abang, Samuel Oladimeji, Idongetsit E. Eteng, Edim A. Emanuel, Synergistic intelligence: a novel hybrid model for precision agriculture using k-means, naive Bayes, and knowledge graphs , Journal of the Nigerian Society of Physical Sciences: Volume 8, Issue 1, February 2026
- Gabriel James, Anietie Ekong, Etimbuk Abraham, Enobong Oduobuk, Peace Okafor, Analysis of support vector machine and random forest models for predicting the scalability of a broadband network , Journal of the Nigerian Society of Physical Sciences: Volume 6, Issue 3, August 2024
- Nahid Salma, Majid Khan Majahar Ali, Raja Aqib Shamim, Machine learning-based feature selection for ultra-high-dimensional survival data: a computational approach , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- A. Abdulrahim, M. D Shehu, E Yisa, Z. A. Ishaq, Mathematical Models and Comparative Analysis for Rice and Soya Bean Irrigation Crop Water Needs: A Case Study of Bida Basin Niger State, Nigeria , Journal of the Nigerian Society of Physical Sciences: Volume 3, Issue 4, November 2021
- Idongesit E. Eteng, Udeze L. Chinedu, Ayei E. Ibor, A stacked ensemble approach with resampling techniques for highly effective fraud detection in imbalanced datasets , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- Osowomuabe Njama-Abang, Denis U. Ashishie, Paul T. Bukie, Addressing class imbalance in lassa fever epidemic data, using machine learning: a case study with SMOTE and random forest , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025
- S. N. Enemuo, O. N. Akande, M. O. Lawrence, I. C. Saidu, Optimized aspect level sentiment analysis of tweet data using deep learning and rule-based techniques , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 2, May 2025
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Philemon Uten Emmoh, Christopher Ifeanyi Eke, Timothy Moses, A feature selection and scoring scheme for dimensionality reduction in a machine learning task , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 1, February 2025
- Christopher Ifeanyi Eke, Kholoud Maswadi, Musa Phiri, Mulenga Mwege, Mohammad Imran, Dekera Kenneth Kwaghtyo, Akeremale Olusola Collins, Effective tweets classification for disaster crisis based on ensemble of classifiers , Journal of the Nigerian Society of Physical Sciences: Volume 7, Issue 3, August 2025

