CropGAN: A conditional GAN framework for synthetic tabular data augmentation in crop recommendation systems

Authors

  • Dekera Kenneth Kwaghtyo
    Department of Computer Science, Faculty of Computing, Federal University of Lafia, P.M.B. 146, Lafia, Nigeria
  • Christopher Ifeanyi Eke
    Department of Computer Science, Faculty of Computing, Federal University of Lafia, P.M.B. 146, Lafia, Nigeria
  • Timothy Moses
    Department of Computer Science, Faculty of Computing, Federal University of Lafia, P.M.B. 146, Lafia, Nigeria

Keywords:

Generative adversarial network, Synthetic tabular data, Data imbalance, Precision agriculture, Crop recommendation

Abstract

Crop recommendation systems play a crucial role in precision agriculture by enabling informed decisions about which crops to cultivate in a specific location. However, their performance is often hindered by the inherent scarcity, imbalance, and regional bias of tabular agricultural datasets. These limitations reduce the reliability and generalizability of crop recommendation models, especially in data-scarce regions. Existing synthetic data generation methods, such as the Synthetic Minority Oversampling Technique (SMOTE) and variational autoencoders (VAEs), struggle to handle high-dimensional, structured agricultural data with categorical variables. Moreover, existing generative adversarial networks (GANs) are primarily image-focused. This study proposes a re-engineered GAN, termed the Crop Recommendation GAN (CropGAN), to generate tabular, multi-crop recommendation data using a class-conditioning mechanism. The framework was designed to handle complex, non-linear datasets with both numerical and categorical variables, addressing dataset size, imbalance, and limited diversity in multi-crop datasets. CropGAN was trained on a dataset of 5,000 samples with 10 crop classes and evaluated against SMOTE and VAE using statistical data-quality and classification-performance metrics. In terms of data-quality assessment, SMOTE best preserved the original data distributions, while CropGAN introduced greater diversity in the synthetic datasets. In terms of classification performance, models trained on CropGAN-generated samples consistently achieved the highest performance, with the support vector machine (SVM) yielding the best accuracy of 99.4%. This result suggests that adversarial learning can be adapted for tabular agricultural datasets to improve the performance of crop recommendation systems in data-scarce regions.

Dimensions

[1] D. K. Kwaghtyo & C. I. Eke, ``Smart farming prediction models for precision agriculture: a comprehensive survey'', Artificial Intelligence Review 56 (2023) 5729. https://doi.org/10.1007/s10462-022-10266-6.

[2] J. T. Iorzua, D. K. Kwaghtyo, T. P. Hule & A. T. Ibrahim, ``AI-driven approach to crop recommendation: tackling class imbalance and feature selection in precision agriculture'', Journal of Future Artificial Intelligence and Technologies 2 (2025) 270. https://doi.org/10.62411/faith.3048-3719-118.

[3] R. Akhter & S. A. Sofi, ``Precision agriculture using IoT data analytics and machine learning'', Journal of King Saud University--Computer and Information Sciences 34 (2022) 5602. https://doi.org/10.1016/j.jksuci.2021.05.013.

[4] Y. Wang, U. S. R. Dhamodharan, N. Sarwar, F. A. Almalki, Q. H. Naith, R. Sathiyaraj & D. Mohan, ``A hybrid approach for rice crop disease detection in agricultural IoT system'', Discover Sustainability 5 (2024) 99. https://doi.org/10.1007/s43621-024-00285-4.

[5] A. Iorliam, I. B. Iorliam & S. Bum, ``Internet of Things for smart agriculture in Nigeria and Africa: a review'', International Journal of Latest Technology in Engineering, Management & Applied Science 10 (2021) 7. https://www.researchgate.net/profile/Aamo-Iorliam/publication/349868793_.

[6] F. O. Isinkaye, M. O. Olusanya & A. A. Akinyelu, ``A multi-class hybrid variational autoencoder and vision transformer model for enhanced plant disease identification'', Intelligent Systems with Applications 26 (2025) 200490. https://doi.org/10.1016/j.iswa.2025.200490.

[7] D. K. Kwaghtyo, C. I. Eke, J. Abah, T. Moses, J. Agushaka & F. B. Fatokun, Soil and climate parameters-based crop recommendation model for Yandev, Gboko Local Government Area of Benue State, 2024 18th International Conference on Ubiquitous Information Management and Communication, Kuala Lumpur, Malaysia, 2024, pp. 1--7. https://doi.org/10.1109/IMCOM60618.2024.10418425.

[8] C. J. Ejiyi, D. Cai, F. O. Eze, M. B. Ejiyi, J. E. Idoko, S. K. Asere & T. U. Ejiyi, ``Polynomial-SHAP as a SMOTE alternative in conglomerate neural networks for realistic data augmentation in cardiovascular and breast cancer diagnosis'', Journal of Big Data 12 (2025) 97. https://doi.org/10.1186/s40537-025-01152-3.

[9] S. Hong, S. An & J. J. Jeon, ``Improving SMOTE via fusing conditional VAE for data-adaptive noise filtering'', Applied Intelligence 55 (2025) 841. https://doi.org/10.1007/s10489-025-06692-y.

[10] F. Lygerakis & E. Rueckert, ``ED-VAE: entropy decomposition of ELBO in variational autoencoders'', arXiv preprint, arXiv:2407.06797 (2024). https://arxiv.org/abs/2407.06797.

[11] D. Anuradha, R. Kuchipudi, B. Ashreetha, J. V. N. Ramesh & A. Rami, ``Enhancing agricultural yield forecasting with deep convolutional generative adversarial networks and satellite data'', International Journal of Advanced Computer Science and Applications 15 (2024) 661. https://doi.org/10.14569/IJACSA.2024.0150269.

[12] M. Fawakherji, V. Suriani, D. Nardi & D. D. Bloisi, ``Shape and style GAN-based multispectral data augmentation for crop/weed segmentation in precision farming'', Crop Protection 184 (2024) 106848. https://doi.org/10.1016/j.cropro.2024.106848.

[13] I. S. Rahat, H. Ghosh, S. Dara & S. Kant, ``Towards precision agriculture tea leaf disease detection using CNNs and image processing'', Scientific Reports 15 (2025) 17571. https://doi.org/10.1038/s41598-025-02378-0.

[14] C. Wang, Y. Xia, L. Xia, Q. Wang & L. Gu, ``Dual discriminator GAN-based synthetic crop disease image generation for precise crop disease identification'', Plant Methods 21 (2025) 46. https://doi.org/10.1186/s13007-025-01361-0.

[15] S. Sarkar, J. Zhou, A. Scaboo, J. Zhou, N. Aloysius & T. T. Lim, ``Assessment of soybean lodging using UAV imagery and machine learning'', Plants 12 (2023) 2893. https://doi.org/10.3390/plants12162893.

[16] C. Cuenca-Romero, O. E. Apolo-Apolo, J. N. Rodríguez Vázquez, G. Egea & M. Pérez-Ruiz, ``Tackling unbalanced datasets for yellow and brown rust detection in wheat'', Frontiers in Plant Science 15 (2024) 1392409. https://doi.org/10.3389/fpls.2024.1392409.

[17] M. K. Senapaty, A. Ray & N. Padhy, ``A decision support system for crop recommendation using machine learning classification algorithms'', Agriculture 14 (2024) 1256. https://doi.org/10.3390/agriculture14081256.

[18] K. G. Sapkal & A. B. Kadam, Class balancing for soil data: predictive modeling approach for crop recommendation using machine learning algorithms, EPJ Web of Conferences 328 (2025) 01026. https://doi.org/10.1051/epjconf/202532801026.

[19] M. Iatrou, C. Karydas, X. Tseni & S. Mourelatos, ``Representation learning with a variational autoencoder for predicting nitrogen requirement in rice'', Remote Sensing 14 (2022) 5978. https://doi.org/10.3390/rs14235978.

[20] Z. Cao, Y. Ma & Z. Zhang, ``Corn yield prediction based on remotely sensed variables using variational autoencoder and multiple instance regression'', Geoscience and Remote Sensing Letters. arXiv preprint, arXiv:2211.13286 (2022). https://arxiv.org/abs/2211.13286.

[21] M. A. Razavi, A. P. Nejadhashemi, B. Majidi, H. S. Razavi, J. Kpodo, R. Eeswaran, I. Ciampitti & P. V. Vara Prasad, ``Enhancing crop yield prediction in Senegal using advanced machine learning techniques and synthetic data'', Artificial Intelligence in Agriculture 14 (2024) 99. https://doi.org/10.1016/j.aiia.2024.11.005.

[22] L. Liu & X. Song, ``Accurate crop yield prediction via temporal convolutional network and variational autoencoder'', Turkish Journal of Agriculture and Forestry 49 (2025) 612. https://doi.org/10.55730/1300-011X.3290.

[23] C. Karam, M. Awad, Y. Abou Jawdah, N. Ezzeddine & A. Fardoun, ``GAN-based semi-automated augmentation online tool for agricultural pest detection: a case study on whiteflies'', Frontiers in Plant Science 13 (2022) 813050. https://doi.org/10.3389/fpls.2022.813050.

[24] J. J. Bird, C. M. Barnes, L. J. Manso, A. Ekárt & D. R. Faria, ``Fruit quality and defect image classification with conditional GAN data augmentation'', Scientia Horticulturae 293 (2022) 110684. https://doi.org/10.1016/j.scienta.2021.110684.

[25] L. Shumilo, A. Okhrimenko, N. Kussul, S. Drozd & O. Shkalikov, ``Generative adversarial network augmentation for solving the training-data imbalance problem in crop classification'', Remote Sensing Letters 14 (2023) 1129. https://doi.org/10.1080/2150704X.2023.2275551.

[26] Q. B. Kwong, Y. T. Kon, W. R. W. Rusik, M. N. A. Shabudin, S. S. A. Rahman, H. Kulaveerasingam & D. R. Appleton, ``Enhancing oil palm segmentation model with GAN-based augmentation'', Journal of Big Data 11 (2024) 126. https://doi.org/10.1186/s40537-024-00990-x.

[27] E.-J. Kim & P. Bansal, ``A deep generative model for feasible and diverse population synthesis'', arXiv preprint, arXiv:2208.01403 (2023). https://doi.org/10.48550/arXiv.2208.01403.

[28] G. Douzas & F. Bacao, ``Geometric SMOTE: A Geometric enhanced drop-in replacement for SMOTE'', Information Sciences 501 (2019) 118--135. https://doi.org/10.1016/j.ins.2019.06.007.

fig 3

Published

2026-06-24

How to Cite

CropGAN: A conditional GAN framework for synthetic tabular data augmentation in crop recommendation systems. (2026). Journal of the Nigerian Society of Physical Sciences, 8(3), 3291. https://doi.org/10.46481/jnsps.2026.3291

Issue

Section

Computer Science

How to Cite

CropGAN: A conditional GAN framework for synthetic tabular data augmentation in crop recommendation systems. (2026). Journal of the Nigerian Society of Physical Sciences, 8(3), 3291. https://doi.org/10.46481/jnsps.2026.3291

Similar Articles

21-30 of 193

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)