Authors :
Kalpana D. Sonval; Omkar Santosh Shikhare; Omkar Lokhande; Sanjana Dipak Deo; Akshada Suresh Bhandwalkar
Volume/Issue :
Volume 11 - 2026, Issue 3 - March
Google Scholar :
https://tinyurl.com/3fjbfy8s
Scribd :
https://tinyurl.com/3czrdnfp
DOI :
https://doi.org/10.38124/ijisrt/26mar1056
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
The world faces numerous challenges with regard to ensuring food security, which can be exacerbated by
unpredictable climatic conditions and limited resources, thus necessitating more intelligent and sustainable ways of
managing agricultural systems. Precision Agriculture (PA) utilizes technology to manage the input of agricultural crops so
as to achieve higher crop yields. In PA, Artificial Intelligence (AI) and Machine Learning (ML) are key technologies in
support of using AI and ML to make decisions based on large amounts of data from the field. The purpose of this research
project was to provide a complete evaluation of the application of a Multilayer Perceptron (MLP) type neural network to
predict crop yields, utilizing multi-modal tabular agricultural data representing weather conditions, nutrient content of soils
and how farm managers have managed their farms. We designed, implemented and evaluated an MLP model, trained on
the multi-modal tabular data mentioned above. Our approach included the application of a systematic grid-search with 5-
fold-cross-validation for optimizing the hyper-parameters of the MLP, domain-specific feature engineering for improving
the quality of the data, and domain-specific regularization techniques for reducing overfitting and improving
generalizability. We used multiple regression metrics, i.e., R-squared (R
2
), Mean Absolute Error (MAE) and Root Mean
Square Error (RMSE), to compare the performance of the MLP with five other regression models, i.e., Multiple Linear
Regres-sion (MLR), Decision Tree Regressor (DTR), Support Vector Regression (SVR), and Random Forest Regressor
(RFR). The experimental results showed that the optimized MLP achieved competitive performance relative to the other
regression models; the performance metrics were R
2 = 0.89, MAE = 245.32 kg/ha, and RMSE = 312.45 kg/ha. The results
also showed that the MLP performed better than the linear regression and decision tree regression models but similarly to
the random forest regression model. A thorough ablation study provided further evidence to validate the effectiveness of
each of the architectural choices we made in the MLP model. Finally, the permutation-based feature importance analysis
validated the alignment of the features selected by the MLP model with those recommended by established agronomic
principles. This research study provides a new and important contribution to the current state of the art research literature
because it represents the first study to provide a direct head-to-head comparison of foundational neural network
architectures (i.e., MLP) and ensemble methods (i.e., RF) to predict crop yields from tabular agricultural data. As such, the
results of this study demonstrate that the MLP is a viable, scalable and accessible decision-support tool for precision
agriculture applications.
Keywords :
Crop Yield Prediction, Multilayer Perceptron, Neural Networks, Precision Agriculture, Machine Learning, Deep Learning, Random Forest, Agricultural Data Analytics, Feature Engineering, Hyperparameter Optimization.
References :
- Food and Agriculture Organization of the United Nations, “The future of food and agriculture: Trends and challenges,” FAO, Rome, 2017.https://www.fao.org/3/i6583e/i6583e.pdf
- R. Gebbers and V. I. Adamchuk, “Precision Agriculture and Food Security,” Science, vol. 327, no. 5967, pp. 828–831, Feb. 2010.https://doi.org/10.1126/science.1183899
- K. G. Liakos, P. Busato, D. Moshou, S. Pearson, and D. Bochtis, “Machine learning in agriculture: A review,” Sensors, vol. 18, no. 8, p. 2674, Aug. 2018. https://doi.org/10.3390/s18082674
- Anonymous Authors, “AI and Robotics in Agriculture: A Systematic and Quantitative Review... (2015–2025),” AI, vol. 5, no. 5, p. 75, 2025.https://www.mdpi.com/journal/a
- M. Shoaib, A. Sadeghi-Niaraki, F. Ali, I. Hussain, and S. Khalid, “Leveraging deep learning for plant disease and pest detection: a comprehensive review and future directions,” Frontiers in Plant Science, Feb. 2025. https://www.frontiersin.org/journals/plant-science
- S. Murugavalli and R. Gopi, “Plant leaf disease detection using vision transformers for precision agriculture,” Scientific Reports, vol. 15, Jul. 2025. https://www.nature.com/srep
- F. Lin, K. Guillot, S. Crawford, Y. Zhang, X. Yuan, and N.-F. Tzeng, “An Open and Large-Scale Dataset for Multi-Modal Climate Change-aware Crop Yield Predictions,” in Proc. 30th ACM SIGKDD Conf. Knowledge Discovery and Data Mining (KDD ’24), Aug. 2024. https://dl.acm.org/doi/proceedings/10.1145
- Y. Yan, Y. Wang, J. Li, J. Zhang, and X. Mo, “Crop Yield Time-Series Data Prediction Based on Multiple Hybrid Machine Learning Models,” arXiv:2502.10405, v1, Jan. 2025. https://arxiv.org/abs/2502.10405
- R. N. V. J. Mohan, P. S. Rayanoothala, and R. Praneetha Sree, “Next-gen agriculture: integrating AI and XAI for precision crop yield predictions,” Frontiers in Plant Science, vol. 15, Jan. 2025. https://www.frontiersin.org/journals/plant-science
- A. Ruiz-Gonzalez, “Multiplexed Quantification of Soil Nutrients Using an AI-Enhanced and Low-Cost Impedimetric Sensor,” Engineering Pro-ceedings, vol. 106, no. 1, p. 7, Sep. 2025. https://www.mdpi.com/journal/engproc
- D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” in Proc. 3rd Int. Conf. Learning Representations (ICLR), San Diego, CA, USA, 2015. https://arxiv.org/abs/1412.6980
- N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhut-dinov, “Dropout: A Simple Way to Prevent Neural Networks from Overfitting,” Journal of Machine Learning Research, vol. 15, pp. 1929–1958, 2014. https://www.jmlr.org/papers/v15/srivastava14a.html
- L. Breiman, “Random Forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, Oct. 2001. https://doi.org/10.1023/A:1010933404324
- IPCC, “Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report,” Cambridge University Press, 2021. https://www.ipcc.ch/report/ar6/wg1/
- Grand View Research, “Precision Agriculture Market Size, Share & Trends Analysis Report,” 2023. https://www.grandviewresearch.com/industry-analysis/precision agriculture-market
- World Food Programme, “The State of Food Security and Nutrition in the World 2023,” FAO, Rome, 2023. https://www.fao.org/publications/sofi/en/
- K. Hornik, M. Stinchcombe, and H. White, “Multilayer feedforward networks are universal approximators,” Neural Networks, vol. 2, no. 5, pp. 359–366, 1989. https://doi.org/10.1016/0893-6080(89)90020-8
- A. Chlingaryan, S. Sukkarieh, and B. Whelan, “Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review,” Computers and Electronics in Agricul-ture, vol. 151, pp. 61–69, 2018. https://doi.org/10.1016/j.compag.2018.05.012
- J. W. Jones et al., “The DSSAT cropping system model,” European Journal of Agronomy, vol. 18, no. 3-4, pp. 235–265, 2003. https://doi.org/10.1016/S1161-0301(02)00107-7
- S. Khaki and L. Wang, “Crop Yield Prediction Using Deep Neural Networks,” Frontiers in Plant Science, vol. 10, p. 621, 2019. https://doi.org/10.3389/fpls.2019.00621
- J. H. Jeong, J. P. Resop, N. D. Mueller, D. H. Fleisher, K. Yun, E. A. Butler, et al., “Random Forests for Global and Regional Crop Yield Predictions,” PLOS ONE, vol. 11, no. 6, p. e0156571, 2016. https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0156571
- A. J. Smola and B. Scho¨lkopf, “A tutorial on support vector regression,” Statistics and Computing, vol. 14, no. 3, pp. 199–222, 2004. https://doi.org/10.1023/B:STCO.0000035301.49549.88
- L. Grinsztajn, E. Oyallon, and G. Varoquaux, “Why do tree-based models still outperform deep learning on typical tabular data?,” in Proc. NeurIPS 2022 Datasets and Benchmarks Track, 2022. https://arxiv.org/abs/2207.08815
- S. O. Arik and T. Pfister, “TabNet: Attentive Interpretable Tabular Learning,” in Proc. AAAI Conf. Artificial Intelligence, vol. 35, no. 8, pp. 6679–6687, 2021. https://arxiv.org/abs/1908.07442
- Y. Gorishniy, I. Rubachev, V. Khrulkov, and A. Babenko, “Revisiting Deep Learning Models for Tabular Data,” in Proc. NeurIPS, 2021. https://arxiv.org/abs/2106.11959
- M. Weiss, F. Jacob, and G. Duveiller, “Remote sensing for agricultural applications: A meta-review,” Remote Sensing of Environment, vol. 236, p. 111402, 2020. https://doi.org/10.1016/j.rse.2019.111402
- S. M. Lundberg and S.-I. Lee, “A Unified Approach to Interpreting Model Predictions,” in Proc. NeurIPS, 2017. https://arxiv.org/abs/1705.07874
- C. Molnar, Interpretable Machine Learning: A Guide for Making Black Box Models Explainable, 2nd ed., 2022. https://christophm.github.io/interpretable-ml-book/
The world faces numerous challenges with regard to ensuring food security, which can be exacerbated by
unpredictable climatic conditions and limited resources, thus necessitating more intelligent and sustainable ways of
managing agricultural systems. Precision Agriculture (PA) utilizes technology to manage the input of agricultural crops so
as to achieve higher crop yields. In PA, Artificial Intelligence (AI) and Machine Learning (ML) are key technologies in
support of using AI and ML to make decisions based on large amounts of data from the field. The purpose of this research
project was to provide a complete evaluation of the application of a Multilayer Perceptron (MLP) type neural network to
predict crop yields, utilizing multi-modal tabular agricultural data representing weather conditions, nutrient content of soils
and how farm managers have managed their farms. We designed, implemented and evaluated an MLP model, trained on
the multi-modal tabular data mentioned above. Our approach included the application of a systematic grid-search with 5-
fold-cross-validation for optimizing the hyper-parameters of the MLP, domain-specific feature engineering for improving
the quality of the data, and domain-specific regularization techniques for reducing overfitting and improving
generalizability. We used multiple regression metrics, i.e., R-squared (R
2
), Mean Absolute Error (MAE) and Root Mean
Square Error (RMSE), to compare the performance of the MLP with five other regression models, i.e., Multiple Linear
Regres-sion (MLR), Decision Tree Regressor (DTR), Support Vector Regression (SVR), and Random Forest Regressor
(RFR). The experimental results showed that the optimized MLP achieved competitive performance relative to the other
regression models; the performance metrics were R
2 = 0.89, MAE = 245.32 kg/ha, and RMSE = 312.45 kg/ha. The results
also showed that the MLP performed better than the linear regression and decision tree regression models but similarly to
the random forest regression model. A thorough ablation study provided further evidence to validate the effectiveness of
each of the architectural choices we made in the MLP model. Finally, the permutation-based feature importance analysis
validated the alignment of the features selected by the MLP model with those recommended by established agronomic
principles. This research study provides a new and important contribution to the current state of the art research literature
because it represents the first study to provide a direct head-to-head comparison of foundational neural network
architectures (i.e., MLP) and ensemble methods (i.e., RF) to predict crop yields from tabular agricultural data. As such, the
results of this study demonstrate that the MLP is a viable, scalable and accessible decision-support tool for precision
agriculture applications.
Keywords :
Crop Yield Prediction, Multilayer Perceptron, Neural Networks, Precision Agriculture, Machine Learning, Deep Learning, Random Forest, Agricultural Data Analytics, Feature Engineering, Hyperparameter Optimization.