Articles | Volume 18, issue 6
https://doi.org/10.5194/essd-18-3997-2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/essd-18-3997-2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
CY-Bench: a comprehensive benchmark dataset for sub-national crop yield forecasting
Michiel Kallenberg
CORRESPONDING AUTHOR
Artificial Intelligence Group, Wageningen University and Research, P.O. Box 16, Wageningen, 6700 AA, the Netherlands
Dilli Paudel
Artificial Intelligence Group, Wageningen University and Research, P.O. Box 16, Wageningen, 6700 AA, the Netherlands
Stella Ofori-Ampofo
Chair of Data Science in Earth Observation, Technical University of Munich, Arcisstraße 21, Munich, 80333, Germany
Hilmy Baja
Artificial Intelligence Group, Wageningen University and Research, P.O. Box 16, Wageningen, 6700 AA, the Netherlands
Ron van Bree
Artificial Intelligence Group, Wageningen University and Research, P.O. Box 16, Wageningen, 6700 AA, the Netherlands
Aike Potze
Artificial Intelligence Group, Wageningen University and Research, P.O. Box 16, Wageningen, 6700 AA, the Netherlands
Pratishtha Poudel
Department of Agronomy, Purdue University, 915 Mitch Daniels Blvd, West Lafayette, IN 47907, United States
Abdelrahman Saleh
Department of Soil Science, University of Manitoba, 13 Freedman Crescent, Winnipeg, MB R3T 2N2, Canada
Weston Anderson
Department of Geographical Sciences, University of Maryland, 7251 Preinkert Drive, Collega Park, MD 20742, United States
Malte von Bloh
Chair of Data Science in Earth Observation, Technical University of Munich, Arcisstraße 21, Munich, 80333, Germany
Andres Castellano
GISS Impacts Group, NASA Goddard Institute for Space Studies, 535 West 116th Street, Mail Code 4312, New York, NY 10027, United States
Oumnia Ennaji
College of Computing, Mohammed VI Polytechnic University, Lot 660, Benguerir, 43150, Morocco
Raed Hamed
Institute for Environmental Studies, Vrije Universiteit Amsterdam, De Boelelaan 1105, Amsterdam, 1081 HV, the Netherlands
Rahel Laudien
Department of Climate Resilience, Potsdam Institute for Climate Impact Research, P.O. Box 60 12 03, Potsdam, 4412, Germany
Donghoon Lee
Department of Civil Engineering, University of Manitoba, 15 Gillson Street, Winnipeg, MB R3T 5V6, Canada
Inti Luna
Image Processing Laboratory, Universitat de València, C/Catedràtic Agustín Escardino Benlloch, 9, València, 46980, Spain
Dainius Masiliūnas
Laboratory of Geo-Information Science and Remote Sensing, Wageningen University and Research, P.O. Box 47, Wageningen, 6700 AA, the Netherlands
Michele Meroni
Seidor Consulting, Carrer dels Provençals 44, Barcelona, 08019, Spain
Janet Mumo Mutuku
West and Central Africa Region Hub, International Crops Research Institute for the Semi-Arid Tropics, P.O. Box 320, Bamako, Mali
Siyabusa Mkuhlani
Natural Resources Management, International Institute of Tropical Agriculture, P.O. Box 30677, Nairobi, 00100, Kenya
Jonathan Richetti
Agriculture and Food, Commonwealth Scientific and Industrial Research Organisation (CSIRO), 147 Underwood Av, Perth, WA 6014, Australia
Alex C. Ruane
GISS Impacts Group, NASA Goddard Institute for Space Studies, 535 West 116th Street, Mail Code 4312, New York, NY 10027, United States
Ritvik Sahajpal
Department of Geographical Sciences, University of Maryland, 7251 Preinkert Drive, Collega Park, MD 20742, United States
Guanyuan Shuai
Department of Geographical Sciences, University of Maryland, 7251 Preinkert Drive, Collega Park, MD 20742, United States
Vasileios Sitokonstantinou
Artificial Intelligence Group, Wageningen University and Research, P.O. Box 16, Wageningen, 6700 AA, the Netherlands
Rogério de S. Nóia-Júnior
UMR LEPSE, National Research Institute for Agriculture, Food and Environment (INRAE), 2 Pl. Pierre Viala, Montpellier, 34000, France
Amit Kumar Srivastava
Simulation and Data Science- Multiscale modelling and Forecasting, Leibniz Centre for Agricultural Landscape Research, Eberswalder Straße 84, Müncheberg, 15374, Germany
Robert Strong
Agricultural Leadership, Education, and Communications, Texas A&M University, 600 John Kimbrough Blvd, College Station, TX 77843-2116, United States
Lily-belle Sweet
Department of Computational Hydrosystems, Helmholtz Centre for Environmental Research, Permoserstraße 15, Leipzig, 04318, Germany
Petar Vojnović
Fincons s.p.a, Via Torri Bianche 10, Vimercate, 20871, Italy
Allard de Wit
Earth Observation and Environmental Informatics, Wageningen University and Research, P.O. Box 47, Wageningen, 6700 AA, the Netherlands
Maximilian Zachow
Chair of Digital Agriculture, Technical University of Munich, Liesel-Beckmann-Straße 2, Freising, 85354, Germany
Ioannis N. Athanasiadis
Artificial Intelligence Group, Wageningen University and Research, P.O. Box 16, Wageningen, 6700 AA, the Netherlands
Related authors
No articles found.
Detlef P. Van Vuuren, Brian C. O'Neill, Claudia Tebaldi, Benjamin M. Sanderson, Louise P. Chini, Pierre Friedlingstein, Tomoko Hasegawa, Keywan Riahi, Bala Govindasamy, Nico Bauer, Veronika Eyring, Cheikh M. N. Fall, Katja Frieler, Matthew J. Gidden, Laila K. Gohar, Annika Högner, Andrew D. Jones, Jarmo Kikstra, Andrew King, Reto Knutti, Elmar Kriegler, Peter Lawrence, Chris Lennard, Jason Lowe, Camilla Mathison, Shahbaz Mehmood, Zebedee Nicholls, Luciana F. Prado, Qiang Zhang, Steven K. Rose, Alex C. Ruane, Marit Sandstad, Carl-Friedrich Schleussner, Roland Seferian, Jana Sillmann, Chris Smith, Anna A. Sörensson, Swapna Panickal, Kaoru Tachiiri, Naomi Vaughan, Saritha S. Vishwanathan, Tokuta Yokohata, Marco Zecchetto, and Tilo Ziehn
Geosci. Model Dev., 19, 2627–2656, https://doi.org/10.5194/gmd-19-2627-2026, https://doi.org/10.5194/gmd-19-2627-2026, 2026
Short summary
Short summary
We propose a set of seven plausible 21st century emission scenarios, and their multi-century extensions, that will be used by the international community of climate modeling centers to produce the next generation of climate projections. These projections will support climate, impact and mitigation researchers, provide information to practitioners to address future risks from climate change, and contribute to policymakers’ considerations of the trade-offs among various levels of mitigation.
Bailing Li, Abheera Hazra, Amy McNally, Kimberly Slinski, Shraddhanand Shukla, and Weston Anderson
Hydrol. Earth Syst. Sci., 30, 1097–1115, https://doi.org/10.5194/hess-30-1097-2026, https://doi.org/10.5194/hess-30-1097-2026, 2026
Short summary
Short summary
Accurate prediction of terrestrial water storage (TWS) changes is essential for disaster response. This study evaluates TWS forecast skill over Africa using observations from the Gravity Recovery and Climate Experiment (GRACE). Results show that the NASA Catchment Land Surface Model (CLSM) outperforms Noah with Multi-Parameterization (Noah-MP) across 1-6 months lead times, owing to more accurate reanalysis-based initial conditions and stronger representation of TWS interannual variability.
Lorenzo Seguini, Anja Klisch, Michele Meroni, Anton Vrieling, Giacinto Manfron, Clement Atzberger, and Felix Rembold
Earth Syst. Sci. Data, 18, 309–331, https://doi.org/10.5194/essd-18-309-2026, https://doi.org/10.5194/essd-18-309-2026, 2026
Short summary
Short summary
We released a consistent, multi-sensor, timeseries of global biophysical data at 500 m resolution, updated every 10 d since 2000. This dataset is filtered and optimized for agricultural applications and meets the operational needs of the systems for early warning, and crop yield forecasting. Data are freely available at https://data.jrc.ec.europa.eu/dataset/1aac79d8-0d68-4f1c-a40f-b6e362264e50.
Alex C. Ruane, Charlotte L. Pascoe, Claas Teichmann, David J. Brayshaw, Carlo Buontempo, Ibrahima Diouf, Jesus Fernandez, Paula L. M. Gonzalez, Birgit Hassler, Vanessa Hernaman, Ulas Im, Doroteaciro Iovino, Martin Juckes, Iréne L. Lake, Timothy Lam, Xiaomao Lin, Jiafu Mao, Negin Nazarian, Sylvie Parey, Indrani Roy, Wan-Ling Tseng, Briony Turner, Andrew Wiebe, Lei Zhao, and Damaris Zurell
Geosci. Model Dev., 18, 9497–9540, https://doi.org/10.5194/gmd-18-9497-2025, https://doi.org/10.5194/gmd-18-9497-2025, 2025
Short summary
Short summary
This paper describes how the Coupled Model Intercomparison Project organized its 7th phase (CMIP7) to encourage the production of Earth system model outputs relevant for impacts and adaptation. Community engagement identified 13 opportunities for application across human and natural systems, 60 variable groups and 539 unique variables. We also show how simulations can more efficiently meet applications needs by targeting appropriate resolution, time slices, experiments and variable groups.
Riccardo Biella, Anastasiya Shyrokaya, Monica Ionita, Raffaele Vignola, Samuel J. Sutanto, Andrijana Todorovic, Claudia Teutschbein, Daniela Cid, Maria Carmen Llasat, Pedro Alencar, Alessia Matanó, Elena Ridolfi, Benedetta Moccia, Ilias Pechlivanidis, Anne van Loon, Doris E. Wendt, Elin Stenfors, Fabio Russo, Jean-Philippe Vidal, Lucy Barker, Mariana Madruga de Brito, Marleen Lam, Monika Bláhová, Patricia Trambauer, Raed Hamed, Scott J. McGrane, Serena Ceola, Sigrid J. Bakke, Svitlana Krakovska, Viorica Nagavciuc, Faranak Tootoonchi, Giuliano Di Baldassarre, Sandra Hauswirth, Shreedhar Maskey, Svitlana Zubkovych, Marthe Wens, and Lena M. Tallaksen
Nat. Hazards Earth Syst. Sci., 25, 4475–4501, https://doi.org/10.5194/nhess-25-4475-2025, https://doi.org/10.5194/nhess-25-4475-2025, 2025
Short summary
Short summary
The DitA (Drought in the Anthropocene) network's study on the 2022 European drought reveals growing risks, varied impacts, and fragmented, short-term management. Based on a survey of water managers, it explores risk, impacts, strategies, and their evolution. While challenges persist, signs of improvement show readiness for change. The authors call for a European Drought Directive to unify and guide future drought risk management.
Lily-belle Sweet, Christoph Müller, Jonas Jägermeyr, and Jakob Zscheischler
EGUsphere, https://doi.org/10.5194/egusphere-2025-3006, https://doi.org/10.5194/egusphere-2025-3006, 2025
Short summary
Short summary
This study presents a method to identify climate drivers of an impact, such as agricultural yield failure, from high-resolution weather data. The approach systematically generates, selects and combines predictors that generalise across different environments. Tested on crop model simulations, the identified drivers are used to create parsimonious models that achieve high predictive performance over long time horizons, offering a more interpretable alternative to black-box models.
Alessia Matanó, Raed Hamed, Manuela I. Brunner, Marlies H. Barendrecht, and Anne F. Van Loon
Hydrol. Earth Syst. Sci., 29, 2749–2764, https://doi.org/10.5194/hess-29-2749-2025, https://doi.org/10.5194/hess-29-2749-2025, 2025
Short summary
Short summary
Persistent droughts change how rivers respond to rainfall. Our study of over 5000 catchments worldwide found that hydrological and soil moisture droughts decrease river-flow response to rain, especially in arid regions, while vegetation decline slightly increases it. Snow-covered areas are more resilient due to stored water buffering changes. Droughts can also cause long-lasting changes, with short and intense droughts reducing river response to rainfall and prolonged droughts increasing it.
Jose Rafael Guarin, Jonas Jägermeyr, Elizabeth A. Ainsworth, Fabio A. A. Oliveira, Senthold Asseng, Kenneth Boote, Joshua Elliott, Lisa Emberson, Ian Foster, Gerrit Hoogenboom, David Kelly, Alex C. Ruane, and Katrina Sharps
Geosci. Model Dev., 17, 2547–2567, https://doi.org/10.5194/gmd-17-2547-2024, https://doi.org/10.5194/gmd-17-2547-2024, 2024
Short summary
Short summary
The effects of ozone (O3) stress on crop photosynthesis and leaf senescence were added to maize, rice, soybean, and wheat crop models. The modified models reproduced growth and yields under different O3 levels measured in field experiments and reported in the literature. The combined interactions between O3 and additional stresses were reproduced with the new models. These updated crop models can be used to simulate impacts of O3 stress under future climate change and air pollution scenarios.
Raed Hamed, Sem Vijverberg, Anne F. Van Loon, Jeroen Aerts, and Dim Coumou
Earth Syst. Dynam., 14, 255–272, https://doi.org/10.5194/esd-14-255-2023, https://doi.org/10.5194/esd-14-255-2023, 2023
Short summary
Short summary
Spatially compounding soy harvest failures can have important global impacts. Using causal networks, we show that soy yields are predominately driven by summer soil moisture conditions in North and South America. Summer soil moisture is affected by antecedent soil moisture and by remote extra-tropical SST patterns in both hemispheres. Both of these soil moisture drivers are again influenced by ENSO. Our results highlight physical pathways by which ENSO can drive spatially compounding impacts.
M. Hosseini, I. Becker-Reshef, R. Sahajpal, P. Lafluf, G. Leale, E. Puricelli, S. Skakun, and H. McNairn
ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., V-3-2022, 405–410, https://doi.org/10.5194/isprs-annals-V-3-2022-405-2022, https://doi.org/10.5194/isprs-annals-V-3-2022-405-2022, 2022
Lei Ma, George Hurtt, Lesley Ott, Ritvik Sahajpal, Justin Fisk, Rachel Lamb, Hao Tang, Steve Flanagan, Louise Chini, Abhishek Chatterjee, and Joseph Sullivan
Geosci. Model Dev., 15, 1971–1994, https://doi.org/10.5194/gmd-15-1971-2022, https://doi.org/10.5194/gmd-15-1971-2022, 2022
Short summary
Short summary
We present a global version of the Ecosystem Demography (ED) model which can track vegetation 3-D structure and scale up ecological processes from individual vegetation to ecosystem scale. Model evaluation against multiple benchmarking datasets demonstrated the model’s capability to simulate global vegetation dynamics across a range of temporal and spatial scales. With this version, ED has the potential to be linked with remote sensing observations to address key scientific questions.
Raed Hamed, Anne F. Van Loon, Jeroen Aerts, and Dim Coumou
Earth Syst. Dynam., 12, 1371–1391, https://doi.org/10.5194/esd-12-1371-2021, https://doi.org/10.5194/esd-12-1371-2021, 2021
Short summary
Short summary
Soy yields in the US are affected by climate variability. We identify the main within-season climate drivers and highlight potential compound events and associated agricultural impacts. Our results show that soy yields are most negatively influenced by the combination of high temperature and low soil moisture during the summer crop reproductive period. Furthermore, we highlight the role of temperature and moisture coupling across the year in generating these hot–dry extremes and linked impacts.
Louise Chini, George Hurtt, Ritvik Sahajpal, Steve Frolking, Kees Klein Goldewijk, Stephen Sitch, Raphael Ganzenmüller, Lei Ma, Lesley Ott, Julia Pongratz, and Benjamin Poulter
Earth Syst. Sci. Data, 13, 4175–4189, https://doi.org/10.5194/essd-13-4175-2021, https://doi.org/10.5194/essd-13-4175-2021, 2021
Short summary
Short summary
Carbon emissions from land-use change are a large and uncertain component of the global carbon cycle. The Land-Use Harmonization 2 (LUH2) dataset was developed as an input to carbon and climate simulations and has been updated annually for the Global Carbon Budget (GCB) assessments. Here we discuss the methodology for producing these annual LUH2 updates and describe the 2019 version which used new cropland and grazing land data inputs for the globally important region of Brazil.
Jaber Rahimi, Expedit Evariste Ago, Augustine Ayantunde, Sina Berger, Jan Bogaert, Klaus Butterbach-Bahl, Bernard Cappelaere, Jean-Martial Cohard, Jérôme Demarty, Abdoul Aziz Diouf, Ulrike Falk, Edwin Haas, Pierre Hiernaux, David Kraus, Olivier Roupsard, Clemens Scheer, Amit Kumar Srivastava, Torbern Tagesson, and Rüdiger Grote
Geosci. Model Dev., 14, 3789–3812, https://doi.org/10.5194/gmd-14-3789-2021, https://doi.org/10.5194/gmd-14-3789-2021, 2021
Short summary
Short summary
West African Sahelian and Sudanian ecosystems are important regions for global carbon exchange, and they provide valuable food and fodder resources. Therefore, we simulated net ecosystem exchange and aboveground biomass of typical ecosystems in this region with an improved process-based biogeochemical model, LandscapeDNDC. Carbon stocks and exchange rates were particularly correlated with the abundance of trees. Grass and crop yields increased under humid climatic conditions.
Cited articles
Ambikapathi, R., Schneider, K. R., Davis, B., Herrero, M., Winters, P., and Fanzo, J. C.: Global food systems transitions have enabled affordable diets but had less favourable outcomes for nutrition, environmental health, inclusion and equity, Nature Food, 3, 764–779, https://doi.org/10.1038/s43016-022-00588-7, 2022. a
Argentina: Ministerio de Agrícultura, Ganaderia y Pesca. “Estimaciones Agrícolas”, https://datosestimaciones.magyp.gob.ar/reportes.php?reporte=Estimaciones (last access: 29 April 2016), 2016. a
Australia: ABARES. Australian Bureau of Agricultural and Resource Economics and Sciences Farm Data Portal, https://www.agriculture.gov.au/abares/data/farm-data-portal#data-download (last access: 5 March 2024), 2024. a
Basso, B. and Liu, L.: Seasonal crop yield forecast: Methods, applications, and accuracies, Adv. Agron., 154, 201–255, https://doi.org/10.1016/bs.agron.2018.11.002, 2019. a, b
Batjes, N. H.: Harmonized soil property values for broad-scale modelling (WISE30sec) with estimates of global soil carbon stocks, Geoderma, 269, 61–68, https://doi.org/10.1016/j.geoderma.2016.01.034, 2016. a, b
Becker-Reshef, I., Justice, C., Barker, B., Humber, M., Rembold, F., Bonifacio, R., Zappacosta, M., Budde, M., Magadzire, T., Shitote, C., Pound, J., Constantino, A., Nakalembe, C., Mwangi, K., Sobue, S., Newby, T., Whitcraft, A., Jarvis, I., and Verdin, J.: Strengthening agricultural decisions in countries at risk of food insecurity: The GEOGLAM Crop Monitor for Early Warning, Remote Sens. Environ., 237, 111553, https://doi.org/10.1016/j.rse.2019.111553, 2020. a
Becker-Reshef, I., Barker, B., Whitcraft, A., Oliva, P., Mobley, K., Justice, C., and Sahajpal, R.: Crop type maps for operational global agricultural monitoring, Sci. Data, 10, 172, https://doi.org/10.1038/s41597-023-02047-9, 2023. a
Benos, L., Tagarakis, A. C., Dolias, G., Berruto, R., Kateris, D., and Bochtis, D.: Machine learning in agriculture: A comprehensive updated review, Sensors, 21, 3758, https://doi.org/10.3390/s21113758, 2021. a
Boogaard, H., Schubert, J., De Wit, A., Lazebnik, J., Hutjes, R., and Van der Grijn, G.: Agrometeorological indicators from 1979 to present derived from reanalysis, Climate Data Store – Copernicus Climate Change Service, https://doi.org/10.24381/cds.6c68c9bb, 2022. a, b
Boryan, C., Yang, Z., Mueller, R., and Craig, M.: Monitoring US agriculture: the US department of agriculture, national agricultural statistics service, cropland data layer program, Geocarto Int., 26, 341–358, 2011. a
Brazil: IBGE SIDRA. “Tabela 1612: Área plantada, área colhida, quantidade produzida, rendimento médio e valor da produção das lavouras temporárias”, https://sidra.ibge.gov.br/tabela/1612 (last access: 6 February 2024), 2022. a
Burnell, R., Schellaert, W., Burden, J., Ullman, T. D., Martinez-Plumed, F., Tenenbaum, J. B., Rutar, D., Cheke, L. G., Sohl-Dickstein, J., Mitchell, M., Kiela, D., Shanahan, M., Voorhees, E. M., Cohn, A. G., Leibo, J. Z., and Hernandez-Orallo, J.: Rethink reporting of evaluation results in AI, Science, 380, 136–138, https://doi.org/10.1126/science.adf6369, 2023. a
Chen, B. and Villoria, N. B.: Foreign yield shocks and domestic price variability: the case of maize in developing countries, Environ. Res. Lett., 17, 124044, https://doi.org/10.1088/1748-9326/aca7d5, 2022. a
China: National Bureau of Statistics of China, National Data Portal, https://data.stats.gov.cn (last access: 18 February 2024), 2024. a
Chlingaryan, A., Sukkarieh, S., and Whelan, B.: Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review, Comput. Electron. Agr., 151, 61–69, https://doi.org/10.1016/j.compag.2018.05.012, 2018. a
Coulibaly, S., Kamsu-Foguem, B., Kamissoko, D., and Traore, D.: Deep neural networks with transfer learning in millet crop images, Comput. Ind., 108, 115–120, 2019. a
Cunha, R. L., Silva, B., and Netto, M. A.: A scalable machine learning system for pre-season agriculture yield forecast, in: 2018 IEEE 14th international conference on e-science (e-Science), IEEE, 423–430, https://doi.org/10.1109/eScience.2018.00131, 2018. a
Darbyshire, R., Crean, J., Cashen, M., Anwar, M. R., Broadfoot, K. M., Simpson, M., Cobon, D. H., Pudmenzky, C., Kouadio, L., and Kodur, S.: Insights into the value of seasonal climate forecasts to agriculture, Aust. J. Agr. Resour. Ec., 64, 1034–1058, https://doi.org/10.1111/1467-8489.12389, 2020. a
Davis, K. F., Anderson, W., Ehrmann, S., Flach, R., Meyer, C., Proctor, J., Ray, D. K., You, L., Foley, M., Kerdiles, H., Hultgren, A., Huybers, P., Kebede, E., Meroni, M., Park, C., and Lee, D.: HarvestStat: A global effort towards open and standardized sub-national agricultural data, Environ. Res. Lett., 20, 052001, https://doi.org/10.1088/1748-9326/adcb54, 2025. a, b
Duden, C., Nacke, C., and Offermann, F.: German yield and area data for 11 crops from 1979 to 2021 at a harmonized spatial resolution of 397 districts, Sci. Data, 11, https://doi.org/10.1038/s41597-024-02951-8, 2024. a, b
Dueben, P. D., Schultz, M. G., Chantry, M., Gagne, D. J., Hall, D. M., and McGovern, A.: Challenges and Benchmark Datasets for Machine Learning in the Atmospheric Sciences: Definition, Status, and Outlook, Artificial Intelligence for the Earth Systems, 1, https://doi.org/10.1175/AIES-D-21-0002.1, 2022. a
Ennaji, O., Vergutz, L., and El Allali, A.: Machine learning in nutrient management: A review, Artificial Intelligence in Agriculture, 9, 1–11, https://doi.org/10.1016/j.aiia.2023.06.001, 2023. a
Ennaji, O., Baha, S., Vergutz, L., and El Allali, A.: Gradient boosting for yield prediction of elite maize hybrid ZhengDan 958, PloS One, 19, e0315493, https://doi.org/10.1371/journal.pone.0315493, 2024. a
Fan, J., Bai, J., Li, Z., Ortiz-Bobea, A., and Gomes, C.: A GNN-RNN Approach for Harnessing Geospatial and Temporal Information: Application to Crop Yield Prediction, Proceedings of the AAAI Conference on Artificial Intelligence, 36, 11873–11881, https://doi.org/10.1609/aaai.v36i11.21444, 2022. a
Fanzo, J.: Kicking around in the wreck, PLOS Climate, 3, e0000401, https://doi.org/10.1371/journal.pclm.0000401, 2024. a
FAO: Crop Calendar Food and Agriculture Organization of the United Nations, https://cropcalendar.apps.fao.org (last access: 11 February 2025), 2025. a
Fernandez-Beltran, R., Baidar, T., Kang, J., and Pla, F.: Rice-Yield Prediction with Multi-Temporal Sentinel-2 Data and 3D CNN: A Case Study in Nepal, Remote Sensing, 13, 1391, https://doi.org/10.3390/rs13071391, 2021. a
Franch, B., Cintas, J., Becker-Reshef, I., Sanchez-Torres, M. J., Roger, J., Skakun, S., Sobrino, J. A., Van Tricht, K., Degerickx, J., Gilliams, S., Koetz, B., Szantoi, Z., and Whitcraft, A.: Global crop calendars of maize and wheat in the framework of the WorldCereal project, GISci. Remote Sens., 59, 885–913, https://doi.org/10.1080/15481603.2022.2079273, 2022. a, b
Frieler, K., Schauberger, B., Arneth, A., Balkovič, J., Chryssanthacopoulos, J., Deryng, D., Elliott, J., Folberth, C., Khabarov, N., Müller, C., Olin, S., Pugh, T.A.M., Schaphoff, S., Schewe, J., Schmid, E., Warszawski, L., andLevermann, A.: Understanding the weather signal in national crop-yield variability, Earth's Future, 5, 605–616, https://doi.org/10.1002/2016EF000525, 2017. a
Gavasso-Rita, Y. L., Papalexiou, S. M., Li, Y., Elshorbagy, A., Li, Z., and Schuster-Wallace, C.: Crop models and their use in assessing crop production and food security: A review, Food and Energy Security, 13, e503, https://doi.org/10.1002/fes3.503, 2023. a
Good, D. L. and Irwin, S. H.: Understanding USDA corn and soybean production forecasts: An overview of methods, performance and market impacts, https://doi.org/10.22004/ag.econ.37496, 2003. a
Hatfield, J. L. and Prueger, J. H.: Temperature extremes: Effect on plant growth and development, Weather and Climate Extremes, 10, 4–10, 2015. a
He, D., Wang, E., Wang, J., and Robertson, M.: Data requirement for effective calibration of process-based crop models, Agr. Forest Meteorol., 234–235, 136–148, https://doi.org/10.1016/j.agrformet.2016.12.015, 2017. a
Hengl, T., Mendes de Jesus, J., Heuvelink, G. B., Ruiperez Gonzalez, M., Kilibarda, M., Blagotić, A., Shangguan, W., Wright, M. N., Geng, X., Bauer-Marschallinger, B., Guevara, M. A., Vargas, R., MacMillan, R. A., Batjes, N. H., Leenaars, J. G. B., Ribeiro, E., Wheeler, I., Mantel, S., and Kempen, B.: SoilGrids250m: Global gridded soil information based on machine learning, PLoS One, 12, e0169748, https://doi.org/10.1371/journal.pone.0169748, 2017. a
Hoffmann, H., Zhao, G., Asseng, S., Bindi, M., Biernath, C., Constantin, J., Coucheney, E., Dechow, R., Doro, L., Eckersten, H., Gaiser, T., Grosz, B., Heinlein, F., Kassie, B. T., Kersebaum, K.-C., Klein, C., Kuhnert, M., Lewan, E., Moriondo, M., Nendel, C., Priesack, E., Raynal, H., Roggero, P. P., Rötter, R. P., Siebert, S., Specka, X., Tao, F., Teixeira, E., Trombi, G., Wallach, D., Weihermüller, L., Yeluripati, J., and Ewert, F.: Impact of spatial soil and climate input data aggregation on regional yield simulations, PloS One, 11, e0151782, https://doi.org/10.1371/journal.pone.0151782, 2016. a
Höhl, A., Ofori-Ampofo, S., Obadic, I., Fernández-Torres, M.-Á., Salih Kuzu, R., and Zhu, X.: USCC: A Benchmark Dataset for Crop Yield Prediction under Climate Extremes, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-15540, https://doi.org/10.5194/egusphere-egu23-15540, 2023. a
Huete, A., Didan, K., Miura, T., Rodriguez, E. P., Gao, X., and Ferreira, L. G.: Overview of the radiometric and biophysical performance of the MODIS vegetation indices, Remote Sens. Environ., 83, 195–213, 2002. a
ICRISAT Mali: Compagnie Malienne pour le Developpement des Textiles (CMDT) Monitoring Operational Maize Yield Statistics, http://data.icrisat.org/dataverse/ (last access: 30 May 2026), 2018. a
India: ICRISAT. District Level Database, http://data.icrisat.org/dld/src/crops.html (last access: 9 February 2024), 2024. a
Johnson, D. M.: A comprehensive assessment of the correlations between field crop yields and commonly used MODIS products, Int. J. Appl. Earth Obs., 52, 65–81, https://doi.org/10.1016/j.jag.2016.05.010, 2016. a
Kallenberg, M., Paudel, D., Ofori-Ampofo, S., Baja, H., van Bree, R., Potze, A., Poudel, P., Saleh, A., Anderson, W., von Bloh, M., Castellano, A., Ennaji, O., Hamed, R., Laudien, R., Lee, D., Luna, I., Masiliūnas, D., Meroni, M., Mutuku, J. M., Mkuhlani, S., Richetti, J., Ruane, A. C., Sahajpal, R., Shuai, G., Sitokonstantinou, V., de Souza Noia Junior, R., Srivastava, A. K., Strong, R., Sweet, L.-b., Vojnović, P., de Wit, A., Zachow, M., and Athanasiadis, I. N.: CY-Bench: A comprehensive benchmark dataset for subnational crop yield forecasting, Zenodo [data set], https://doi.org/10.5281/zenodo.11502142, 2025. a, b
Kallenberg, M., Paudel, D., Ofori-Ampofo, S., Baja, H., van Bree, R., Potze, A., Poudel, P., Saleh, A., Anderson, W., von Bloh, M., Castellano, A., Ennaji, O., Hamed, R., Laudien, R., Lee, D., Luna, I., Masiliunas, D., Meroni, M., Mutuku, J. M., Mkuhlani, S., Richetti, J., Ruane, A. C., Sahajpal, R., Shuai, G., Sitokonstantinou, V., de Souza Noia Junior, R., Srivastava, A. K., Strong, R., Sweet, L.-b., Vojnovic, P., de Wit, A., Zachow, M., and Athanasiadis, I. N. : WUR-AI/AgMLCY-Bench: CY-Bench v1.0.0: ESSD paper, Zenodo [code], https://doi.org/10.5281/zenodo.20456375, 2026 a, b, c, d, e, f, g, h, i, j
Kamilaris, A. and Prenafeta-Boldú, F. X.: Deep learning in agriculture: A survey, Comput. Electron. Agr., 147, 70–90, https://doi.org/10.1016/j.compag.2018.02.016, 2018. a
Kapoor, S. and Narayanan, A.: Leakage and the reproducibility crisis in machine-learning-based science, Patterns, 4, 100804, https://doi.org/10.1016/j.patter.2023.100804, 2023. a
Kebede, E. A., Oluoch, K. O., Siebert, S., Mehta, P., Hartman, S., Jägermeyr, J., Ray, D., Ali, T., Brauman, K. A., Deng, Q., Xie, W., and Davis, K. F.: A global open-source dataset of monthly irrigated and rainfed cropped areas (MIRCA-OS) for the 21st century, Sci. Data, 12, 208, https://doi.org/10.1038/s41597-024-04313-w, 2025. a, b, c
Kerner, H., Tseng, G., Becker-Reshef, I., Nakalembe, C., Barker, B., Munshell, B., Paliyam, M., and Hosseini, M.: Rapid Response Crop Maps in Data Sparse Regions, in: ACM SIGKDD Conference on Data Mining and Knowledge Discovery Workshops, arXiv, https://doi.org/10.48550/arXiv.2006.16866, 2020. a
Khaki, S., Wang, L., and Archontoulis, S. V.: A CNN-RNN framework for crop yield prediction, Front. Plant Sci., 10, 1750, https://doi.org/10.3389/fpls.2019.01750, 2020. a, b, c
Kondmann, L., Toker, A., Ruß wurm, M., Camero, A., Peressuti, D., Milcinski, G., Mathieu, P.-P., Longepe, N., Davis, T., Marchisio, G., Leal-Taixé, L., and Zhu, X.: DENETHOR: The DynamicEarthNET dataset for Harmonized, inter-Operable, analysis-Ready, daily crop monitoring from space, in: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks, edited by: Vanschoren, J. and Yeung, S., Vol. 1, Curran, https://openreview.net/pdf?id=uUa4jNMLjrL (last access: 30 May 2026), 2021. a
Koukos, A., Jo, H.-W., Sitokonstantinou, V., Tsoumas, I., Kontoes, C., and Lee, W.-K.: Towards Global Crop Maps with Transfer Learning, in: IGARSS 2024–2024 IEEE International Geoscience and Remote Sensing Symposium, 1540–1545, IEEE, https://doi.org/10.1109/IGARSS53475.2024.10641793, 2024. a
Lawes, R., Mata, G., Richetti, J., Fletcher, A., and Herrmann, C.: Using remote sensing, process-based crop models, and machine learning to evaluate crop rotations across 20 million hectares in Western Australia, Agron. Sustain. Dev., 42, 120, https://doi.org/10.1007/s13593-022-00851-y, 2022. a
Lecerf, R., Ceglar, A., López-Lozano, R., Van Der Velde, M., and Baruth, B.: Assessing the information in crop model and meteorological indicators to forecast crop yield over Europe, Agr. Syst., 168, 191–202, https://doi.org/10.1016/j.agsy.2018.03.002, 2019. a
Lee, D., Davenport, F., Shukla, S., Husak, G., Funk, C., Harrison, L., McNally, A., Rowland, J., Budde, M., and Verdin, J.: Maize yield forecasts for Sub-Saharan Africa using Earth Observation data and machine learning, Glob. Food Secur., 33, 100643, https://doi.org/10.1016/j.gfs.2022.100643, 2022. a
Lee, D., Anderson, W., Chen, X., Davenport, F., Shukla, S., Sahajpal, R., Budde, M., Rowland, J., Verdin, J., You, L., Ahouangbenon, M., Davis, K. F., Kebede, E., Ehrmann, S., Justice, C., and Meyer, C.: HarvestStat Africa–harmonized subnational crop statistics for sub-Saharan Africa, Sci. Data, 12, 690, https://doi.org/10.1038/s41597-025-05001-z, 2025. a, b
Lesk, C., Anderson, W., Rigden, A., Coast, O., Jägermeyr, J., McDermid, S., Davis, K. F., and Konar, M.: Compound heat and moisture extreme impacts on global crop yields under climate change, Nat. Rev. Earth Environ., 3, 872–889, https://doi.org/10.1038/s43017-022-00368-8, 2022. a
Liakos, K., Busato, P., Moshou, D., Pearson, S., and Bochtis, D.: Machine learning in agriculture: A review, Sensors, 18, 2674, https://doi.org/10.3390/s18082674, 2018. a
Liliane, T. N. and Charles, M. S.: Factors Affecting Yield of Crops, in: Agronomy, edited by: Amanullah, Chap. 2, IntechOpen, Rijeka, https://doi.org/10.5772/intechopen.90672, 2020. a, b
Lin, F., Guillot, K., Crawford, S., Zhang, Y., Yuan, X., and Tzeng, N.-F.: An Open and Large-Scale Dataset for Multi-Modal Climate Change-aware Crop Yield Predictions, in: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 5375–5386, ACM, https://doi.org/10.1145/3637528.3671536, 2024. a
Liu, Q., Yang, M., Mohammadi, K., Song, D., Bi, J., and Wang, G.: Machine Learning Crop Yield Models Based on Meteorological Features and Comparison with a Process-Based Model, Artificial Intelligence for the Earth Systems, 1, https://doi.org/10.1175/AIES-D-22-0002.1, 2022. a
López, J., Way, D. A., and Sadok, W.: Systemic effects of rising atmospheric vapor pressure deficit on plant physiology and productivity, Glob. Change Biol., 27, 1704–1720, 2021. a
Ma, Y., Yang, Z., Huang, Q., and Zhang, Z.: Improving the Transferability of Deep Learning Models for Crop Yield Prediction: A Partial Domain Adaptation Approach, Remote Sens., 15, https://doi.org/10.3390/rs15184562, 2023. a
Mateo-Sanchis, A., Piles, M., Amorós-López, J., Muñoz-Marí, J., Adsuara, J. E., Moreno-Martínez, Á., and Camps-Valls, G.: Learning main drivers of crop progress and failure in Europe with interpretable machine learning, Int. J. Appl. Earth Obs., 104, 102574, https://doi.org/10.1016/j.jag.2021.102574, 2021. a
Mehrabi, Z., Delzeit, R., Ignaciuk, A., Levers, C., Braich, G., Bajaj, K., Amo-Aidoo, A., Anderson, W., Balgah, R. A., Benton, T. G., Chari, M. M., Ellis, E. C., Gahi, N. Z., Gaupp, F., Garibaldi, L. A., Gerber, J. S., Godde, C. M., Grass, I., Heimann, T., Hirons, M., Hoogenboom, G., Jain, M., James, D., Makowski, D., Masamha, B., Meng, S., Monprapussorn, S., Müller, D., Nelson, A., Newlands, N. K., Noack, F., Oronje, M., Raymond, C., Reichstein, M., Rieseberg, L. H., Rodriguez-Llanes, J. M., Rosenstock, T., Rowhani, P., Sarhadi, A., Seppelt, R., Sidhu, B. S., Snapp, S., Soma, T., Sparks, A. H., Teh, L., Tigchelaar, M., Vogel, M. M., West, P. C., Wittman, H., and You, L.: Research priorities for global food security under extreme events, One Earth, 5, 756–766, https://doi.org/10.1016/j.oneear.2022.06.008, 2022. a
Meroni, M., Fasbender, D., Rembold, F., Atzberger, C., and Klisch, A.: Near real-time vegetation anomaly detection with MODIS NDVI: Timeliness vs. accuracy and effect of anomaly computation options, Remote Sens. Environ., 221, 508–521, https://doi.org/10.1016/j.rse.2018.11.041, 2019. a
Meroni, M., Waldner, F., Seguini, L., Kerdiles, H., and Rembold, F.: Yield forecasting with machine learning and small data: What gains for grains?, Agr. Forest Meteorol., 308, 108555, https://doi.org/10.1016/j.agrformet.2021.108555, 2021. a
Mexico: INEGI. Agricultural Census and Survey Data, https://www.inegi.org.mx (last access: 10 April 2024), 2019. a
Meyer, H. and Pebesma, E.: Machine learning-based global maps of ecological variables and the challenge of assessing them, Nat. Commun., 13, 2208, https://doi.org/10.1038/s41467-022-29838-9, 2022. a
Minoli, S., Egli, D. B., Rolinski, S., and Müller, C.: Modelling cropping periods of grain crops at the global scale, Global Planet. Change, 174, 35–46, https://doi.org/10.1016/j.gloplacha.2018.12.013, 2019. a
Miralles, D. G., Bonte, O., Koppa, A., Baez-Villanueva, O. M., Tronquo, E., Zhong, F., Beck, H. E., Hulsman, P., Dorigo, W., Verhoest, N. E., and Haghdoost, S.: GLEAM4: global land evaporation and soil moisture dataset at 0.1 resolution from 1980 to near present, Sci. Data, 12, 416, https://doi.org/10.1038/s41597-025-04610-y, 2025. a
Mitrovic, J., McWilliams, B., Walker, J., Buesing, L., and Blundell, C.: Representation learning via invariant causal mechanisms, arXiv [preprint], https://doi.org/10.48550/arXiv.2010.07922, 2020. a
Nakalembe, C. and Kerner, H.: Considerations for AI-EO for agriculture in Sub-Saharan Africa, Environ. Res. Lett., 18, 041002, https://doi.org/10.1088/1748-9326/acc476, 2023. a
Neophytides, S. P., Tsoumas, I., Tsalakou, A., Christoforou, M., Mavrovouniotis, M., Eliades, M., Papoutsa, C., Kontoes, C., and Hadjimitsis, D. G.: Invariant learning as a pathway to robust potato yield prediction, in: Remote Sensing for Agriculture, Ecosystems, and Hydrology XXVI, SPIE, 13191, 33–42, https://doi.org/10.1117/12.3031554, 2024. a
Nowakowski, A., Mrziglod, J., Spiller, D., Bonifacio, R., Ferrari, I., Mathieu, P. P., Garcia-Herranz, M., and Kim, D.-H.: Crop type mapping by using transfer learning, Int. J. Appl. Earth Obs., 98, 102313, https://doi.org/10.1016/j.jag.2021.102313, 2021. a
Oikonomidis, A., Catal, C., and Kassahun, A.: Deep learning for crop yield prediction: a systematic literature review, New Zeal. J. Crop Hort., 1–26, https://doi.org/10.1080/01140671.2022.2032213, 2022. a
Paudel, D., Boogaard, H., de Wit, A., Janssen, S., Osinga, S., Pylianidis, C., and Athanasiadis, I. N.: Machine learning for large-scale crop yield forecasting, Agr. Syst., 187, 103016, https://doi.org/10.1016/j.agsy.2020.103016, 2021. a, b, c
Paudel, D., Boogaard, H., de Wit, A., van der Velde, M., Claverie, M., Nisini, L., Janssen, S., Osinga, S., and Athanasiadis, I. N.: Machine learning for regional crop yield forecasting in Europe, Field Crop. Res., 276, 108377, https://doi.org/10.1016/j.fcr.2021.108377, 2022. a, b
Paudel, D., Marcos, D., de Wit, A., Boogaard, H., and Athanasiadis, I. N.: A weakly supervised framework for high-resolution crop yield forecasts, Environ. Res. Lett., 18, 094062, https://doi.org/10.1088/1748-9326/acf50e, 2023. a
Paudel, D. R., Marcos, D., de Wit, A., Boogaard, H., and Athanasiadis, I. N.: A weakly supervised framework for high resolution crop yield forecasts, Environ. Res. Lett., 18, 094062, https://doi.org/10.1088/1748-9326/acf50e, 2023b. a, b
Pelletier, C., Webb, G. I., and Petitjean, F.: Temporal Convolutional Neural Network for the Classification of Satellite Image Time Series, Remote Sensing, 11, https://doi.org/10.3390/rs11050523, 2019. a
Poggio, L., de Sousa, L. M., Batjes, N. H., Heuvelink, G. B. M., Kempen, B., Ribeiro, E., and Rossiter, D.: SoilGrids 2.0: producing soil information for the globe with quantified spatial uncertainty, SOIL, 7, 217–240, https://doi.org/10.5194/soil-7-217-2021, 2021. a
Portmann, F. T., Siebert, S., and Döll, P.: MIRCA2000 – Global monthly irrigated and rainfed crop areas around the year 2000: A new high-resolution data set for agricultural and hydrological modeling, Global Biogeochem. Cy., 24, https://doi.org/10.1029/2008GB003435, 2010. a
Potter, N. A.: rnassqs: An “R” package to access agricultural data via the USDA National Agricultural Statistics Service (USDA-NASS) “Quick Stats” API, Journal of Open Source Software, 4, 1880https://doi.org/10.21105/joss.01880, 2019. a
Priyatikanto, R., Lu, Y., Dash, J., and Sheffield, J.: Improving generalisability and transferability of machine-learning-based maize yield prediction model through domain adaptation, Agr. Forest Meteorol., 341, 109652, https://doi.org/10.1016/j.agrformet.2023.109652, 2023. a
Pushkarna, M., Zaldivar, A., and Kjartansson, O.: Data Cards: Purposeful and transparent dataset documentation for responsible AI, in: Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, 1776–1826, https://doi.org/10.1145/3531146.3533231, 2022. a
Richards, M., Kirichenko, P., Bouchacourt, D., and Ibrahim, M.: Does Progress On Object Recognition Benchmarks Improve Real-World Generalization?, arXiv, https://doi.org/10.48550/arXiv.2307.13136, 2023. a
Richetti, J., Diakogianis, F. I., Bender, A., Colaço, A. F., and Lawes, R. A.: A methods guideline for deep learning for tabular data in agriculture with a case study to forecast cereal yield, Comput. Electron. Agr., 205, 107642, https://doi.org/10.1016/j.compag.2023.107642, 2023. a, b
Rodell, M., Houser, P., Jambor, U., Gottschalck, J., Mitchell, K., Meng, C.-J., Arsenault, K., Cosgrove, B., Radakovich, J., Bosilovich, M., Entin, J. K., Walker, J. P., Lohmann, D., and Toll, D.: The global land data assimilation system, B. Am. Meteorol. Soc., 85, 381–394, https://doi.org/10.1175/BAMS-85-3-381, 2004. a, b
Rolnick, D., Aspuru-Guzik, A., Beery, S., Dilkina, B., Donti, P. L., Ghassemi, M., Kerner, H., Monteleoni, C., Rolf, E., Tambe, M., and White, A.: Application-Driven Innovation in Machine Learning, arXiv, https://doi.org/10.48550/arXiv.2403.17381, 2024. a
Ronchetti, G., Nisini Scacchiafichi, L., Seguini, L., Cerrani, I., and van der Velde, M.: Harmonized European Union subnational crop statistics can reveal climate impacts and crop cultivation shifts, Earth Syst. Sci. Data, 16, 1623–1649, https://doi.org/10.5194/essd-16-1623-2024, 2024. a, b, c
Rußwurm, M., Lefèvre, S., and Körner, M.: BreizhCrops: A Satellite Time Series Dataset for Crop Type Identification, arXiv, https://doi.org/10.48550/arXiv.1905.11893, 2019. a, b
Rußwurm, M. and Körner, M.: Multi-Temporal Land Cover Classification with Sequential Recurrent Encoders, ISPRS Int. J. Geo-Inf., 7, https://doi.org/10.3390/ijgi7040129, 2018. a
Sacks, W. J., Deryng, D., Foley, J. A., and Ramankutty, N.: Crop planting dates: an analysis of global patterns, Global Ecol. Biogeogr., 19, 607–620, 2010. a
Sainte Fare Garnot, V., Landrieu, L., Giordano, S., and Chehata, N.: Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention, CVPR, https://doi.org/10.1109/CVPR42600.2020.01234, 2020. a
Schauberger, B., Jägermeyr, J., and Gornott, C.: A systematic review of local to regional yield forecasting approaches and frequently used data resources, Eur. J. Agron., 120, 126153, https://doi.org/10.1016/j.eja.2020.126153, 2020. a, b
Schlenker, W. and Roberts, M. J.: Nonlinear temperature effects indicate severe damages to U.S. crop yields under climate change, P. Natl. Acad. Sci., 106, 15594–15598, https://doi.org/10.1073/pnas.0906865106, 2009. a
Schneider, K. R., Fanzo, J., Haddad, L., Herrero, M., Moncayo, J. R., Herforth, A., Remans, R., Guarin, A., Resnick, D., Covic, N., Béné, C., Cattaneo, A., Aburto, N., Ambikapathi, R., Aytekin, D., Barquera, S., Battersby, J., Beal, T., Molina, P. B., Cafiero, C., Campeau, C., Caron, P., Conforti, P., Damerau, K., Di Girolamo, M., DeClerck, F., Dewi, D., Elouafi, I., Fabi, C., Foley, P., Frazier, T. J., Gephart, J., Golden, C., Fischer, C. G., Hendriks, S., Honorati, M., Huang, J., Kennedy, G., Laar, A., Lal, R., Lidder, P., Loken, B., Marshall, Q., Masuda, Y. J., McLaren, R., Miachon, L., Muñoz, H., Nordhagen, S., Qayyum, N., Saisana, M., Suhardiman, D., Sumaila, U. R., Cullen, M. T., Tubiello, F. N., Vivero-Pol, J.-L., Webb, P., and Wiebe, K.: The state of food systems worldwide in the countdown to 2030, Nature Food, 4, 1090–1110, https://doi.org/10.1038/s43016-023-00885-9, 2023a. a
Schneider, M., Schelte, T., Schmitz, F., and Körner, M.: EuroCrops: The Largest Harmonized Open Crop Dataset Across the European Union, Sci. Data, 10, 612, https://doi.org/10.1038/s41597-023-02517-0, 2023b. a
Seguini, L., Klisch, A., Meroni, M., Vrieling, A., Manfron, G., Atzberger, C., and Rembold, F.: Global near real-time 500 m 10 d FPAR dataset from MODIS and VIIRS for operational agricultural monitoring and crop yield forecasting, Earth Syst. Sci. Data, 18, 309–331, https://doi.org/10.5194/essd-18-309-2026, 2026. a, b
Seidel, S., Palosuo, T., Thorburn, P., and Wallach, D.: Towards improved calibration of crop models – Where are we now and where should we go?, Eur. J. Agron., 94, 25–35, https://doi.org/10.1016/j.eja.2018.01.006, 2018. a
Singh, K., Singh, D., Goroshi, S., Yashu, S., Kumar, Y., Solanki, S., and Sagar, S.: Crop Yield forecasting under FASAL (Forecasting Agricultural output using Space Agrometeorology and Land based observations), Tech. rep., India Meteorological Department (IMD), Ministry of Earth Sciences (MoES), Government of India, https://doi.org/10.13140/RG.2.2.11593.42083, 2022. a
Stuart, L., Hobbins, M., Niebuhr, E., Ruane, A. C., Pulwarty, R., Hoell, A., Thiaw, W., Rosenzweig, C., Muñoz-Arriola, F., Jahn, M., and Farrar, M.: Enhancing Global Food Security: Opportunities for the American Meteorological Society, B. Am. Meteorol. Soc., 104, E760–E777, https://doi.org/10.1175/BAMS-D-22-0106.1, 2024. a
Sweet, L.-b., Müller, C., Anand, M., and Zscheischler, J.: Cross-Validation Strategy Impacts the Performance and Interpretation of Machine Learning Models, Artificial Intelligence for the Earth Systems, 2, https://doi.org/10.1175/AIES-D-23-0026.1, 2023. a
Sweet, L. B., Athanasiadis, I. N., van Bree, R., Castellano, A., Martre, P., Paudel, D., Ruane, A. C., and Zscheischler, J.: Transdisciplinary coordination is essential for advancing agricultural modeling with machine learning, One Earth, https://doi.org/10.1016/j.oneear.2025.101233, 2025. a, b
Tadesse, T., Senay, G. B., Berhan, G., Regassa, T., and Beyene, S.: Evaluating a satellite-based seasonal evapotranspiration product and identifying its relationship with other satellite-derived products and crop yield: A case study for Ethiopia, Int. J. Applied Earth Obs., 40, 39–54, 2015. a
Tanaka, T., Sun, L., Becker-Reshef, I., Song, X.-P., and Puricelli, E.: Satellite forecasting of crop harvest can trigger a cross-hemispheric production response and improve global food security, Commun. Earth Environ., 4, 1–9, https://doi.org/10.1038/s43247-023-00992-2, 2023. a
Thomas, R. L. and Uminsky, D.: Reliance on metrics is a fundamental challenge for AI, Patterns, 3, 100476, https://doi.org/10.1016/j.patter.2022.100476, 2022. a
Tsaftaris, S. A. and Scharr, H.: Sharing the Right Data Right: A Symbiosis with Machine Learning, Trends Plant Sci., 24, 99–102, https://doi.org/10.1016/j.tplants.2018.10.016, 2019. a
Tseng, G., Kerner, H., Nakalembe, C., and Becker-Reshef, I.: Learning to predict crop type from heterogeneous sparse labels using meta-learning, in: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 1111–1120, https://doi.org/10.1109/CVPRW53098.2021.00122, 2021a. a
Tseng, G., Zvonkov, I., Nakalembe, C., and Kerner, H.: CropHarvest: A global dataset for crop-type classification, in: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks, edited by: Vanschoren, J. and Yeung, S., Vol. 1, Curran, https://openreview.net/pdf?id=JtjzUXPEaCu (last access: 30 May 2026), 2021b. a
Tseng, G., Kerner, H., and Rolnick, D.: TIML: Task-Informed Meta-Learning for Agriculture, arXiv, https://doi.org/10.48550/arXiv.2202.02124, 2022. a
Tzachor, A., Devare, M., King, B., Avin, S., and Ó hÉigeartaigh, S.: Responsible artificial intelligence in agriculture requires systemic understanding of risks and externalities, Nature Machine Intelligence, 4, 104–109, https://doi.org/10.1038/s42256-022-00440-4, 2022. a
van der Velde, M. and Nisini, L.: Performance of the MARS-crop yield forecasting system for the European Union: Assessing accuracy, in-season, and year-to-year improvements from 1993 to 2015, Agr. Syst., 168, 203–212, https://doi.org/10.1016/j.agsy.2018.06.009, 2019. a, b, c
Van Klompenburg, T., Kassahun, A., and Catal, C.: Crop yield prediction using machine learning: A systematic literature review, Comput. Electron. Agr., 177, 105709, https://doi.org/10.1016/j.compag.2020.105709, 2020. a
Van Tricht, K., Degerickx, J., Gilliams, S., Zanaga, D., Battude, M., Grosu, A., Brombacher, J., Lesiv, M., Bayas, J. C. L., Karanam, S., Fritz, S., Becker-Reshef, I., Franch, B., Mollà-Bononad, B., Boogaard, H., Pratihast, A. K., Koetz, B., and Szantoi, Z.: WorldCereal: a dynamic open-source system for global-scale, seasonal, and reproducible crop and irrigation mapping, Earth Syst. Sci. Data, 15, 5491–5515, https://doi.org/10.5194/essd-15-5491-2023, 2023. a, b, c
Vermote, E.: MOD09CMG MODIS/Terra Surface Reflectance Daily L3 Global 0.05Deg CMG V006, NASA Land Processes Distributed Active Archive Center [data set], https://doi.org/10.5067/MODIS/MOD09CMG.006, 2015. a, b
Vijverberg, S., Hamed, R., and Coumou, D.: Skillful U.S. Soy Yield Forecasts at Presowing Lead Times, Artificial Intelligence for the Earth Systems, 2, https://doi.org/10.1175/AIES-D-21-0009.1, 2023. a
Waha, K., Van Bussel, L., Müller, C., and Bondeau, A.: Climate-driven simulation of global crop sowing dates, Global Ecol. Biogeogr., 21, 247–259, https://doi.org/10.1111/j.1466-8238.2011.00678.x, 2012. a
Wallach, D., Palosuo, T., Thorburn, P., Hochman, Z., Gourdain, E., Andrianasolo, F., Asseng, S., Basso, B., Buis, S., Crout, N., Dibari, C., Dumont, B., Ferrise, R., Gaiser, T., Garcia, C., Gayler, S., Ghahramani, A., Hiremath, S., Hoek, S., Horan, H., Hoogenboom, G., Huang, M., Jabloun, M., Jansson, P.-E., Jing, Q., Justes, E., Kersebaum, K. C., Klosterhalfen, A., Launay, M., Lewan, E., Luo, Q., Maestrini, B., Mielenz, H., Moriondo, M., Nariman Zadeh, H., Padovan, G., Olesen, J. E., Poyda, A., Priesack, E., Pullens, J. W. M., Qian, B., Schütze, N., Shelia, V., Souissi, A., Specka, X., Srivastava, A. K., Stella, T., Streck, T., Trombi, G., Wallor, E., Wang, J., Weber, T. K., Weihermüller, L., de Wit, A., Wöhling, T., Xiao, L., Zhao, C., Zhu, Y., and Seidel, S. J.: The chaos in calibrating crop models: Lessons learned from a multi-model calibration exercise, Environ. Model. Softw., 145, 105206, https://doi.org/10.1016/j.envsoft.2021.105206, 2021. a
Wang, Y., Albrecht, C. M., Braham, N. A. A., Mou, L., and Zhu, X. X.: Self-supervised learning in remote sensing: A review, IEEE Geosci. Remote S., 10, 213–247, 2022. a
Watson, P. A. G.: Machine learning applications for weather and climate need greater focus on extremes, Environ. Res. Lett., 17, 111004, https://doi.org/10.1088/1748-9326/ac9d4e, 2022. a
Xu, Y., Ma, Y., and Zhang, Z.: Self-supervised pre-training for large-scale crop mapping using Sentinel-2 time series, ISPRS J. Photogramm., 207, 312–325, 2024. a
Yeh, C., Meng, C., Wang, S., Driscoll, A., Rozi, E., Liu, P., Lee, J., Burke, M., Lobell, D., and Ermon, S.: SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning, in: Thirty-fifth Conference on Neural Information Processing Systems, Datasets and Benchmarks Track (Round 2), Curran Associates, Inc., https://openreview.net/forum?id=5HR3vCylqD (last access: 30 May 2026), 2021. a, b, c, d
You, J., Li, X., Low, M., Lobell, D., and Ermon, S.: Deep Gaussian Process for Crop Yield Prediction Based on Remote Sensing Data, in: Proceedings of the AAAI Conference on Artificial Intelligence, Association for the Advancement of Artificial Intelligence (AAAI), Vol. 31, https://doi.org/10.1609/aaai.v31i1.11172, 2017. a, b, c
Zelingher, R. and Makowski, D.: Investigating and forecasting the impact of crop production shocks on global commodity prices, Environ. Res. Lett., 19, 014026, https://doi.org/10.1088/1748-9326/ad0dda, 2023. a
Zhang, T., van der Wiel, K., Wei, T., Screen, J., Yue, X., Zheng, B., Selten, F., Bintanja, R., Anderson, W., Blackport, R., Glomsrød, S., Liu, Y., Cui, X., and Yang, X.: Increased wheat price spikes and larger economic inequality with 2 °C global warming, One Earth, 5, 907–916, https://doi.org/10.1016/j.oneear.2022.07.004, 2022. a
Short summary
Improving crop yield predictions is crucial for food security. Prior research relied on case studies, making it hard to compare methods and track progress. We introduce CY-Bench (Crop Yield Benchmark), a global dataset for forecasting maize and wheat yields across diverse farming systems in over 25 countries. It includes standardized weather, soil, and satellite data, curated by a diverse set of experts. CY-Bench supports the development of tools to help decision-makers plan for food security.
Improving crop yield predictions is crucial for food security. Prior research relied on case...
Altmetrics
Final-revised paper
Preprint