the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A 500-m crop water requirement and irrigation water demand dataset for 25 crop types in the Yellow River Basin (2000–2020): Revealing significant underestimation from incomplete crop coverage
Abstract. Accurate estimation of irrigation water demand (IWD) is fundamental for sustainable water resources management in water-scarce agricultural regions. However, existing IWD assessments typically consider only a limited number of major crops, potentially leading to systematic underestimation of basin-scale water demand. This study develops a comprehensive high-resolution (500 m) dataset of crop water requirement (CWR, equivalent to the net irrigation water requirement, i.e., crop evapotranspiration minus effective precipitation) and IWD (CWR divided by irrigation water use efficiency) for the Yellow River Basin (YRB) covering 25 crop types from 2000 to 2020. We first evaluated eight remote sensing-based cropland datasets against statistical records from 135 administrative units, identifying the Global Land Analysis and Discovery (GLAD) dataset as optimal. CWR and IWD were estimated using the FAO Penman-Monteith approach with spatially explicit crop coefficients and spatiotemporally dynamic irrigation water use efficiency coefficients. The dataset provides two complementary versions: a sown area-based version reflecting the full theoretical agricultural water gap (multi-year average CWR and IWD of 548.3 × 108 m3 and 1086.8 × 108 m3, respectively), and an irrigated area-based version constrained by actual irrigation extent (258.6 × 108 m3 and 508.4 × 108 m3), broadly consistent with independent estimates of actual irrigation water consumption. Our results reveal that considering only five major crops would underestimate CWR and IWD by approximately 33 % and 34 %, respectively, with the largest underestimation in the upper reach (approximately 45 %). More importantly, incomplete crop coverage not only causes quantitative bias but also misrepresents temporal dynamics, yielding opposite trend directions in some sub-basins. Sensitivity analysis indicates that 12–15 crop types are required to capture over 90–95 % of basin-scale water demand, with vegetables and tubers ranking among the top six contributors despite being frequently omitted in previous studies. The dataset (Tang and Zhang, 2026) is publicly available at https://doi.org/10.5281/zenodo.18628324.
- Preprint
(2353 KB) - Metadata XML
-
Supplement
(445 KB) - BibTeX
- EndNote
Status: final response (author comments only)
- RC1: 'Comment on essd-2026-128', Anonymous Referee #1, 06 May 2026
-
RC2: 'Comment on essd-2026-128', Anonymous Referee #2, 02 Jun 2026
This manuscript presents a new crop water requirement (CWR) and irrigation water demand (IWD) dataset for the Yellow River Basin and argues that previous studies substantially underestimate water demand because they consider too few crop types. The topic is important, and the dataset could be useful for the community. However, several key methodological assumptions raise concerns about the reliability of the spatial patterns and quantitative estimates. The manuscript currently places strong emphasis on the underestimation caused by incomplete crop coverage, while the uncertainty introduced by its own assumptions is not adequately evaluated.
1. The dataset is described as 500 m resolution, but the actual information content is much coarser
The manuscript repeatedly highlights the development of a 500 m dataset. However, crop-specific statistics are only available at county or prefecture level and are distributed uniformly across all cropland pixels within each administrative unit.
As a result, the dataset does not actually know where individual crops are located within a county. Every cropland pixel receives the same crop composition. Therefore, although the output grid is 500 m, the crop information is effectively available only at the county level.
This distinction is important because readers may assume that the dataset represents realistic crop distributions at 500 m resolution, which it does not.
The authors should clearly explain this limitation and avoid overstating the effective spatial resolution of the dataset.
2. The uniform allocation of crops is a very strong assumption
The study assumes that all crops are evenly distributed across all cropland pixels within a county.
In reality, crop locations are strongly influenced by:
- irrigation availability,
- elevation,
- climate,
- soil conditions,
- local agricultural practices.
For example, rice, vegetables, cotton, potatoes, and wheat are typically concentrated in different parts of a county rather than being uniformly distributed.
This assumption likely has a major impact on the spatial patterns shown in the maps. Yet the manuscript provides no assessment of how much uncertainty this introduces.
The authors should discuss this limitation more openly and, if possible, evaluate its influence on the results.
3. Planting dates and growing seasons are assumed to be identical across the entire basin
The Yellow River Basin covers a very large geographic area with substantial differences in climate and elevation.
Nevertheless, the study applies the same planting dates and growth durations across the basin for each crop type.
This is difficult to justify. Crops in Qinghai, Inner Mongolia, Henan, and Shandong do not have the same growing calendar.
Because crop water demand is highly sensitive to crop timing, this assumption may introduce substantial errors. The manuscript should explain why spatially varying crop calendars were not considered and discuss the resulting uncertainty.
4. Validation is incomplete
The manuscript validates cropland area estimates and selects the GLAD dataset as the best-performing product.
However, the final product depends on several additional components:
- crop allocation,
- crop coefficients,
- crop calendars,
- irrigation efficiency.
None of these components are validated.
Therefore, the validation only demonstrates that the selected cropland map reasonably represents total cropland area. It does not demonstrate that the final crop-specific CWR and IWD estimates are accurate.
The manuscript should be more cautious when discussing dataset reliability.
5. The uncertainty analysis is insufficient
The paper argues that previous studies underestimate water demand by about 30–45% because they include too few crop types.
This may be true.
However, the manuscript does not quantify uncertainty associated with:
- crop allocation assumptions,
- crop coefficient interpolation,
- irrigation efficiency estimates,
- planting date assumptions,
- effective precipitation calculations.
Without such analysis, it is difficult to determine whether the reported underestimation is larger than the uncertainty of the dataset itself.
A more comprehensive uncertainty assessment is needed.
6. Irrigation efficiency before 2009 is reconstructed rather than observed
The irrigation water use efficiency values before 2009 are estimated using extrapolation because observations are unavailable.
Since irrigation efficiency directly affects IWD calculations, this assumption may strongly influence long-term trends.
The manuscript should provide a sensitivity analysis showing how different assumptions about historical irrigation efficiency affect the results.
7. The key conclusion is partly expected from the data
The central conclusion is that considering only five crops underestimates water demand.
However, the manuscript shows that these five crops account for only about two-thirds of total sown area.
Therefore, some degree of underestimation is inevitable.
The more important question is not whether underestimation exists, but which omitted crops contribute most and why their water demand is disproportionately important.
The manuscript would be stronger if it focused more on the contributions of individual crop groups rather than simply comparing five crops versus twenty-five crops.
More specifically,
Many abbreviations are difficult to follow and are not clearly defined when first introduced. Examples include IWUEC, CACD, CLCD, MCID, GLAD, CLUD-A, and GLASS-GLC.
The manuscript should explain how datasets with resolutions ranging from 30 m to 5 km were compared fairly during cropland validation.
The use of inverse distance weighting for meteorological interpolation should be justified.
The manuscript combines county-level and prefecture-level agricultural statistics. The possible impacts of mixing these spatial scales should be discussed.
The discussion section focuses heavily on the strengths of the dataset but gives relatively little attention to its limitations.
Citation: https://doi.org/10.5194/essd-2026-128-RC2
Data sets
A 500-m crop water requirement and irrigation water demand dataset for 25 crop types in the Yellow River Basin (2000–2020) Shizhen Tang and Hongbo Zhang https://doi.org/10.5281/zenodo.18628324
Viewed
| HTML | XML | Total | Supplement | BibTeX | EndNote | |
|---|---|---|---|---|---|---|
| 479 | 259 | 28 | 766 | 61 | 31 | 36 |
- HTML: 479
- PDF: 259
- XML: 28
- Total: 766
- Supplement: 61
- BibTeX: 31
- EndNote: 36
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
This study develops a 500 m crop water requirement (CWR) and irrigation water demand (IWD) dataset for 25 crop types in the Yellow River Basin from 2000–2020. It finds that incomplete crop coverage can substantially underestimate basin-scale water demand, which is important and policy relevant.
However, the interpretation of the 500 m product needs to be clarified. A key limitation is that crop-specific sown areas are uniformly allocated to cropland pixels within each administrative unit, rather than derived from actual crop-type maps. Therefore, the dataset is not an actual fully validated 500 m crop-specific irrigation-demand map.
Several assumptions in the CWR calculation also need clearer treatment, especially the effective-rainfall method and the use of apparently fixed crop calendars. These limitations are discussed, but they should be integrated more directly into the Abstract, data description, uncertainty assessment, and dataset-use guidance.
Detailed comments: