the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
State-of-the-art hydrological datasets exhibit low water balance consistency globally
Abstract. The proliferation and diversification of hydrological datasets have significantly advanced hydrological research. However, the coherence across these datasets remains poorly understood, hindering the comparability of findings derived from different data sources and variables. Here, we demonstrate that state-of-the-art hydrological datasets exhibit overall low consistency when evaluated through the lens of water balance – specifically, the relationship between variations in soil moisture and the difference between precipitation, evapotranspiration, and runoff. Our analysis reveals that satellite-based precipitation datasets generally show the highest consistency, while gauge-based datasets perform better in densely monitored regions of the Northern Hemisphere. For evapotranspiration, runoff, and soil moisture, reanalysis datasets demonstrate broader areas of higher consistency compared to gauge- or satellite-based products. Spatial patterns of consistency are strongly influenced by aridity and temperature, which affect measurement and modelling accuracy, while vegetation cover further modulates the performance of soil moisture datasets. Notably, dataset consistency has improved significantly in northern mid-latitudes over recent decades, likely reflecting advancements in observational technologies and the effects of climate warming. These findings underscore the importance of continued efforts to enhance dataset coherence and reliability for robust hydrological assessments.
- Preprint
(1502 KB) - Metadata XML
-
Supplement
(6160 KB) - BibTeX
- EndNote
Status: open (until 16 Nov 2025)
- RC1: 'Comment on essd-2025-376', Anonymous Referee #1, 24 Sep 2025 reply
-
RC2: 'Comment on essd-2025-376', Anonymous Referee #2, 08 Oct 2025
reply
My review comments are structured as follows: Overall Assessment, Major Strengths, and Recommendations for Improvement.
I. Overall Assessment
This paper presents a systematic evaluation of the water balance consistency of 47 state-of-the-art hydrological datasets (precipitation, evapotranspiration, runoff, and soil moisture) using 8,294 independent combinations. The methodology is rigorous, the data coverage is extensive, and the study holds significant scientific and practical value. It reveals a widespread lack of water balance consistency in current global hydrological datasets and provides an in-depth analysis of the spatial patterns, influencing factors, and temporal trends. The manuscript is well-structured, the methods are transparent, and the results are credible. I recommend acceptance after minor revisions.
II. Major Strengths
1.High Novelty: This is the first study to systematically assess the consistency of multi-source, multi-variable hydrological datasets from a water balance perspective, filling a critical gap in the current literature.
2.Methodological Rigor:
a.The use of independent dataset combinations effectively avoids spurious consistency arising from the use of the same model or forcing data.
b.The use of adjusted R² as the consistency metric mitigates errors introduced by unit inconsistencies between variables.
c.The application of SHAP for factor attribution enhances the interpretability of the results.
3.Comprehensive Data Coverage: The inclusion of gauge-based, satellite-based, and reanalysis products ensures broad spatiotemporal coverage and strong representativeness.
4.Insightful and Actionable Results:
a.Clearly identifies the strengths and weaknesses of different data sources across various regions and climatic conditions.
b.Highlights the significant impact of soil moisture data depth on consistency.
c.Reveals an improvement in dataset consistency in mid-to-high latitude regions of the Northern Hemisphere in recent decades.
III. Recommendations for Improvement1. Clarifications in the Methods Section
Handling Soil Moisture Depth Differences: While the manuscript states that ΔSM represents "change," the response of soil moisture at different depths to P-ET-R varies. It would be beneficial to clarify if any normalization or sensitivity analysis was performed for ΔSM across different depths.
Temporal Scale Analysis: The significant differences in consistency between daily and annual scales warrant further discussion of the underlying physical mechanisms (e.g., high noise at daily scales, strong smoothing effects at annual scales).
2. Deepening the Results and Discussion
Root Causes of Low Consistency: Beyond the mentioned observational errors and model structures, could factors like surface-groundwater exchange or human activities (e.g., irrigation, reservoir regulation) also contribute? Expanding the discussion on this point would be valuable.
Mechanisms Behind Spatial Consistency Patterns: For instance, is the low consistency in high-latitude regions linked to insufficient representation of processes like snowpack and permafrost? Further interpretation in the context of existing literature is recommended.
3. Figures and Presentation
Figure 1: The meaning of the asterisk * and dashed lines in the boxplots should be explicitly stated in the figure caption.
Figure 2: The grey areas, indicating "multiple datasets show similar performance or low consistency," would benefit from having the specific thresholds for "similar" and "low" defined in the caption or figure.
Supplementary Material: Briefly mentioning the names of the best/worst performing datasets from Figures S13–S28 in the main text would help readers quickly grasp key findings.
4. Language and Formatting
Some sentences are quite long; breaking them up would improve readability.
Terminology should be checked for consistency (e.g., unified use of "gauge-based" vs. "station-based").
IV. Recommendation
Recommendation: Minor Revision
This manuscript makes a pioneering contribution to the evaluation of hydrological datasets. It is scientifically sound, its conclusions are robust, and it provides crucial insights for hydrological model development, data fusion, and climate change research. I recommend acceptance after the authors address the points above.
Model code and software
Assess water balance consistency of state-of-the-art hydrological datasets Hao Huang and René Orth https://github.com/HowHuang/WaterBalanceConsistency
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
1,294 | 87 | 33 | 1,414 | 48 | 30 | 40 |
- HTML: 1,294
- PDF: 87
- XML: 33
- Total: 1,414
- Supplement: 48
- BibTeX: 30
- EndNote: 40
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Manuscript: ESSD-2025-376
Huang et al. contribute to understanding the limitations of hydrological datasets (ground-, satellite-, and reanalysis-based) in capturing the relationship between monthly variations in soil moisture (SM) and the difference between precipitation (P), evapotranspiration (ET), and runoff (R) at a pixel scale around the world. Additionally, the manuscript’s results contribute to identifying the most suitable datasets for different geographical and ecological regions, which is important for reducing uncertainty in ecological, climatological, and hydrological studies using the evaluated datasets.
Overall, I found the paper well written and organized, and suitable for publication in the ESSD journal, but I have some comments that should be addressed before publication consideration. Particularly, some work is required to improve the clarity of the methods and results sections: (i) Explain how lateral flows and water table depth may potentially bias the proposed water balance at pixel scale, leading to the low water balance consistency reported in the manuscript; (ii) provide a clearer explanation of the linear relationship between SM, P, ET, and R (P – ET - R)s = k ΔSMs) at the monthly scale, including the potential limitations of assuming a linear relationship.
Major comments:
Minor comments:
Technical corrections:
References
Fan, Y., Li, H., and Miguez-Macho, G.: Global Patterns of Groundwater Table Depth, Science, 339, 940–943, https://doi.org/10.1126/science.1229881, 2013.
Miguez-Macho, G. and Fan, Y.: A global humidity index with lateral hydrologic flows, Nature, 644, 413–419, https://doi.org/10.1038/s41586-025-09359-3, 2025.
Nobre, A. D., Cuartas, L. A., Hodnett, M., Rennó, C. D., Rodrigues, G., Silveira, A., Waterloo, M., and Saleska, S.: Height Above the Nearest Drainage – a hydrologically relevant new terrain model, Journal of Hydrology, 404, 13–29, https://doi.org/10.1016/j.jhydrol.2011.03.051, 2011.
Vargas Godoy, M. R., Markonis, Y., Thomson, J. R., Ballarin, A. S., Perri, S., Miao, C., Sun, Q., Hanel, M., Papalexiou, S. M., Kummerow, C., Oki, T., and Molini, A.: Which Precipitation Dataset to Choose for Hydrological Studies of the Terrestrial Water Cycle?, Bulletin of the American Meteorological Society, BAMS-D-24-0306.1, https://doi.org/10.1175/BAMS-D-24-0306.1, 2025.