the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Ten years of hydrometeorological observations at 10-minute resolution and its application in machine learning hydrological models
Abstract. Accurate urban flash flood forecasting relies on well-spatialized rainfall data distribution. This study introduces and utilizes the TTI-HydroMet dataset, a publicly available and unique collection for the Tamanduateí River Watershed, in Sao Paulo (Brazil). The dataset includes rainfall measurements from 23 rain gauge stations, stage observations from a hydrological gauge near the outlet, and quantitative precipitation estimates at 1-km radar resolution, accumulated in 10-minute precipitation fields over 10 years. The weather radar data presents missing values for only 0.3 % of timestamps during rainfall events observed by rain gauges. The Spearman correlation coefficient between weather radar and rain gauges varies from 0.675 (full period) to 0.949 (a specific event). It was used to assess the predictive capacity of Machine Learning (ML) hydrological models trained on accumulated rainfall data from rain gauges and estimated by a weather radar. Using an advanced cross-validation framework, two representative algorithms (LinearSVR and XGBRegressor) were tested across different rainfall source configurations and showed strong performance at lead times up to 120 minutes. The Nash–Sutcliffe Efficiency index ranges from 0.781 to 0.996. The statistically comparable performance of ML models driven by radar and rain gauge rainfall indicates that radar-based ML approaches can represent a viable alternative for short-term stage forecasting in regions lacking rain gauge networks.
Competing interests: At least one of the (co-)authors is a member of the editorial board of Earth System Science Data.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.- Preprint
(8784 KB) - Metadata XML
-
Supplement
(214 KB) - BibTeX
- EndNote
Status: open (until 14 Mar 2026)
-
RC1: 'Comment on essd-2025-824', Anonymous Referee #1, 16 Feb 2026
reply
-
AC1: 'Reply on RC1', Leonardo Santos, 24 Feb 2026
reply
Dear Referee #1,
We appreciate your careful review of our manuscript and your valuable feedback. We understand your concern that our article's structure may seem more like a general hydrological research paper than a standard EESD article. However, our primary goal is to provide a comprehensive, well-documented dataset. It is important to note that the application example does not drive the manuscript; rather, it serves as a brief demonstration of the dataset’s practical value.
For these reasons, we believe the manuscript fits the scope of ESSD, as it provides a comprehensive description and assessment of a dataset that is not currently available elsewhere and can support a wide range of hydrological and urban‑flood studies, aligning with ESSD's primary aim (‘reuse of high-quality data’).
We thank the reviewer again for the constructive feedback and the opportunity to clarify the manuscript's positioning.
Best regards,
The authorsCitation: https://doi.org/10.5194/essd-2025-824-AC1
-
AC1: 'Reply on RC1', Leonardo Santos, 24 Feb 2026
reply
-
RC2: 'Comment on essd-2025-824', Anonymous Referee #2, 19 Feb 2026
reply
Dear Editors and authors,
I fully acknowledge, that a 10 years set of gridded 10 minutes weather radar data as combined with serveral rainfall gauges and additional contents for an exemplarly application in Brazil is a rather valuable asset to test several hypothesis in flood and urban flood modelling.
Unfortunately the authors decided to frame the study centered on the application and not on the description and assessemnt of the quality of the data set.
The resulting manuscript reads like a classic HESS/NHESS manuscript and not as an ESSD paper. My suggestion would be to transfer this manuscript to NHESS, or re-shape it focussing on the data set as to be expected for ESSD.
Best regards
Citation: https://doi.org/10.5194/essd-2025-824-RC2 -
AC2: 'Reply on RC2', Leonardo Santos, 24 Feb 2026
reply
Dear Referee #2,
We sincerely appreciate the reviewer's thoughtful and constructive feedback and for acknowledging the value of this dataset.
We understand the concern about including an application example and would like to clarify our intent. Our main goal is to present, document, and evaluate a unique, long-term, high-resolution rainfall dataset for an urban area in Brazil. The application was included solely to demonstrate the dataset’s relevance and potential use.
The core focus of the manuscript is on the dataset itself: (i) its development and processing workflow, (ii) the merging of radar and gauge observations, (iii) the quality control procedures, (iv) validation and uncertainty assessment, and (v) its structure, accessibility, and reusability. These aspects constitute the main contribution of the work and align with ESSD’s goal of publishing well-documented, high-quality datasets that support future scientific research. The application example is not the manuscript’s main focus; instead, it acts as a brief illustration of the dataset’s practical utility.
Therefore, we believe our manuscript fits within ESSD’s scope, as it provides a detailed description and assessment of a dataset that is not currently available elsewhere and could support various hydrological and urban flood studies, aligning with ESSD's primary aim (‘reuse of high-quality data’).
We thank the reviewer again for the constructive feedback and the opportunity to clarify the manuscript's purpose.
Best regards,
The authorsCitation: https://doi.org/10.5194/essd-2025-824-AC2
-
AC2: 'Reply on RC2', Leonardo Santos, 24 Feb 2026
reply
Data sets
TTI-HydroMet: A Decade of High-Resolution Rainfall and Streamflow for the Tamanduateí River Watershed, Brazil Elton Vicente Escobar-Silva et al. https://zenodo.org/records/17654660
Viewed
| HTML | XML | Total | Supplement | BibTeX | EndNote | |
|---|---|---|---|---|---|---|
| 243 | 107 | 21 | 371 | 30 | 20 | 16 |
- HTML: 243
- PDF: 107
- XML: 21
- Total: 371
- Supplement: 30
- BibTeX: 20
- EndNote: 16
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Dear Authors & Editor,
I read the manuscript. The manuscript is potentially interesting, but I have the impression the data doesnot comply with the aims ('any interpretation of data is outside the scope of regular articles'). It reads more like a general research article for a hydrological journal (eg HESS/HESSD and similar) with some more information about the input data
With kind regards,