Preprints
https://doi.org/10.5194/essd-2025-824
https://doi.org/10.5194/essd-2025-824
27 Jan 2026
 | 27 Jan 2026
Status: this preprint is currently under review for the journal ESSD.

Ten years of hydrometeorological observations at 10-minute resolution and its application in machine learning hydrological models

Kleber L. Rocha-Filho, Lidiane S. Lima, Elton V. Escobar-Silva, Rafael M. P. Teixeira, Andrea S. Viteri López, Glauston R. T. Lima, Jaqueline A. J. P. Soares, Cristiano W. Eichholz, Flavio Conde, Carlos A. M. Rodriguez, Joaquin I. B. Garcia, and Leonardo B. L. Santos

Abstract. Accurate urban flash flood forecasting relies on well-spatialized rainfall data distribution. This study introduces and utilizes the TTI-HydroMet dataset, a publicly available and unique collection for the Tamanduateí River Watershed, in Sao Paulo (Brazil). The dataset includes rainfall measurements from 23 rain gauge stations, stage observations from a hydrological gauge near the outlet, and quantitative precipitation estimates at 1-km radar resolution, accumulated in 10-minute precipitation fields over 10 years. The weather radar data presents missing values for only 0.3 % of timestamps during rainfall events observed by rain gauges. The Spearman correlation coefficient between weather radar and rain gauges varies from 0.675 (full period) to 0.949 (a specific event). It was used to assess the predictive capacity of Machine Learning (ML) hydrological models trained on accumulated rainfall data from rain gauges and estimated by a weather radar. Using an advanced cross-validation framework, two representative algorithms (LinearSVR and XGBRegressor) were tested across different rainfall source configurations and showed strong performance at lead times up to 120 minutes. The Nash–Sutcliffe Efficiency index ranges from 0.781 to 0.996. The statistically comparable performance of ML models driven by radar and rain gauge rainfall indicates that radar-based ML approaches can represent a viable alternative for short-term stage forecasting in regions lacking rain gauge networks.

Competing interests: At least one of the (co-)authors is a member of the editorial board of Earth System Science Data.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Kleber L. Rocha-Filho, Lidiane S. Lima, Elton V. Escobar-Silva, Rafael M. P. Teixeira, Andrea S. Viteri López, Glauston R. T. Lima, Jaqueline A. J. P. Soares, Cristiano W. Eichholz, Flavio Conde, Carlos A. M. Rodriguez, Joaquin I. B. Garcia, and Leonardo B. L. Santos

Status: open (until 05 Mar 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Kleber L. Rocha-Filho, Lidiane S. Lima, Elton V. Escobar-Silva, Rafael M. P. Teixeira, Andrea S. Viteri López, Glauston R. T. Lima, Jaqueline A. J. P. Soares, Cristiano W. Eichholz, Flavio Conde, Carlos A. M. Rodriguez, Joaquin I. B. Garcia, and Leonardo B. L. Santos

Data sets

TTI-HydroMet: A Decade of High-Resolution Rainfall and Streamflow for the Tamanduateí River Watershed, Brazil Elton Vicente Escobar-Silva et al. https://zenodo.org/records/17654660

Kleber L. Rocha-Filho, Lidiane S. Lima, Elton V. Escobar-Silva, Rafael M. P. Teixeira, Andrea S. Viteri López, Glauston R. T. Lima, Jaqueline A. J. P. Soares, Cristiano W. Eichholz, Flavio Conde, Carlos A. M. Rodriguez, Joaquin I. B. Garcia, and Leonardo B. L. Santos
Latest update: 28 Jan 2026
Download
Short summary
This study introduces and utilizes the TTI-HydroMet dataset, including rainfall data from 23 rain gauge stations, stage observations from a hydrological gauge, and quantitative precipitation estimates at high temporal and spatial resolution. The paper's main message is that radar-based Machine Learning approaches can be a viable alternative for short-term stage forecasting in regions lacking rain gauge networks, supported by a reproducible case study.
Share
Altmetrics