the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Satellite Altimetry-based Extension of global-scale in situ river discharge Measurements (SAEM)
Abstract. River discharge is a crucial measurement, indicating the volume of water flowing through a river cross-section at any given time. However, the existing network of river discharge gauges faces significant issues, largely due to the declining number of active gauges and temporal gaps. Remote sensing, especially radar-based techniques, offers an effective means to this issue. This study introduces the Satellite Altimetry-based Extension of the global-scale in situ river discharge Measurements (SAEM) data set, which utilizes multiple satellite altimetry missions and estimates discharge using the existing worldwide networks of national and international gauges. In SAEM, we have explored 47 000 gauges and estimated height-based discharge for 8 730 of them which is approximately three times the number of gauges of the largest existing remote sensing-based data set. These gauges cover approximately 88 % of the total gauged discharge volume. The height-based discharge estimates in SAEM demonstrate a median Kling-Gupta Efficiency (KGE) of 0.48, outperforming current global data sets. In addition to the river discharge time series, the SAEMdata set comprises three more products, each contributing a unique facet to better usage of our data: (1) A catalog of Virtual Stations (VSs), defined by certain predefined criteria. In addition to each station’s coordinates, this catalog provides information on satellite altimetry missions, distance to the discharge gauge, and relevant quality flags.(2) The altimetric water level time series of those VSs are included, for which we ultimately obtained good-quality discharge data. These water level time series are sourced from both existing Level-3 water level time series and newly generated ones within this study. The Level-3 data are gathered from pre-existing data sets, including Hydroweb.Next (former Hydroweb), the Database of Hydrological Time Series of Inland Waters (DAHITI), the Global River Radar Altimeter Time Series (GRRATS), and HydroSat. (3) SAEM’s third product is rating curves for the defined VSs, which map water level values into discharge values, derived using a Nonparametric Stochastic Quantile Mapping Function approach. The SAEM data set can be used to improve hydrological models, inform water resource management, and address non-linear water-related challenges under climate change. The SAEM data set is available from (Saemian et al., 2024) https://doi.org/10.18419/darus-4475 during the review process.
- Preprint
(4608 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (extended)
-
RC1: 'Comment on essd-2024-406', Adrien Paris, 14 Nov 2024
reply
Please find here after my comment on essd-2024-406 manuscript. Overall, the manuscript is well written and easy to read. The dataset presented here will be useful for a large panel of users and my recommendation would be that the manuscript is accepted after revisions.
Some questions remain regarding the possible discrepancies within the dataset not being acknowledged (intermission biases, local differences due to geoid, retracking methods) in this version, regarding the way uncertainties are taken into consideration and regarding some methodological choices. The results also should be further criticized on the light of the input data, given that they may depend on the validation method and/or validation data availability.
Hereafter are my main remarks with the corresponding lines in the initial version.
Kind regards.
Section 2.3.1
I miss a § at the end of 2.3.1 stating the differences of the aforementioned databases (open accessibility or not, timeliness, NRT availability or not, etc.), so that the reader understands why several databases were used; This could be done extending Table 2.
Has any inter-validation been performed when/where overlaps (in VSs) were found? Are there any in the literature? In this section, the reader should understand what were the choices made in case of overlap and why such choices were made. This is crucial for people that would like to duplicate / extend the dataset.
Section 2.3.1 §1 Hydroweb.next
- Please check the satellites list (L97-98) which were used to produce Hwb-Next time series of rivers WL;
- The data sources (L99,100) and the reference apply for lakes and not rivers; For rivers, please cite Santos da Silva J., S. Calmant, O. Rotuono Filho, F. Seyler, G. Cochonneau, E. Roux, J. W. Mansour, Water Levels in the Amazon basin derived from the ERS-2 and ENVISAT Radar Altimetry Missions, Remote Sensing of the Environment, 2010, doi:10.1016/j.rse.2010.04.020
Section 3.1 Construction of VS catalog
- L138-139: please specify the dams and reservoirs database used in input
- Specify that SWORD V16 (from Fig. 3) was used. Is it version dependent, or did you created a customized one, possibly with connectivity issues corrected? Will the code for creating this VS catalog with connectivity and hydrological constraints be made available to the community together with the database?
- L142-143: put the ENVISAT series list in chronological order
Section 3.2 WL time series
- Since SAEM includes L3 WL time series, I consider a dedicated paragraph on inter-mission biases is mandatory. Even though discharges are estimated through mono-mission non parametric curves, such biases shall be taken in consideration by whoever want to use the L3 WL TS from this database or any other all together to build long TS. Moreover since different retrackers are used among the missions.
- L152-153: this masking choice implies, for past missions, to keep a very low number of "Hi" measurements from your raw altimetric data. For small rivers, this would mean keeping only the one measurement geolocated over the water. What are your statistics on this point (percentage of dates with <2 Hi measurements for example), and do this have an influence on the final median that you process (and which uncertainty do you provide in this case)? I believe this should be further explained.
- L171: Is XGM2019 used in one of the L3 databases used in this study? Differences between gravity field models can lead to lat/lon dependent biases between the series from external databases (e.g. Hydroweb.next) that are on EGM2008 and the L3 processed time series. In a matter of uniformity, using the same GGM would be recommendable.
- L180-184: it would be worth having a table with those statistics. I.e. how many TS were generated, how many (in %?) passed the QC check (total and by mission), and why.
3.3 Non parametric rating curve
- L187-193: I suggest discriminating the advantages as a function of the approach. Indeed,
- the "does not necessitate simultaneous ..." is not due to the non-parametric and could be also the case for linear regression;
- "it assumes no specific predefined ..." is only for the non-parametric, and it should be stated why this is an advantage comparing to other formulations;
- realistic uncertainty: is not dependent on the formulation but comes from the MC simulations and the creation of a stack of WL and Q. - L206-208: This means that the uncertainty raisen from previous steps is not being used. Why this choice? The “10% multiplicative uncertainty” should be better explained. Also, a sensitivity analysis of the MC algorithm to the uncertainty (and consequently, the sensitivity of the non-parametric RCs derived) shall be investigated (testing different bounds for uncertainty, and even informative uncertainty such as the one coming from WL processing).
- L208: Convergence of MC algorithms is an important point. What convergence criteria was considered, and did all the Q/H Ts reached convergence ? If not, how many did not passed, and on which reaches, rivers, etc?
5 Results
- L275-279: A statistics on the step where data was rejected could be very important: was it due to a lack of raw (gdr) data, during the WL processing, during the NPQM, quality test? etc. The visualization could be done maybe with a geographical (lat/lon) view?
- L300: remove the “exceptionally”. The model performs indeed well in these regions, yet KGE and Corr remain in the “good” domain and not in the “exceptional”
- L305: “the overall good accuracy” instead of “high accuracy”, for the same reasons than above
- L306: is 73% the right value? Please check, this seems pretty high to me.
- L310-311: I am quite surprised by the low values show by S3B (same as T/P Jason). It is important to bring some elements of explanation in the discussion section, since S3B is expected to perform at least as good as S3A, at least in terms of WL estimates. What can explain such difference? The length of the TS? The lack of in situ data for validation on the recent period? Anythink else? Please discuss it. Regarding Topex/Jason also, are all the missions giving similar results? Or is the global quality impacted by the oldest missions? When ENVISAT is mentioned, is it only ENVISAT or is it the family (ERS/ENV/SRL) ?
- L345-348: provide other metrics to complement KGE (KGE improvement of 0.15 is somehow unease to quantify) (e.g. NRMSE in %, other)
- L361: CCI WL rely mostly on hand-processed time series using Altis software (and in particular it is the case for Niger TSs, please correct.
- Figure 10: I would rather show the comparison in terms of anomaly (or also show), in order to better evidence differences between SAEM and CCI for all the discharge amplitude (can be anomaly or normalized anomaly). Differences for small discharges do not appear clearly here. Same for WL.
5.3 Discussion
- I miss a paragraph dedicated to the discussion around uncertainties. For example, does the CCI RC fall inside the uncertainty bound for SAEM RCs? What is the contribution of providing uncertainty in discharge estimate. Does this uncertainty is useful, and in line with other (e.g. CCI) provided uncertainties?
- L420-425: This matter is worth discussing a little more. See Kirchner 2006 "getting the right answers", the choice of non-parametric vs parametric RCs can be discussed here, bringing together the advantages (as already shown) but also the drawbacks of both methods (hydraulic-based vs data-fit based)
- As said above, the relative performance of SAEM discharges as a function of the mission should be discussed, since the classical “quality evolution” with time is not respected here. This can be due to several aspects, which need to be discussed so that the reader/user understand what lies in this validation statistics.
Citation: https://doi.org/10.5194/essd-2024-406-RC1
Data sets
Satellite Altimetry-based Extension of global-scale in situ river discharge Measurements (SAEM) Peyman Saemian, Omid Elmi, Molly Stroud, Ryan Riggs, Benjamin M. Kitambo, Fabrice Papa, George H. Allen, and Mohammad J. Tourian https://doi.org/10.18419/darus-4475
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
222 | 48 | 63 | 333 | 6 | 8 |
- HTML: 222
- PDF: 48
- XML: 63
- Total: 333
- BibTeX: 6
- EndNote: 8
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1