StageIV-IRC: A High-resolution Dataset of Extreme Orographic Quantitative Precipitation Estimates (QPE) Constrained to Water Budget Closure for Historical Floods in the Appalachian Mountains

Liao, Mochi; Barros, Ana

doi:10.5194/essd-2025-554

Preprints

https://doi.org/10.5194/essd-2025-554

Preprints

19 Sep 2025

| 19 Sep 2025

Status: a revised version of this preprint was accepted for the journal ESSD and is expected to appear here in due course.

StageIV-IRC: A High-resolution Dataset of Extreme Orographic Quantitative Precipitation Estimates (QPE) Constrained to Water Budget Closure for Historical Floods in the Appalachian Mountains

Mochi Liao and Ana Barros

Abstract. Quantitative Flood Estimation (QFE) in complex terrain remains a grand challenge in operational hydrology due to the lack of accurate high-resolution Quantitative Precipitation Estimates (QPE) for operational forecasting and for calibrating hydrologic models. Here, we present a high-resolution (i.e., 250 m, 5-minute-hourly) QPE dataset for 215 extreme rainfall events occurred in 26 gauged mountainous basins in the Appalachian Mountains from 2008 to 2024. This dataset is developed by applying inverse rainfall corrections (IRC) derived from physically-based rainfall-runoff modeling (Liao and Barros, 2022 and 2023) to the Next Generation Weather Radar (NEXRAD) Stage IV analysis (4 km resolution, hourly). The corrected Stage IV analysis QPE is referred to as StageIV-IRC (StageIV with Inverse Rainfall Correction). The unique advantage of this StageIV-IRC QPE dataset is its agreement with ground-based rainfall measurements while achieving water budget closure at the storm-flood event scale within observational uncertainty of streamflow observations, which is the gold standard in hydrological modeling. This dataset is the first QPE dataset aiming to improve QFE in the complex terrain by reducing biases for extreme precipitation events, and it can be used to evaluate the skill of hydrologic models in the same basins and support model calibration. The StageIV-IRC QPE dataset is publicly available at https://doi.org/10.5281/zenodo.14028866, and improved initial soil moisture maps for the studied extreme precipitation events, derived from the same IRC framework, are available in the same repository (Liao and Barros, 2025c).

Received: 11 Sep 2025 – Discussion started: 19 Sep 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Mochi Liao and Ana Barros

Status: closed

RC1:
'Comment on essd-2025-554', Anonymous Referee #1, 02 Oct 2025

The author developed a High-resolution Dataset of Extreme Orographic QPE by closing the water budget using stream gauge measurements. This is a novel method and will be of great value if further validated. Therefore, I recommend a major revision, as some clarification is needed, and more dataset evaluation may be beneficial.
Major comments:

1. I would recommend that the authors mention ICC as well in the abstract, as it is also one step in the precipitation data generation.

2. I recommend that the author provide a brief code to show how to read the data. The current format and structure of the data are unclear. It will be helpful for readers to try the data.

3. Are the ICC and IRC corrections implemented simultaneously in windows 2 and 5? Intuitively, overestimated rainfall values can compensate for an underestimated initial soil moisture condition. I am curious whether this compensation causes some difficulties in determining precipitation.

4. In the inverse correction process, there are likely more unknowns (precipitation at each pixel) than the knowns (observed discharge). Is it possible to obtain two different precipitation fields that can generate very similar discharge? How can you guarantee that you can get the "optimal" precipitation fields compared to other possible realizations? Is it reasonable to obtain an ensemble precipitation dataset to account for this variability?

5. Why did the authors select Stage IV as the primary precipitation source? In the first step, the authors downscale the precipitation field from 4km to 1km. Other available precipitation datasets, such as MRMS and AORC, provide precipitation estimates at a 1km resolution. If the authors use these 1km datasets, the downscale step can be removed.

6. L201-204, what does "self-similar statistics" mean? In L213, what does "the same rainfall statistics" mean here? I am curious which type of rainfall statistics is preserved in the downscaling process.

7. What is the size of the rainfall field in Ordinary Kriging? Is it a basin-based correction? Ordinary Kriging has the assumption of geostationary, which may not perform optimally when applied to a large complex region.

8. L505-L508, the authors mentioned that "The climatologically corrected STIV_DBKC fields have a significantly accurate diurnal cycle compared to only event-scale bias-corrected STIV_DBK." But in Figure 5, I did not see many differences between the blue and green lines. And should not the "STIV_DBK" here be "STIV_DB"?

9. L610, the authors mentioned that "IRC-ICC" is the recommended dataset. In Section 5, the author provides the citation for "IRC". Why don't the authors publish IRC-ICC?

10. I recommend that the authors provide the results of STIV_IRC_ICC in Figures 5, 6, and 7. I understand that the lack of rainfall ground truth makes the evaluation of precipitation data a little bit hard. The better discharge estimates from your methods cannot reflect the absolute accuracy of precipitation data, as the discharge is your objective function. I would recommend more evaluation of the precipitation data itself. Alternatively, you can use STIV_IRC_ICC to drive another hydrologic model to evaluate whether you can also have a better discharge prediction than Stage IV. Model calibration can also be implemented, as hydrologists usually do so with a precipitation dataset.
Minor comments:

1. I recommend that the authors clarify the terminology usage. In Figure 2, the event scale bias correction is noted as STIV_BD. But in some places of the figure and the article, STIV_DBK is used.

2. L690. The resolution of StageIV_D is "1km, hourly" in Figure 1, but you mention " the same resolution as StageIV_D datasets (250m, 5min)".

3. Provide the legend in Figure A3, Figure 8,9, 11, 12

4. Provide the unit in Figure 10

5. Provide the y-axis in Figure 11

Citation: https://doi.org/10.5194/essd-2025-554-RC1
- AC1: 'Reply on RC1', Mochi Liao, 24 Nov 2025
  
  The comment was uploaded in the form of a supplement: https://essd.copernicus.org/preprints/essd-2025-554/essd-2025-554-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/essd-2025-554-AC1
RC2:
'Comment on essd-2025-554', Anonymous Referee #2, 21 Nov 2025

Excellent work! My only concern is regarding the Inverse Correction process. What is the solution process given that there is more precipitation data than observed discharge? is there an averaging process of the rainfall in the basin ? I think this is a similar question to comment 4 by reviewer 1. Thanks !

Citation: https://doi.org/10.5194/essd-2025-554-RC2
- AC2: 'Reply on RC2', Mochi Liao, 24 Nov 2025
  
  Please see the comments in the attached file.
  
  Citation: https://doi.org/10.5194/essd-2025-554-AC2

Status: closed

RC1:
'Comment on essd-2025-554', Anonymous Referee #1, 02 Oct 2025

The author developed a High-resolution Dataset of Extreme Orographic QPE by closing the water budget using stream gauge measurements. This is a novel method and will be of great value if further validated. Therefore, I recommend a major revision, as some clarification is needed, and more dataset evaluation may be beneficial.
Major comments:

1. I would recommend that the authors mention ICC as well in the abstract, as it is also one step in the precipitation data generation.

2. I recommend that the author provide a brief code to show how to read the data. The current format and structure of the data are unclear. It will be helpful for readers to try the data.

3. Are the ICC and IRC corrections implemented simultaneously in windows 2 and 5? Intuitively, overestimated rainfall values can compensate for an underestimated initial soil moisture condition. I am curious whether this compensation causes some difficulties in determining precipitation.

4. In the inverse correction process, there are likely more unknowns (precipitation at each pixel) than the knowns (observed discharge). Is it possible to obtain two different precipitation fields that can generate very similar discharge? How can you guarantee that you can get the "optimal" precipitation fields compared to other possible realizations? Is it reasonable to obtain an ensemble precipitation dataset to account for this variability?

5. Why did the authors select Stage IV as the primary precipitation source? In the first step, the authors downscale the precipitation field from 4km to 1km. Other available precipitation datasets, such as MRMS and AORC, provide precipitation estimates at a 1km resolution. If the authors use these 1km datasets, the downscale step can be removed.

6. L201-204, what does "self-similar statistics" mean? In L213, what does "the same rainfall statistics" mean here? I am curious which type of rainfall statistics is preserved in the downscaling process.

7. What is the size of the rainfall field in Ordinary Kriging? Is it a basin-based correction? Ordinary Kriging has the assumption of geostationary, which may not perform optimally when applied to a large complex region.

8. L505-L508, the authors mentioned that "The climatologically corrected STIV_DBKC fields have a significantly accurate diurnal cycle compared to only event-scale bias-corrected STIV_DBK." But in Figure 5, I did not see many differences between the blue and green lines. And should not the "STIV_DBK" here be "STIV_DB"?

9. L610, the authors mentioned that "IRC-ICC" is the recommended dataset. In Section 5, the author provides the citation for "IRC". Why don't the authors publish IRC-ICC?

10. I recommend that the authors provide the results of STIV_IRC_ICC in Figures 5, 6, and 7. I understand that the lack of rainfall ground truth makes the evaluation of precipitation data a little bit hard. The better discharge estimates from your methods cannot reflect the absolute accuracy of precipitation data, as the discharge is your objective function. I would recommend more evaluation of the precipitation data itself. Alternatively, you can use STIV_IRC_ICC to drive another hydrologic model to evaluate whether you can also have a better discharge prediction than Stage IV. Model calibration can also be implemented, as hydrologists usually do so with a precipitation dataset.
Minor comments:

1. I recommend that the authors clarify the terminology usage. In Figure 2, the event scale bias correction is noted as STIV_BD. But in some places of the figure and the article, STIV_DBK is used.

2. L690. The resolution of StageIV_D is "1km, hourly" in Figure 1, but you mention " the same resolution as StageIV_D datasets (250m, 5min)".

3. Provide the legend in Figure A3, Figure 8,9, 11, 12

4. Provide the unit in Figure 10

5. Provide the y-axis in Figure 11

Citation: https://doi.org/10.5194/essd-2025-554-RC1
- AC1: 'Reply on RC1', Mochi Liao, 24 Nov 2025
  
  The comment was uploaded in the form of a supplement: https://essd.copernicus.org/preprints/essd-2025-554/essd-2025-554-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/essd-2025-554-AC1
RC2:
'Comment on essd-2025-554', Anonymous Referee #2, 21 Nov 2025

Excellent work! My only concern is regarding the Inverse Correction process. What is the solution process given that there is more precipitation data than observed discharge? is there an averaging process of the rainfall in the basin ? I think this is a similar question to comment 4 by reviewer 1. Thanks !

Citation: https://doi.org/10.5194/essd-2025-554-RC2
- AC2: 'Reply on RC2', Mochi Liao, 24 Nov 2025
  
  Please see the comments in the attached file.
  
  Citation: https://doi.org/10.5194/essd-2025-554-AC2

Mochi Liao and Ana Barros

Data sets

StageIV-IRC: A High-resolution Dataset of Extreme Orographic Quantitative Precipitation Estimates (QPE) Constrained to Water Budget Closure for Historical Floods in the Appalachian Mountains Mochi Liao and Ana Barros https://doi.org/10.5281/zenodo.14028866

Mochi Liao and Ana Barros

Viewed

Total article views: 1,004 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
824	150	30	1,004	38	56

HTML: 824
PDF: 150
XML: 30
Total: 1,004
BibTeX: 38
EndNote: 56

Views and downloads (calculated since 19 Sep 2025)

Month	HTML	PDF	XML	Total
Sep 2025	545	11	4	560
Oct 2025	109	21	4	134
Nov 2025	73	33	11	117
Dec 2025	43	23	9	75
Jan 2026	35	35	2	72
Feb 2026	19	27	0	46

Cumulative views and downloads (calculated since 19 Sep 2025)

Month	HTML	PDF	XML	Total
Sep 2025	545	11	4	560
Oct 2025	109	21	4	134
Nov 2025	73	33	11	117
Dec 2025	43	23	9	75
Jan 2026	35	35	2	72
Feb 2026	19	27	0	46

Viewed (geographical distribution)

Total article views: 1,000 (including HTML, PDF, and XML) Thereof 1,000 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 28 Feb 2026

Short summary

The StageIV-IRC is the first precipitation dataset developed for extreme precipitation events in the mountains. This dataset strongly suggest the use of Inverse Rainfall Correction (IRC) framework to produce physically-meaningful corrections for precipitation products in the mountains, where precipitation estimation is problematic due to topography blockage. Post-IRC precipitation estimation produces improved hydrological responses, and it shows a good agreement with raingauge observations.


Total:	0
HTML:	0
PDF:	0
XML:	0