the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Geospatial dataset for Hydrologic analyses in India (GHI): A quality controlled dataset on river gauges, catchment boundaries and hydrometeorological time series
Abstract. Streamflow gauging stations not only track the pulse of rivers but also act as common reference points for hydrologic and other environmental analyses. As such, streamflow data and metadata on gauging stations - GIS data on station locations, their upstream catchment boundaries and river flow networks, are critical for analyses. However, for India's river basins, availability of such data is limited; when available, data is not in an analysis-ready format and can have substantial errors. Studies often use available information from India's water agencies as is, without checking for its validity. This study addresses the above limitations by building a new dataset using existing metadata (from CWC and WRIS) and checking it against publicly available information from global data sources (e.g., WWF, MERIT and Copernicus), and online maps (e.g., Google Maps). The quality control process categorizes existing metadata based on its consistency with these sources; also, existing metadata is supplemented with additional information where needed. The new dataset developed here is called the `Geospatial dataset for Hydrologic analyses in India' (GHI) and uses HydroSHEDS data as the underlying template. GHI has both geospatial and time series information. In this initial version of GHI, the spatial domain includes only the river basins of Peninsular India where daily streamflow data is publicly available.
Following the quality control process, CWC's 645 stations in Peninsular India were categorized into three groups - Group 1 (reliable metadata, adequate daily streamflow data; 213 stations), Group 2 (reliable metadata, inadequate or no daily streamflow data; 259 stations), and Group 3 (missing or unreliable metadata; 173 stations). For each of the 472 stations falling in Groups 1 and 2, catchment-specific annual and monthly time series spanning 71 water years (1950–2020) of the following were compiled: observed precipitation from IMD, observed streamflow from WRIS, estimated precipitation, evapotranspiration (ET) and streamflow from ERA5-Land, and ET from GLEAM. A preliminary analysis of catchment-scale time series data indicates that while the compiled data appears reasonable over most of the study domain, spurious runoff-precipitation ratios were observed in the hilly coastal regions of Western India. This adds to yet another data-related obstacle faced by the hydrologic community. In order to quantify historical changes and reconcile them with anticipated future changes, the community needs robust and reliable hydrographic and hydrometeorological datasets, as well as unrestricted access to such datasets. The goal of this study is to highlight the limitations of existing datasets and pave the way for a community-led effort towards building the needed datasets. GHI serves as a placeholder until such datasets become available. Potential improvements to GHI are discussed.
- Preprint
(2571 KB) - Metadata XML
- BibTeX
- EndNote
Gopi Goteti
Status: closed
-
RC1: 'Comment on essd-2023-46', Anonymous Referee #1, 03 Jul 2023
This paper aims to enhance both the quantity and quality of the streamflow data and metadata on gauging stations in India. The author integrated multiple datasets to update the information of gauging stations and compile historical streamflow data through a rigorous quality control process. The information and datasets provided in this study will prove valuable to hydrologic communities, while the methodologies presented can be adopted by both the administrative organizations and research communities to homogenize the hydrological datasets in India. I found this paper is generally well written. I recommend this paper to be accepted after few minor comments below are addressed.
- The GHI dataset is compiled from multiple datasets. It will be good to add a flowchart showing the steps of how different datasets are processed and the relationships between GHI and these datasets. Figure 4 and Table 5 serve such purpose but the usage and relations of different datasets are not clear.
- The base maps in figures are indistinct (e.g., figures 3 and 5). It seems that the author used screenshots of Google Maps and enlarged the images without properly dealing with the resolution. The letters of street names and landmark names are blurry. Please make sure to use high-resolution images for base map when making these figures.
- In Section 3.5, the method of how hydrometeorological variables are compiled is not explicitly mentioned. Are these variables compiled directly from available datasets as listed in Table 5? Are there any newly generated time series estimates from existing data by, such as, model outputs or statistical calculations?
- Some of the citation formats are incorrect. I see many places having the citation ends with double right parenthesis. For example, in Line 43, “River Discharge Data Set (RivDIS, Vorosmarty et al. (1998))”. It can be written as “River Discharge Data Set (RivDIS) (Vorosmarty et al., 1998)”. There are many other places with this issue (i.e., Lines 44, 54, 59, 115, 190, 242, 248, etc.). Please check throughout the manuscript.
Citation: https://doi.org/10.5194/essd-2023-46-RC1 - AC2: 'Reply on RC1', Gopi Goteti, 15 Jul 2023
-
RC2: 'Comment on essd-2023-46', Anonymous Referee #2, 04 Jul 2023
This manuscript can be ACCEPTED after minor corrections based on the detailed report in the PDF file attached above.
- AC1: 'Reply on RC2', Gopi Goteti, 15 Jul 2023
Status: closed
-
RC1: 'Comment on essd-2023-46', Anonymous Referee #1, 03 Jul 2023
This paper aims to enhance both the quantity and quality of the streamflow data and metadata on gauging stations in India. The author integrated multiple datasets to update the information of gauging stations and compile historical streamflow data through a rigorous quality control process. The information and datasets provided in this study will prove valuable to hydrologic communities, while the methodologies presented can be adopted by both the administrative organizations and research communities to homogenize the hydrological datasets in India. I found this paper is generally well written. I recommend this paper to be accepted after few minor comments below are addressed.
- The GHI dataset is compiled from multiple datasets. It will be good to add a flowchart showing the steps of how different datasets are processed and the relationships between GHI and these datasets. Figure 4 and Table 5 serve such purpose but the usage and relations of different datasets are not clear.
- The base maps in figures are indistinct (e.g., figures 3 and 5). It seems that the author used screenshots of Google Maps and enlarged the images without properly dealing with the resolution. The letters of street names and landmark names are blurry. Please make sure to use high-resolution images for base map when making these figures.
- In Section 3.5, the method of how hydrometeorological variables are compiled is not explicitly mentioned. Are these variables compiled directly from available datasets as listed in Table 5? Are there any newly generated time series estimates from existing data by, such as, model outputs or statistical calculations?
- Some of the citation formats are incorrect. I see many places having the citation ends with double right parenthesis. For example, in Line 43, “River Discharge Data Set (RivDIS, Vorosmarty et al. (1998))”. It can be written as “River Discharge Data Set (RivDIS) (Vorosmarty et al., 1998)”. There are many other places with this issue (i.e., Lines 44, 54, 59, 115, 190, 242, 248, etc.). Please check throughout the manuscript.
Citation: https://doi.org/10.5194/essd-2023-46-RC1 - AC2: 'Reply on RC1', Gopi Goteti, 15 Jul 2023
-
RC2: 'Comment on essd-2023-46', Anonymous Referee #2, 04 Jul 2023
This manuscript can be ACCEPTED after minor corrections based on the detailed report in the PDF file attached above.
- AC1: 'Reply on RC2', Gopi Goteti, 15 Jul 2023
Gopi Goteti
Data sets
Geospatial dataset for Hydrologic analyses in India (GHI): A quality controlled dataset on river gauges, catchment boundaries and hydrometeorological time series Gopi Goteti https://doi.org/10.5281/zenodo.7563599
Gopi Goteti
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
348 | 94 | 22 | 464 | 8 | 9 |
- HTML: 348
- PDF: 94
- XML: 22
- Total: 464
- BibTeX: 8
- EndNote: 9
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1