the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
CAMELS-DK: Hydrometeorological Time Series and Landscape Attributes for 3330 Catchments in Denmark
Abstract. Large samples of hydrometeorological time series and catchment attributes are critical for improving the understanding of complex hydrological processes, hydrological model development and performance benchmarking. CAMELS (Catchment Attributes and Meteorological time series for Large Samples) datasets have been developed in several countries and regions around the world, providing valuable data sources and testbeds for hydrological analysis and new frontiers in data-driven hydrological modelling. Regarding the lack of samples from low-land, groundwater-dominated, small-sized catchments, we develop an extensive repository of a CAMELS-style dataset for Denmark (CAMELS-DK). This CAMELS addition is the first containing both, gauged and ungauged catchments as well as detailed groundwater information. The dataset provides dynamic and static variables for 3330 catchments from various hydrogeological datasets, meteorological observations, and a well-established national-scale hydrological model. The dataset is enhanced with streamflow observations in 304 of those catchments. The spatially dense and full spatial coverage, supplying data for 3330 catchments, instead of only gauged catchments, together with the addition of simulation data from a distributed, process-based model enhance the applicability of such CAMELS data. This is especially relevant for the development of data-driven and hybrid physical informed modelling frameworks. We also provide quantities related to human impact on the hydrological system in Denmark, such as groundwater abstraction and irrigation. The CAMELS-DK dataset is freely available at https://doi.org/10.22008/FK2/AZXSYP (Koch et al., 2024).
- Preprint
(1744 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
CC1: 'Comment on essd-2024-292', Ather Abbas, 20 Aug 2024
Thanks for the paper and data.
There is a similar dataset at https://zenodo.org/records/7962379 by one of the co-authors (Julian Koch). Can the authors please clarify that if this dataset is an updated version of the one provided at zenodo in terms of observed streamflow?
Regards,
Ather Abbas
Citation: https://doi.org/10.5194/essd-2024-292-CC1 -
AC1: 'Reply on CC1', Jun Liu, 20 Aug 2024
Yes, the dataset you mention is related. The Caravan extension published on Zenodo is connected to a manuscript (https://doi.org/10.34194/geusb.v49.8292) and a dataset (https://doi.org/10.22008/FK2/YCQXTR). The current dataset associated to this manuscript, however, is more extensive, including both, simulated and observed data and the temporal coverage is updated and extends beyond 2019. Another major difference is that the current dataset contains information for all ~3300 catchments in Denmark, and not only the ~300 gauged catchments. In addition, the number of static catchment attributes has been expanded substantially.
Citation: https://doi.org/10.5194/essd-2024-292-AC1
-
AC1: 'Reply on CC1', Jun Liu, 20 Aug 2024
-
RC1: 'Comment on essd-2024-292', Alexander Dolich, 18 Sep 2024
Congratulations to the authors for the compilation of the CAMELS-DK dataset and the high-quality documentation and provision of extensive information in this manuscript.
CAMELS-DK is a very valuable addition to the family of CAMELS datasets, with the incorporation of groundwater data being a milestone for large-sample hydrological datasets.I have attached my comments as a PDF file, although there are many comments, most are minor or technical or suggestions for improvement, the quality of the manuscript and dataset is already very high.
-
AC2: 'Reply on RC1', Jun Liu, 26 Oct 2024
We appreciate the reviewer’s time and effort in thoroughly reviewing our manuscript, dataset, and data description. The reviewer’s suggestions provide helpful insights that help us improve the quality of our work.
We fully agree with the suggestion to align our title with the conventions of CAMELS datasets that emphasize the number of gauged catchments. We will modify the title to "CAMELS-DK: Hydrometeorological Time Series and Landscape Attributes for 3330 Danish Catchments with 304 Gauged."
We will carefully address each specific comment and ensure clarity and consistency throughout the manuscript and data description, including providing additional citations where needed, expanding descriptions of data sources and processing methods, and addressing any inconsistencies within the dataset.
Citation: https://doi.org/10.5194/essd-2024-292-AC2
-
AC2: 'Reply on RC1', Jun Liu, 26 Oct 2024
-
CC2: 'Comment on essd-2024-292', Vazken Andréassian, 19 Sep 2024
Dear colleagues,
I read your paper and downloaded the dataset. Congratulations for this huge work.
While reading the data, I have found a case where it seems that there has been a conflation between the missing value code and the zero value : catchment 42600042, calendar year 2010 and 2011 (which follow a period with missing values). Could you please check? I may have made a mistake while reading the data… if so please accept my apologies.
Best regards,
Vazken Andréassian
Citation: https://doi.org/10.5194/essd-2024-292-CC2 -
AC4: 'Reply on CC2', Jun Liu, 28 Oct 2024
Dear Dr. Andréassian,
Thank you for your interest in our dataset. We have reviewed the zero values for catchment 42600042 during the years 2010 and 2011. This period appears unusual, as the source data shows zero values throughout. We will reach out to the data provider to investigate further, and we hope to determine the cause. Once we have more information, we will decide how best to address the zero-value data.
Best regards,
JunCitation: https://doi.org/10.5194/essd-2024-292-AC4
-
AC4: 'Reply on CC2', Jun Liu, 28 Oct 2024
-
RC2: 'Comment on essd-2024-292', Ashutosh Sharma, 23 Oct 2024
The manuscript describes the CAMELS-DK dataset, which consists of hydrometeorological time series and catchment attributes for a large sample of catchments in Denmark. I appreciate the valuable contribution of the authors in extending the CAMELS dataset family to Denmark and providing a valuable dataset for the hydrological community. However, the manuscript requires substantial revision before it can be published.
One of the major concerns is the lack of observed streamflow data for many of the catchments. As described in the manuscript, the observed streamflow data is available only for the 304 catchments out of 3330 mentioned in the Title. A key characteristic of CAMELS datasets is providing observed streamflow data – without this, the datasets are less useful for large-sample hydrological analyses. I appreciate that the authors have included a modeled flow time series. However, I recommend keeping 304 catchments in the Title, i.e., where the authors have calculated hydrological signatures based on the observed data.
Specific comments:
1) Title: The authors should keep the number of gauged stations (i.e., 304) in the Title of the manuscript to be consistent with other CAMELS datasets.
2) Abstract: Line 17-22 can be rephrased to clearly distinguish that CAMELS-DK provides dynamic and static variables, including observed streamflow and their temporal range for 304 catchments, and provides dynamic and static variables for an additional 3026 catchments for further applications. Also, can the author mention how the additional catchments can enhance applicability without observed streamflow data? I understand that modeled streamflow is provided, but I am not convinced about its applicability given many catchments are very small in area, limited model resolution, and missing validation.
3) Line 44-48: Please include the access dates of the cited weblinks.
4) There are several typos throughout the manuscript. I am restricting myself from providing a complete list here. However, the authors should thoroughly proofread the manuscript to correct the typos and grammatical mistakes.
5) Line 103: “consistent data for 3300 catchments”? – As described in the manuscript, all data is not consistent for all 3330 catchments. Maybe “consistent meteorological and static attribute data for 3330 catchments? Moreover, is it 3300 or 3330? Please be specific.
6) Line 126: Please include a reference for ID15v2.5. Also, if ID15v2.5 includes 3351 catchments, why do the authors include 3330 in the manuscript? Given that many of these catchments are very small, how do authors justify the reliability of the model performance with 500 m resolution data [Line 208]?
7) There are several mistakes in the figures and visualization. For example, Figure 1(b): “Catchment area” instead of “Area”; Figure 2(a): the size of the circles is enlarged in the legend; Figure 2(c) is not colorblind-friendly; Figure 3(a): “ele_mean” instead of “DEM”.
8) Line 359-360: “CAMELS-DK provides observed streamflow data from 304 hydrological stations and simulated streamflow data for 2,942 catchments” – which makes it a total of 3246 catchments?; and it mentions, “However, 388 catchments lack observed or simulated streamflow data.” – then why are those relevant/included in the CAMELS-DK dataset?
9) Can authors include more discussion on the DK-Model performance, especially the causes of deteriorated performance in the central and northern regions? Since the authors provide a modeled streamflow based on this model, the readers/users must understand the limitations.
10) Line 437-438 and Line 450: The Python script for processing the time series, landscape attributes, and original raster data is not included in the dataset. I suggest including the Python script for data processing or adding more details on data processing, detailing the different products and steps used in generating the dataset to build user confidence. I would also recommend adding the license/disclaimer as a text file within the dataset to ensure this is readily available when a user directly downloads the product.
11) There are several mistakes in the naming convention in the dataset. For example, CAMELS_DK_soil.csv uses column name “Id15_model” instead of “catch_id”; “CAMELS_DK_signature_obs_based.csv” and “CAMELS_DK_signature_sim_based.csv” includes the same data. I suggest checking all the files and making necessary corrections thoroughly.
Citation: https://doi.org/10.5194/essd-2024-292-RC2 -
AC3: 'Reply on RC2', Jun Liu, 26 Oct 2024
We appreciate the time and effort the reviewer has dedicated to evaluating the manuscript and dataset. We agree that both the manuscript and dataset require revisions, and we will thoroughly revise them in response to the reviewer’s comments. We will modify the title to highlight the number of basins where we have observations.
We agree with the specific comments and we will address them point by point, including correcting typographical errors, adding references, expanding the discussion on model performance, and providing more details on data processing.
Citation: https://doi.org/10.5194/essd-2024-292-AC3
-
AC3: 'Reply on RC2', Jun Liu, 26 Oct 2024
Data sets
CAMELS-DK: Hydrometeorological Time Series and Landscape Attributes for 3330 Catchments in Denmark Jun Liu, Julian Koch, Simon Stisen, Lars Troldborg, Anker Lajer Højberg, Hans Thodsen, Mark F. T. Hansen, and Raphael J. M. Schneider https://doi.org/10.22008/FK2/AZXSYP
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
458 | 144 | 81 | 683 | 9 | 11 |
- HTML: 458
- PDF: 144
- XML: 81
- Total: 683
- BibTeX: 9
- EndNote: 11
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1