Articles | Volume 17, issue 4
https://doi.org/10.5194/essd-17-1551-2025
https://doi.org/10.5194/essd-17-1551-2025
Data description paper
 | 
14 Apr 2025
Data description paper |  | 14 Apr 2025

CAMELS-DK: hydrometeorological time series and landscape attributes for 3330 Danish catchments with streamflow observations from 304 gauged stations

Jun Liu, Julian Koch, Simon Stisen, Lars Troldborg, Anker Lajer Højberg, Hans Thodsen, Mark F. T. Hansen, and Raphael J. M. Schneider
Abstract

Large samples of hydrometeorological time series and catchment attributes are critical for improving the understanding of complex hydrological processes, hydrological model development, and performance benchmarking. CAMELS (Catchment Attributes and Meteorology for Large-sample Studies) datasets have been developed in several countries and regions around the world, providing valuable data sources and test beds for hydrological analysis and new frontiers in data-driven hydrological modeling. Regarding the lack of samples from lowland, groundwater-dominated, small-sized catchments, we develop an extensive repository of a CAMELS-style dataset for Denmark (CAMELS-DK). This CAMELS addition is the first containing both gauged and ungauged catchments as well as detailed groundwater information. The dataset provides dynamic and static variables for 3330 catchments covering all of Denmark from various hydrogeological datasets, meteorological observations, and a well-established national-scale hydrological model. For 304 of those catchments, streamflow observations are provided, whereas simulated streamflow is provided for all 3330 catchments. The dataset contains time series spanning 30 years (1989–2019) with a daily time step, and the data will be updated once new observations and model simulations become available. The dense and full spatial coverage for all 3330 catchments, instead of only gauged catchments, together with the addition of various simulation data from a distributed, process-based model, enhances the applicability of such CAMELS data, for example, for the development of data-driven and hybrid physically informed modeling frameworks or other cases where consistent full spatial coverage is required. We also provide quantities related to the human impact on the hydrological system in Denmark, such as groundwater abstraction and irrigation. The CAMELS-DK dataset is freely available at https://doi.org/10.22008/FK2/AZXSYP (Koch et al., 2024).

Share
1 Introduction

Hydrometeorological time series and catchment attributes are crucial for understanding and modeling hydrological systems (Andersson et al., 2015; McMillan et al., 2018). Long-term records of water cycle variables, such as precipitation, evapotranspiration (ET), streamflow, and groundwater levels, give profound insights into dynamics and trends of water movement. These records form the basis for supporting water resource management and climate change adaptation. Sufficient hydrological data are required by scientists and engineers to manage water resources effectively and to make accurate predictions of hydrological extremes (Van Loon, 2015). Catchment attributes provide information on the physical characteristics of catchments, such as topography, soil type, land use, and geology, which are important for understanding how catchments respond to meteorological events and are relevant for land use planning and catchment management. The similarity of catchment attributes is useful in comparative hydrology studies, which facilitate the transfer of knowledge from data-rich to ungauged catchments or regions where direct hydrological measurements are scarce or nonexistent (Sawicz et al., 2011; Singh et al., 2014; Tegegn et al., 2022; Tshimanga et al., 2022). Being able to predict in ungauged catchments enhances our ability to manage water resources efficiently and sustainably.

The availability of open-access environmental data is increasing; however, data sources are distributed across different platforms and stored in various formats, which requires further efforts in data collection and pre-processing. Many state authorities and governmental research institutes comply with open data policies, significantly breaking down the barriers related to data sharing issues that have existed in the hydrology community for decades (Kibler et al., 2014). Numerous platforms and websites offer hydrological data from various sources, which greatly benefits large-scale studies. For example, the Global Runoff Data Centre (GRDC) provides streamflow measurements from over 10 000 hydrological stations (GRDC, 2020), the HydroSHEDS database offers a suite of hydrographic data (Lehner et al., 2008), the European Centre for Medium-Range Weather Forecasts provides multiple climate reanalysis products (Hersbach et al., 2020), and Hydroweb supplies satellite-derived water levels for global rivers and lakes (Da Silva et al., 2010). However, large-scale studies face a key challenge: they require a vast amount of data for development, and users often spend considerable time and effort navigating through different platforms for data collection and employing various programming packages for data pre-processing.

Many studies are, consequently, focused on creating large-sample hydrology (LSH) datasets providing streamflow data for a large number of catchments following the findable, accessible, interoperable, and reusable (FAIR) principle (Wilkinson et al., 2016). Newman et al. (2015) created a comprehensive hydrometeorological dataset, which includes daily forcings and hydrologic response data for 671 small- to medium-sized basins throughout the contiguous United States (CONUS). This dataset was further enriched by Addor et al. (2017), who coined the name Catchment Attributes and Meteorology for Large-sample Studies (CAMELS) to describe their collection of catchment attributes for the same basins. Since then, CAMELS datasets have been developed in several countries, including Chile (Alvarez-Garreton et al., 2018), Brazil (Chagas et al., 2020), Great Britain (Coxon et al., 2020), Australia (Fowler et al., 2021), and Switzerland (Höge et al., 2023). Some other datasets provide hydrometeorological time series and catchment characteristics similar to CAMELS conventions, but with diverging naming, such as LArge-SaMple DAta for Hydrology and Environmental Sciences for Central Europe (LamaH-CE; Klingler et al., 2021) and Iceland (LamaH-Ice; Helgason and Nijssen, 2024), historical hydrometeorological time series and signatures for 24 catchments in Haiti (Simbi; Bathelemy et al., 2023), a large-scale benchmark dataset for data-driven streamflow forecasting (WaterBench-Iowa; Demir et al., 2022), and the China Catchment Attributes and Meteorology dataset (CCAM; Hao et al., 2021). These datasets are open-access, are well-formatted, and encompass a wide range of comprehensive variables related to hydrological processes (Klingler et al., 2021). They provide information for hydrological studies and water resources management (Frame et al., 2021; Jehn et al., 2020; Meyer Oliveira et al., 2023; Tang et al., 2023) but also serve as a reference for the development of hydrological models, as well as the training and testing of data-driven algorithms (Kratzert et al., 2018, 2019, 2021; Liu et al., 2024b; Mai et al., 2022; Nearing et al., 2024; Yin et al., 2022). Details about recent progress and their applications have been summarized by Addor et al. (2020). Kratzert et al. (2023) introduced the Caravan platform, which consolidates the national CAMELS datasets into a singular dataset derived from global sources; this further improves the accessibility of the dataset. However, Clerc-Schwarzenbach et al. (2024) expressed concern that the Caravan dataset, which uses ERA5-Land reanalysis data, for exhibiting an unrealistically high potential evapotranspiration and a significant discrepancy in precipitation relative to the original CAMELS datasets. Such differences in meteorological forcings influence model results. Consequently, they advocated for the augmentation of the Caravan dataset with the forcing data present in the original CAMELS dataset, underscoring the ongoing relevance of the original CAMELS development.

While the already existing CAMELS datasets cover a wide range of hydroclimatic conditions and catchment characteristics it can still be considered incomplete in terms of the catchment diversity, especially a lack of samples from low-lying, small, and groundwater-dominated catchments. In the existing CAMELS datasets with a total of 3308 catchments (CH: 331, GB: 671, CL: 516, BR: 897, AUS: 222, US: 671), only 7.5 % of the catchments (250 out of 3308) have an elevation lower than 100 m, and 9.5 % of the catchments have an area size smaller than 50 km2. Hydrological regimes, with respect to their baseflow and peak flow, differ between small and large catchments. The same holds for low-lying catchments, where hydrological regimes are more affected by groundwater-related processes. Hence, samples from small-sized and groundwater-dominated catchments are necessary to increase the diversity of the current existing CAMELS family. Additionally, information and data from well-established physical hydrological models have not yet been provided by previous LSH datasets. Simulated runoff, obtained from hydrological models, offers a valuable benchmark dataset and can potentially also be provided at ungauged basins.

It is worth mentioning that LSH provides an invaluable foundation to scientists aiming to apply data-driven machine learning (ML) techniques in hydrology, as they undoubtedly offer an ideal environment for benchmarking, training, and testing ML algorithms. CAMELS and similar LSH have already contributed to ML advancements, such as predicting streamflow (Wilbrand et al., 2023), transfer learning (Ma et al., 2021), testing advanced algorithms (Yin et al., 2022), hybrid modeling (Acuña Espinoza et al., 2024), and global-scale flood forecasting (Nearing et al., 2024). In the context of ML-based hydrological modeling, physically informed data-driven algorithms, which combine the strength of traditional physics-based approaches with ML models, often show enhanced performance (Konapala et al., 2020). Therefore, hydrological information from well-established hydrological models (if existing) would further benefit applications of CAMELS datasets. Previous CAMELS datasets provide simulated streamflow from conceptual hydrological models, such as LamaH-CE, which is insufficient in many Danish river systems. Liu et al. (2024a) tested several hybrid schemes of combining simulations from a physically based hydrological model (PBM) and climate variables for streamflow estimation in Denmark, and the study found that the combination of PBM simulations (such as shallow groundwater levels, soil water content, and streamflow) and climate variables achieves better performance with an ML model. As part of a ML-based rainfall runoff modeling study published by Koch and Schneider (2022), hydrometeorological time series and catchment attributes were made available alongside the observed discharge for 301 Danish catchments. The published dataset only covers a selection of very few catchment attributes, does not comply fully with FAIR principles, and therefore does not fulfill CAMELS standards.

The points presented above have motivated us to compile and introduce CAMELS-DK for Danish catchments. Denmark covers roughly 43 000 km2; the topography is flat (highest point is 170 m above sea level) and the climate is temperate with precipitation evenly distributed over the year (annual average precipitation ranging between 600 mm in the east and 1000 mm in the west). The flat terrain and wet climate generate about 69 000 km of river courses and 195 000 lakes (Danish Environmental Protection Agency, 2023). Groundwater levels close to the ground surface are found throughout the country, and reliable surface water modeling in Denmark always requires information on groundwater dynamics (Duque et al., 2023; Koch et al., 2021; Schneider et al., 2022). Ignoring groundwater contributions in streamflow modeling can lead to significant errors, such as the underestimation of baseflows and the misrepresentation of seasonal variations. Spatiotemporally continuous groundwater measurements are difficult to obtain, which means not all catchments have groundwater information when developing CAMELS-style datasets. On the other hand, well-developed national-scale hydrological models with three-dimensional groundwater movement are available in Denmark, which has been calibrated with well measurements of groundwater levels. These simulations provide valuable insights for surface hydrological modeling.

CMALE-DK provides consistent data for 3330 catchments, which cover almost the entire land area of Denmark. The dataset has a median catchment area of 19.61 km2 with a median elevation of 31.94 m. Around 10 %, i.e., 304 of these catchments, are gauged and contain observed runoff, but all catchments in this dataset contain consistent simulated hydrological data, including simulated runoff and groundwater dynamics, in addition to hydroclimatic forcing and catchment attributes. These simulations are from a spatially distributed hydrological model, the National Hydrological Model of Denmark (DK-model), which has been thoroughly validated against data from 304 streamflow stations and approximately 40 000 groundwater wells. The DK-model provides a satisfying and consistent simulation of streamflow and incorporates a sophisticated 3D representation of groundwater processes, including a detailed hydrogeological model. With the first CAMELS datasets to provide a broad range of consistent and high-quality simulated data, we intend to generate a test bed for hybrid or physics-aware ML developments to further accelerate the development of such model types.

The objective of this study is to describe the extensive repository of hydrometeorological time series and catchment attributes in Denmark. We followed the routine of previous CAMELS dataset and to ensure that this dataset is comparable to and interoperable with the already existing CAMELS datasets. Additionally, some new features are introduced which have not been found in the previously published CAMELS datasets, such as the incorporation of simulation results (shallow and deep groundwater levels, irrigation) for a spatially distributed hydrological model as well as features of a hydrogeological model and observed groundwater abstraction for irrigation. The dataset follows the FAIR guiding principles for scientific data management and stewardship. We believe this dataset will enrich the existing CAMELS database with catchments that have different features and hydrological regimes than the already existing datasets with respect to groundwater influence and size. The paper is organized as follows: Sect. 2 describes the catchment delineation. Section 3 presents the dynamic variables including climate forcings, observed streamflow, and simulations from the hydrological model. Section 4 presents the sources and features of catchment attributes of topography, soil types, land use, and geology. In Sect. 5, we discuss the hydrological processes in Denmark based on the provided dataset. Section 6 contains a short summary of the paper.

2 Catchment delineation

Denmark has been divided into 3351 topographical catchments (Højberg et al., 2021), referred to as ID15 catchments (Fig. 1). The division was originally prompted by the need to know the topographic areas upstream of monitoring stations, lakes, and marine waterbodies, thus enabling the calculation of runoff and loads of nutrients, suspended matter, and other chemical components coupled to the corresponding topographical area. For modeling purposes, the observed catchments were further subdivided and supplemented by a delineation for unmonitored catchments. A topology for the sub-catchments was established, describing the downstream relations.

https://essd.copernicus.org/articles/17/1551/2025/essd-17-1551-2025-f01

Figure 1CAMELS-DK domain. (a) ID15 catchments and geological regions of Denmark, including northern Zealand (NZ), eastern Denmark (ED), western Jutland (WJ), central Jutland (CJ), Djursland (DL), Himmerland (HL), northwestern Jutland (NwJ), northern Jutland (NJ), and Bornholm (BH). (b) Distribution of catchment area and (c) an example of river channels and catchment boundaries delineated by ID15 catchments in northern Zealand. The green lines are river channels, the red dot indicates the location of a hydrological station downstream of the river section, blue dots are example basin outlets, and the corresponding catchments upstream of drainage areas are filled with light color.

The catchment delineation seeks to provide sub-catchment areas at a size of about 15 km2 (hence the name ID15 catchments) where possible. However, the criteria that all lake outlets and monitoring stations must coincide with an outlet from a sub-catchment introduce sub-catchments sometimes considerably smaller than 15 km2. Alongside the establishment of new lakes or additional monitoring stations, the delineation is subject to periodic updates, with additions of new or alteration of existing catchment boundaries. The most updated version of ID15 catchments includes 3351 sub-catchments with a median area of 12.98 km2, varying from less than 1 km2 to more than 100 km2 for the ID15 catchments containing the largest Danish lake. Both very large and some very small ID15 catchments were created to accommodate lake catchments and monitoring stations. The delineation of the ID15 catchments follows topographic catchments. However, especially in flat areas and areas with modified streams and canals, local knowledge of such alterations was accounted for to delineate the actual surface water catchments. Additionally, ID15 catchments are also coupled with the National Hydrological Model of Denmark (details are in Sect. 3.3); time series of simulated streamflow are available at catchment outlets where streams are included in the national hydrological model.

3 Dynamic variables

3.1 Climate data

The climate time series are based on national gridded products produced by the Danish Meteorological Institute (DMI). Datasets with daily time steps going back to the year 1989 are based on observations from a network of in situ weather stations. Daily rainfall time series, aggregated to catchment scale with an area-weighted mean function, were derived from the 10 km2 “Klimagrid Danmark” data from DMI (Scharling, 1999b). The gridded dataset is based on an inverse-distance interpolation of daily observations of precipitation at DMI's stations. The station density changes from year to year but generally fluctuates between 200 and 500 stations, distributed across all of Denmark. The impact of the reduction in the number of gauges was evaluated by DMI and found to be minimal at the national scale, but it can have local impacts (Andersen, 2021). As many hydrological applications are extremely sensitive to the exact amount of precipitation, the raw observations of precipitation were corrected for sensor and gauge undercatch. Operational precipitation gauges are typically situated 1–1.5 m above the ground and are influenced by turbulence around the gauge. This turbulence leads to a systematic undercatch of measured precipitation compared to true precipitation. The extent of undercatch depends on wind speed and precipitation type, with particularly significant undercatch observed for snow during windy conditions. However, even liquid precipitation is subject to undercatch. In Denmark, the undercatch is estimated to average around 5 %–10 % in summer and 15 %–20 % in winter. This systematic bias results in an underestimation of true precipitation, making correction essential for any water resource analysis or hydrological modeling exercise. The precipitation correction model applied in this study is based on a Danish study of the Hellman rain gauge. The model uses empirical relationships to estimate undercatch based on wind speed, shelter conditions at the gauge (turbulence), and temperature (to determine precipitation type – liquid, solid, or mixed). Additionally, the model accounts for wetting losses caused by evaporation of water from the inner sides of the rain gauge before measurement. The correction was applied dynamically, accounting for rainfall intensity, wind speed, and temperature (solid or liquid precipitation). CAMELS-DK provides corrected precipitation at the ID15 catchment scale based on freely available uncorrected precipitation observations, wind speed, and temperature data in 10 km grid format and the correction approach described by Stisen et al. (2011). The uncorrected or DMI corrected precipitation data are not part of this dataset. Similarly to the precipitation dataset, DMI also provides gridded datasets of air temperature and potential evapotranspiration (Scharling, 1999a). Potential evapotranspiration is calculated based on the Makkink formula (Van Kraalingen and Stol, 1997) by DMI, specifically modified for Danish conditions. Temperature and potential evapotranspiration are provided at daily time steps but at an original resolution of 20 km2.

The climate data were downloaded directly from the free data application programming interface of DMI (2024). Precipitation correction was then applied to all grids to account for the undercatch biases. The raster data were subsequently clipped to each catchment boundary, and a mean daily value was calculated based on all grids that fall within or touch the boundaries. This process forms the climate time series included in CAMELS-DK.

Table 1Dynamic variables in CAMELS-DK. The time series are on a daily scale and spatially aggregated with the mean function.

Download Print Version | Download XLSX

3.2 Observed streamflow

Streamflow is provided by Aarhus University through the Surface Water Database (DCE, 2020). Water levels are measured sub-daily by hydraulic sensors at hydrological stations and aggregated to daily values. Daily streamflow is calculated by the sub-daily water levels, applying these to the established dynamic rating curves. The rating curves vary throughout the year due to changes in river cross sections caused by erosion and sedimentation or regulation, as well as by vegetation growth and cutting. Mean daily water discharges are calculated from sub-daily time series and are accessible through DCE (2020). Quality control is performed by the Danish Environmental Protection Agency. Streamflow time series are available for over 1000 hydrological stations spread across Denmark. However, many stations are unsuited for use in hydrological modeling. Some stations may have limited time series lengths (basin_id 35321223 has the shortest streamflow record of 1762 d during the period 1989–2019 in CAMELS-DK), and some stations may contain questionable discharge values. Others may show anthropogenic influence (e.g., owing to the operation of sluices or pumping stations), which cannot be represented adequately in hydrological models. This meant that, based on experience with the calibration and validation of the DK-model, the observation dataset was limited to 304 stations (Stisen et al., 2020). Additionally, some of the stations (four stations) were located in the middle of a ID15 catchment, and these stations were excluded from the dataset. The spatial distribution of hydrological stations is displayed in Fig. 2a. Small catchments (area <20 km2, 37 stations) are located upstream of larger river systems and in coastal areas. Large catchments (area >200 km2) are mainly located on the Jutland peninsular. The largest river of Denmark, the Gudenå River in central Jutland, has a catchment area of 2602 km2 at the most downstream station.

https://essd.copernicus.org/articles/17/1551/2025/essd-17-1551-2025-f02

Figure 2Streamflow data availability for all gauges contained in the CAMELS-DK dataset. (a) The distribution of hydrometrical stations and adherent catchments, (b) the number of stations with full observation time series per year, and (c) the availability of observed streamflow for different periods shown by the number of stations with the percentage of available streamflow.

Data availability for the 304 stations included in CAMELS-DK is illustrated in Fig. 2. Most stations provided continuous streamflow observations during the earlier part of the study period, specifically from 1989 to 2005 (Fig. 2b). However, the number of active hydrological stations declined sharply between 2005 and 2010, with only about 200 stations continuing to provide full observations of streamflow. Figure 2c provides further insight into data availability based on the fraction of days with streamflow observations. During 1990–1999, around 280 stations had data for 90 % of the period, indicating high data coverage during this decade. However, data availability declined significantly in subsequent years, with only about 200 stations retaining 90 % data coverage over the entire study period from 1990 to 2019. This decline highlights the challenges of maintaining consistent long-term hydrological monitoring. The reduction in the number of available stations and data coverage may be due to factors such as station decommissioning, operational challenges, or changes in monitoring priorities. These trends have important implications for hydrological modeling, as reduced data availability may limit the spatial and temporal robustness of hydrological analyses.

3.3 DK-model simulations

The National Hydrological Model of Denmark (hereafter referred to as the DK-model) is an integrated groundwater surface water model covering most of the Danish land area of approximately 43 000 km2. It has been continuously developed at the Geologic Survey of Denmark and Greenland (GEUS) since the late 1990s (Henriksen et al., 2019; Højberg et al., 2013; Koch et al., 2021; Schneider et al., 2022; Stisen et al., 2020). It has been used in public consultancy projects, targeting, for example, water resources, evaluation of the impact of climate change on the hydrological cycle, or nitrate retention. However, the DK-model as a whole or in parts also has been and is being used actively in various national and international research projects, further pushing its development (Mahmood et al., 2023; Martinsen et al., 2022; Seidenfaden et al., 2022; Soltani et al., 2021).

The DK-model is set up in the MIKE SHE model code (Abbott et al., 1986; DHI, 2020), which is used to fully couple a 3D description of subsurface flow to 2D overland flow, a simple two-layer description of the unsaturated zone, and 1D kinematic routing of streamflow. It runs as a transient model at a maximum simulation time step of 24 h and, for the historic period covered by this dataset, with daily climate forcing of precipitation, potential evapotranspiration, and temperature (as described in Sect. 3.1). In its most recent version, which serves as a basis for the data presented here, it exists at horizontal resolutions of both 100 and 500 m (Henriksen et al., 2021),where we are using results from the 500 m version (input data, such as the hydrostratigraphic model, remain at 100 m resolution and are resampled internally by the model). Special attention has been paid to the representation of surface and near-surface processes (shallow groundwater). The model has been calibrated for the period 2000 to 2010 against 304 time series of daily streamflow observations (mostly identical to the ones provided in this dataset) and against groundwater head observations from roughly 40 000 intakes distributed across the entire country. The most significant anthropogenic impacts are accounted for in the model: groundwater extraction for domestic and industrial water use is included using data on annual average extractions from waterworks in the National Well Database (Jupiter, 2023). Groundwater extraction for irrigation in agriculture, which occurs predominantly in the western parts of Denmark, is accounted for based on dynamic calculations of soil water deficit in MIKE SHE (Danapour et al., 2021). Where sewage plants discharge their outflow into streams, these are included as point sources in the stream network, based on yearly averages. As roughly half of the agricultural land in Denmark is artificially drained, saturated zone drainage is included (Møller et al., 2018b). Due to the coarse grid resolution, the related inability to represent the drainage infrastructure explicitly, and the effects of microtopography, as well as other aspects such as urban sewer systems acting as further drainage, the DK-model includes drainage across most of Denmark. It is parameterized using drain depths and time constants distributed based on the BASEMAP land use map. Generated drain flow is routed to the river network.

With hydrological processes in Denmark in general, and streamflow in particular, being strongly groundwater-dependent (Duque et al., 2023), we provide, amongst others, groundwater-related variables from the DK-model simulations. We could previously show that data-driven methods for streamflow estimation benefit from information contained in simulated groundwater levels across Denmark (Liu et al., 2024a). Hence, as part of the CAMELS-DK dataset, we include various simulation results from the DK-model at daily time steps with relevance for streamflow (Table 1). For reference, we also include the simulated streamflow at the outlet of each ID15 catchment. The other simulation results are aggregated per ID15 catchment. From the two-layer unsaturated zone module of MIKE SHE, the water content in the root zone is provided, together with simulated actual evapotranspiration and recharge from the unsaturated to the saturated zone. For recharge, positive values indicate flow downwards to the saturated zone, whereas negative values indicate upwards flow from the saturated zone, occurring, for example, in areas of upwelling groundwater. Furthermore, the simulated exchange between streamflow in rivers and the saturated zone is provided, as it gives an indication of the groundwater baseflow contribution to streamflow (most Danish rivers can be classified as predominantly gaining streams). Another important part of the hydrological cycle in Denmark is artificial drainage because roughly half of the Danish agricultural land is artificially drained (Møller et al., 2018a); therefore simulated artificial drainage is included in the present dataset. Together, direct groundwater contribution to the streams and artificial drainage account for most of the streamflow generated in the DK-model (Refsgaard et al., 2022). Finally, the groundwater state is included with two variables: depth to the phreatic layer, representing the depth below the surface to the uppermost simulated groundwater table (which can be a perched groundwater table), and the simulated groundwater head in the main deeper aquifer (typically in a few tens of meters of depth). The former gives an indication of surface and near-surface interactions between groundwater and surface water, reacting more quickly to precipitation events, whereas the latter typically shows a more delayed response to climate.

3.4 Groundwater abstraction

Domestic and industrial water supply in Denmark is almost exclusively covered by groundwater. Data on groundwater abstraction from water works are reported to the national well database Jupiter (Hansen and Pjetursson, 2011) as aggregated yearly values per well or well field. Those data are used as input to the DK-model and are also provided as part of the CAMELS-DK dataset. Due to the close coupling between surface and subsurface processes in Denmark, groundwater extraction locally impacts streamflow, as acknowledged, for example, in national water resource assessments (Henriksen et al., 2008, 2023).

4 Catchment attributes

4.1 Location and topography

The dataset is organized based on the ID15 catchments (Højberg et al., 2021). All the catchments are identified with an eight-digit name according to the shapefile. The locations of catchment outlets are provided as longitude and latitude with the projection of WGS 84/UTM zone 32N (EPSG:32632). Flow direction indicates the downstream catchments; the column was filled with −99 for most downstream catchments. The number of upstream catchments and the list of upstream catchment IDs are also provided. The area of the individual ID15 catchments and their entire upstream accumulated area originate from the provided shapefile. Gauge types indicate if the catchment is gauged or ungauged; when “true” indicates that the catchment is a gauged catchment, observed streamflow and simulated streamflow are available in the dataset, and “false” indicates that only simulated streamflow is available. Details about the observed data, such as the length and the percentage of available observed streamflow data during 1989 to 2019, are provided. Catchment topography described using various statistics derived from elevation and slope is also provided in the attribute table; see Table 2. Please be aware that catchment zonal statistics are based on the entire upstream area.

Table 2Catchment location and topography.

Download Print Version | Download XLSX

4.2 Climate indices

Climate indices are based on the aforementioned gridded datasets of precipitation, temperature, and potential evapotranspiration. To provide consistency with previous CAMELS datasets, we compute the same climatic indices for all catchments in CAMELS-DK as Addor et al. (2017) based on the script provided by Hao et al. (2021), such as the mean value of daily precipitation, seasonality, and frequency. Details of these climate indices are listed in Table 3. Climatic indices in CAMELS-DK are calculated based on the time series from 1989 to 2019, which is consistent with the availability of observed discharge.

Table 3Climate indices.

Download Print Version | Download XLSX

4.3 Streamflow signatures

Hydrologic signatures are derived from the observed daily streamflow at 304 hydrological stations and the DK-model-simulated streamflow at the outlets of 2942 catchments (388 catchments have no simulated river flows) during 1989 to 2019. The signatures are calculated using a MATLAB toolbox called TOSSH (A Toolbox for Streamflow Signatures in Hydrology), developed by Gnann et al. (2021). TOSSH provides basic signatures such as magnitude, frequency, duration, timing, and rate of change of a natural streamflow regime, as well as signatures from benchmark papers, including those used by Addor et al. (2017), and an extended set of process-based signatures.

We calculated 13 hydrological signatures presented by Addor et al. (2017) using TOSSH, ensuring consistency with other CAMELS datasets (e.g., Alvarez-Garreton et al., 2018; Chagas et al., 2020; Höge et al., 2023). These signatures are listed in Table 4. Additionally, we provide process-based signatures calculated with TOSSH, including groundwater and overland flow signatures defined by McMillan (2020); see Appendix A. The groundwater signatures characterize groundwater storage, groundwater dynamics, and baseflow, while the overland flow signatures reflect infiltration excess and saturation excess in overland flow.

Table 4Hydrographical signatures based on observed and simulated streamflow.

Download Print Version | Download XLSX

4.4 Land use and land cover

We provide both national and continental data sources for land use and land cover (LULC). The LULC attributes are derived from two datasets: the CORINE Land Cover (CLC) from the European Copernicus program (Büttner et al., 2004) and Basemap04 developed by Aarhus University (Levin, 2022). CAMELS-DK calculates the catchment-scale percentage of key LULC classes. The aggregated areal percentages of five land cover types are provided for all catchments, including the percentages of forest, crops, water, urban areas, and wetlands (Table 5).

The CORINE Land Cover (CLC) product offers a pan-European land cover and land use inventory with 44 thematic classes, ranging from broad forested areas to individual vineyards (Büttner et al., 2004). The product is updated every 6 years with new status and change layers for 1990, 2000, 2006, 2012, and 2018. The dataset is based on multiple satellite data sources, such as Landsat, SPOT-4/5, RapidEye, and Sentinel-2. The geometric accuracy of the CLC data is 100 m for the first dataset released in 1990, with accuracy improving to better than 100 m in subsequent years. Basemap04 provides four maps of LULC representing the years 2011, 2016, 2018, and 2021, with a spatial resolution of 10 m. The dataset combines existing thematic geographic information, such as census mapping of state forests, maps of protected habitats, agricultural field parcel maps, and cadaster maps (Levin, 2022).

Table 5Land use attributes.

Download Print Version | Download XLSX

4.5 Soil types

Physical soil properties were derived from national and continental data sources. For the former, catchment average soil texture was derived for four soil layers, i.e., 0–5, 5–15, 15–30, 30–60, 60–100, and 100–200 cm at  30 m spatial resolution. The Danish dataset by Adhikari et al. (2013) is based on a regression model using field data for 1958 soil profiles and 17 environmental covariates. In addition, catchment average soil physical properties were derived from the European Soil Data Centre (ESDAC) and the dataset of 3D Soil Hydraulic Database of Europe at 1 km and 250 m resolution (Hiederer, 2013; Tóth et al., 2017). A broader set of variables was processed, including rooting depth and saturated hydraulic conductivity, among others. Soil maps of ESDAC are at 1000 m spatial resolution and are based on data from the European Soil Database in combination with data from the Harmonized World Soil Database (HWSD) and Soil-Terrain Database (SOTER). Several hydraulic parameters from the 3D Soil Hydraulic Database of Europe, available at 1 km and 250 m resolutions, are included in CAMELS-DK, such as water content at field capacity and saturated hydraulic conductivity (Table 6).

Table 6Catchment attributes of soil types.

Download Print Version | Download XLSX

4.6 Hydrogeology and geology

Hydrogeological features were obtained from the 3D hydrogeological model implemented in the DK-model (Stisen et al., 2020). The model is based on a layer structure, with a total of up to 28 hydrostratigraphical layers that have been modeled based on borehole data and subsurface geophysics. The layers delineate the occurrence of, e.g., Quaternary and pre-Quaternary sand and clay as well as limestone. The bottom and top of the hydrostratigraphical layers are mapped on a 100 m grid and five key variables were compiled for the CAMELS-DK dataset: (1) depth to chalk specifies the vertical distance from terrain to chalk. (2) Thickness of the uppermost aquifer is the accumulated thickness of Quaternary sand layers that are not interrupted by clay layers thicker than 1 m. (3) Depth to uppermost aquifer specifies the vertical distance from terrain to the top of the uppermost Quaternary sand layer thicker than 1 m. (4) Thickness of uppermost clay specifies the accumulated Quaternary clay layers from terrain until a sand layer thicker than 1 m is present. (5) Thickness of uppermost sand specifies the accumulated Quaternary sand layers from terrain until a clay layer thicker than 1 m is present. All variables are stated in meters, and the catchment average has been calculated for the CAMELS-DK dataset (Table 7). The hydrostratigraphical model for the island of Bornholm is developed separately from the rest of Denmark. Bornholm's geology is significantly different with pre-Cambrian granite–gneisses and Cambrian sandstones. Available data are interpreted to build a 3D voxel model for the island. Due to these differences, the five hydrogeological features are not available for Bornholm. Moreover, the 14 classes of the surface geology map of Denmark (Pedersen et al., 2011a) were processed as catchment percentages.

Table 7Catchment attributes of hydrogeologic and geologic features.

Download Print Version | Download XLSX

5 Data discussion

5.1 Regional variability in catchment attributes

Denmark has a gentle topography with many regions at a mean elevation lower than 50 m and low slopes (Fig. 3a, b). Across central Jutland, a ridge has elevations reaching up to about 170 m. This ridge, together with the prevailing westerly winds, is responsible for the regional precipitation patterns, with higher precipitation in western Jutland and lower precipitation in the more eastern parts of the country (Fig. 3c). Another important factor controlling hydrology across Denmark is the soil type: soils are most clayey in eastern Denmark and central and northwestern Jutland (Fig. 3d) and least clayey (mostly sandy) in western Jutland. A higher percentage of clay soil generally relates to lower hydraulic conductivities in the topsoil, leading to higher groundwater levels and more artificial drainage. It also leads to larger amount of water available for evapotranspiration and lower runoff. In contrast, the thickness of uppermost sand is high in western Jutland, northern Jutland, and Himmerland (Fig. 3e). The depth to chalk aquifers is high (>250 m) in northwestern Jutland and central and western Jutland but lower in northern Zealand and Djursland with typical depths of around 60 m (Fig. 3f). The temperate climate, with precipitation clearly exceeding evapotranspiration, low-lying topography, and geology dominated by clay till or sandy meltwater deposits, results in high groundwater levels and abundant groundwater resources in Denmark.

https://essd.copernicus.org/articles/17/1551/2025/essd-17-1551-2025-f03

Figure 3Regional distributions of (a) mean elevation, (b) slope, (c) mean daily precipitation, (d) clay percentage in the layer depth of 0–30 cm, (e) depth of chalk, and (f) thickness of uppermost sand in geo-regions (NJ: northern Jutland, NwJ: northwestern Jutland, HL: Himmerland, DL: Djursland, NZ: northern Zealand, CJ: central Jutland, WJ: western Jutland, ED: eastern Denmark).

Download

5.2 Surface water dynamics

CAMELS-DK provides observed streamflow data from 304 hydrological stations and simulated streamflow data for 2942 catchments. However, 388 catchments lack observed or simulated streamflow data because they are located upstream of or in coastal areas without distinct land surface river channels. The simulated streamflow is based on the DK-model, which was calibrated using observed streamflow data from the period 2000–2010; see Sect. 3.3. Figure 4 shows the and Kling–Gupta efficiency (KGE) and Nash–Sutcliffe model efficiency coefficient (NSE) for the DK-model's simulated hydrography, referenced against observations from 1990–1999. The mean NSE is 0.46, and the median value is 0.64. The mean KGE is 0.64, and the median value is 0.69, indicating that the simulated streamflow is satisfactory at many stations, with the simulations and observations showing temporal coherence and consistency.

https://essd.copernicus.org/articles/17/1551/2025/essd-17-1551-2025-f04

Figure 4The performance of DK-model-simulated streamflow at 304 hydrological stations. (a) Spatial distribution of the Nash–Sutcliffe model efficiency coefficient, (b) spatial distribution of the Kling–Gupta efficiency (KGE), and (c) histogram of NSE and KGE.

The DK-model performs well at the national scale but has challenges in central and northern regions. These challenges arise from factors such as geological complexity, insufficient regional parameterization, and limitations in data quality and station coverage over time. Simulations show reduced performance under high-flow and low-flow conditions, as evidenced by lower NSE values in northern Zealand and central and northern Jutland (Fig. 4). The deteriorated performance in regions like central and northern Jutland is attributed to significant geological variability and limited experience with newer parameterization approaches. Smaller stations also tend to exhibit lower performance compared to larger ones, likely due to their sensitivity to geological complexity and inadequate parameter regionalization for drainage flows. More details on the DK-model's performance, including its strengths and limitations, can be found in Stisen et al. (2020). To address these limitations, recent advancements such as deep learning methods and hybrid modeling schemes have been explored to enhance streamflow forecasting (Liu et al., 2024a). Furthermore, the DK-model's performance in ungauged catchments remains insufficiently validated, largely due to the computational demands of model calibration.

https://essd.copernicus.org/articles/17/1551/2025/essd-17-1551-2025-f05

Figure 5Spatial distribution of (a) baseflow index, (b) streamflow–precipitation elasticity, and (c) slope of the flow duration curve.

Groundwater and surface water are closely linked in Denmark (Duque et al., 2023). Figure 5 presents three hydrographic signatures derived from DK-model simulations: baseflow index (BFI), streamflow–precipitation elasticity (QP-elasticity), and the slope of the flow duration curve (FDC_slope). BFI represents the proportion of streamflow that occurs as baseflow. The value is higher in Himmerland, Djursland, and central Jutland, indicating that a significant portion of the streamflow comes from groundwater seepage (Fig. 5a). The groundwater levels are lower in these areas, resulting in the slow and steady release of groundwater into the streams, thereby exhibiting more stable flow conditions throughout the year. Groundwater is a buffer against short-term variations in precipitation in these areas. QP elasticity indicates the sensitivity of streamflow to changes in precipitation, which is low in central Jutland and high in eastern Denmark. This suggests significant regulation of streamflow by groundwater. The FDC_slope also shows higher values in eastern Denmark, indicating highly variable streamflow that is largely due to the quick runoff of rainfall into the stream (Fig. 5c).

https://essd.copernicus.org/articles/17/1551/2025/essd-17-1551-2025-f06

Figure 6Climatology of precipitation, observed streamflow, and DK-model-simulated shallow groundwater levels in the (a) Storå River (the largest river in western Jutland) and (b) Suså River (the largest in eastern Denmark).

Download

Groundwater–surface water interaction is significant at temporal scales as well. Figure 6 shows the climatology (1990–2019) of precipitation, observed streamflow, and depth to phreatic layers in the Storå River (the largest river in western Jutland) and Suså River (the largest river in eastern Denmark), close to each of their outlets. Climatology is calculated by averaging time series values for the same day of the year across 1990–2019. The spatial variability of precipitation is relatively small, and the seasonality is similar across Denmark. The two examples shown in Fig. 6 have a comparable amplitude of precipitation during the year, with the highest precipitation occurring during the fall and winter months and the lowest precipitation during spring. Streamflow, though, has a more pronounced seasonality due to higher evapotranspiration during summer, leading to lower runoff coefficients. Even in the climatology, the peaky streamflow response of the sandier Storå catchment becomes apparent compared to the more clayey Suså catchment. Moreover, summer baseflow in the Storå catchment is higher than in the Suså catchment (see also BFI in Fig. 5a and clay percentage in Fig. 3d), which again is due to the better connectivity between (shallow and deep) groundwater and rivers due to the more sandy and higher-conductivity subsurface conditions in large parts of western Jutland. Linked to similar processes, differences in groundwater dynamics can be seen, with the Storå catchment exhibiting lower groundwater tables but higher seasonal amplitudes of the two example catchments. The steady decline of groundwater levels during the summer months is due to elevated levels of evapotranspiration, limiting groundwater recharge during that time. Groundwater recharge typically starts again around the beginning of September, and streamflow levels follow closely and begin to rise from summer baseflow conditions around the same time.

https://essd.copernicus.org/articles/17/1551/2025/essd-17-1551-2025-f07

Figure 7Groundwater dynamics, abstraction, and groundwater abstraction for irrigation based on CAMELS-DK. The left column illustrates example time series data from a catchment in northern Jutland (catch_id 37220013), while the right column presents a map of time-averaged values for all basins. From top to bottom, the rows represent (a) depth to the phreatic surface, (b) catchment-wide groundwater abstraction with corresponding annual time series, and (c) catchment-wide groundwater abstraction specifically for irrigation purposes.

5.3 Groundwater dynamics

Groundwater supplies nearly 100 % of Danish domestic and industrial water use. Groundwater monitoring and modeling are thus important for sustainable water management in Denmark. Figure 7 shows catchment-aggregated variables from DK-model simulations, such as depth to phreatic layers (Fig. 7a), groundwater extraction for irrigation (Fig. 7b), and reported groundwater abstractions (Fig. 7c) from waterworks. The spatial pattern of phreatic depth shows that the groundwater level is highest in eastern and southern Denmark and lowest in northern Denmark. In some catchments located in Himmerland and central Jutland, the average phreatic depth is around 15 m below the land surface (Fig. 7a). An example of the time series of phreatic depth in a catchment in Himmerland shows an annual amplitude of approximately 5 m. The groundwater table is higher in winter and lower in summer, which aligns with the seasonal variations in precipitation and evapotranspiration. Precipitation deficits can reduce the amplitude of phreatic depth, especially during winter months such as in 1996. Catchments with more shallow groundwater depths typically show smaller yearly amplitudes. Groundwater extraction for domestic and industrial water supply is based on yearly data from the National Well Database (Jupiter, 2023).Groundwater abstraction for irrigation is only available in Jutland since there is no significant irrigation activity in the other parts of Denmark. Irrigation is provided as determined dynamically and demand-based by the DK-model.

6 Dataset structure

The CAMELS-DK dataset is available at https://doi.org/10.22008/FK2/AZXSYP (Koch et al., 2024), with a data description in the file “Data_description.pdf” that details the structure of the dataset. There are four folders after downloading and unzipping the dataset. Landscape attributes including climate indices, topography, hydrological signatures, land use, soil characteristics, and hydrogeological features are saved in a folder named “Attributes”. These static attributes are saved in CSV files, where the index is the eight-digital identifier, and the columns are the names of variables (see Tables 2–5). Time series of climate data, observed streamflow, DK-model-simulated variables (such as depth to phreatic layers, average soil water content, and actual evapotranspiration), and observed yearly groundwater abstractions are available in the folder “Dynamics”. The time series data are saved separately for gauged and ungauged catchments. The shapefiles of catchment boundaries, locations, and the geo-regions of Denmark are provided in a folder named “Shapefile”. The Python script for processing the time series and landscape attributes based on original datasets is provided in a folder named “Python”.

7 Data availability

The compiled CAMELS-DK dataset is available via the GEUS Dataverse under a Creative Commons Zero license at https://doi.org/10.22008/FK2/AZXSYP (Koch et al., 2024). The original (raw) datasets used to prepare the CAMELS-DK dataset are referenced in Tables 1–7, including source URLs. The ID15 catchment shapefile is included in CAMELS-DK in the valid version as of the time of the compilation of CAMELS-DK. For updates to the ID15 catchments, please contact the data provider by writing to id15@ecos.au.dk.

8 Conclusion

CAMELS-DK is a freely available dataset that includes extensive hydrometeorological variables and landscape features for 3330 catchments (304 gauged catchments) covering the entirety of Denmark. This dataset is designed to support large-scale, data-driven studies, aligning with the research topics addressed by other CAMELS-style and LSH datasets. Its development follows established conventions, including daily time series of hydrometeorological variables spanning 1989 to 2023, as well as catchment landscape attributes such as climate, soil type, land use/cover, and geology. This ensures seamless integration with other CAMELS-style datasets for large-sample studies.

What sets CAMELS-DK apart is the inclusion of observed annual groundwater abstraction and simulated variables from a process-based hydrological model (PBM) – the National Hydrological Model of Denmark – including simulated discharge, soil water content, and groundwater head. These additional dynamic variables, particularly the groundwater-relevant variables, provide valuable insights for surface water modeling despite being based on simulations, as they reflect the close interactions between groundwater and surface water in many Danish catchments. We anticipate that this addition of PBM simulations will not only enhance data-driven model performance for surface water modeling in Denmark, but also benchmark and facilitate the development of physics-informed ML algorithms, transfer learning approaches, and hybrid modeling frameworks. We provide detailed instructions on how to access the source data, as we are not permitted to share it directly with CAMELS-DK. For users who wish to calculate catchment-aggregated variables, Python scripts are included for data processing, which can be used to derive spatially aggregated time series and attributes for 3330 catchments.

Appendix A

Table A1Additional hydrological signatures.

Download Print Version | Download XLSX

Appendix B: Comparison of simulated and observed signatures

The DK-model has been calibrated jointly against observed streamflow at around 300 hydrological stations; the performance of the DK-model-simulated streamflow is satisfactory at many stations (Stisen et al., 2020). Here, we display some comparison results of signatures derived from the DK-model-simulated streamflow and the observations. The magnitude of streamflow, such as the mean streamflow, between simulated streamflows is aligned with the observations (Fig. B1a). The mean streamflow is lower than 5 m3 s−1 at many stations (96 %), and 18 stations are in large rivers with a daily average streamflow higher than 5 m3 s−1. The DK-model tends to overestimate the BFI (Fig. B1b), indicating a slightly smoother streamflow hydrograph simulated by the DK-model. The precipitation elasticity of streamflow, which indicates the sensitivity of streamflow to precipitation, is still challenging to accurately capture with the model (Fig. B1c). The slope of flow duration curves (FDC_slope) quantifies the variability of hydrographs. A steep slope indicates a highly variable stream, where flow is primarily driven by the quick runoff of rainfall to the stream. The simulated hydrography tends to overestimate the FDC_slope for highly variable streams but underestimate it for less variable streams (Fig. B1d).

https://essd.copernicus.org/articles/17/1551/2025/essd-17-1551-2025-f08

Figure B1Comparison of observed signatures and DK-model-simulated signatures.

Download

Author contributions

JK, RJMS, and JL conceptualized and formulated the overarching research ideas and collected data from different sources. JL processed the data, conducted the formal analysis, and prepared the original draft under the supervision of RJMS, JK, and SS. MFTH assisted with data processing. LT, ALH, HT, and SS provided the ID15 catchments, validated the data sources, and reviewed the manuscript.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The dataset presented in this paper was compiled as part of the establishment of the Danish flood warning system (varslingssystem for oversvømmelser) from 2023 to 2026. The project is being led by the Danish Meteorological Institute (DMI) in cooperation with the Geological Survey of Denmark and Greenland (GEUS), the Danish Agency for Climate Data (KDS), the Danish Environmental Protection Agency (MST), the Danish Coastal Authority (KDI), and the Danish Environmental Portal (DMP). We acknowledge the editor Conrad Jackisch and the reviewers Alexander Dolich, Ather Abbas, Vazken Andréassian, and Ashutosh Sharma for their valuable feedback and constructive suggestions on the manuscript and dataset.

Review statement

This paper was edited by Conrad Jackisch and reviewed by Alexander Dolich and Ashutosh Sharma.

References

Abbott, M. B., Bathurst, J. C., Cunge, J. A., O'Connell, P. E., and Rasmussen, J.: An introduction to the European Hydrological System – Systeme Hydrologique Europeen, “SHE”, 1: History and philosophy of a physically-based, distributed modelling system, J. Hydrol., 87, 45–59, https://doi.org/10.1016/0022-1694(86)90114-9, 1986. 

Acuña Espinoza, E., Loritz, R., Álvarez Chaves, M., Bäuerle, N., and Ehret, U.: To bucket or not to bucket? Analyzing the performance and interpretability of hybrid hydrological models with dynamic parameterization, Hydrol. Earth Syst. Sci., 28, 2705–2719, https://doi.org/10.5194/hess-28-2705-2024, 2024. 

Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: The CAMELS data set: catchment attributes and meteorology for large-sample studies, Hydrol. Earth Syst. Sci., 21, 5293–5313, https://doi.org/10.5194/hess-21-5293-2017, 2017. 

Addor, N., Do, H. X., Alvarez-Garreton, C., Coxon, G., Fowler, K., and Mendoza, P. A.: Large-sample hydrology: recent progress, guidelines for new datasets and grand challenges, Hydrol. Sci. J., 65, 712–725, https://doi.org/10.1080/02626667.2019.1683182, 2020. 

Adhikari, K., Kheir, R. B., Greve, M. B., Bøcher, P. K., Malone, B. P., Minasny, B., McBratney, A. B., and Greve, M. H.: High-resolution 3-D mapping of soil texture in Denmark, Soil Sci. Soc. Am. J., 77, 860–876, 2013. 

Alvarez-Garreton, C., Mendoza, P. A., Boisier, J. P., Addor, N., Galleguillos, M., Zambrano-Bigiarini, M., Lara, A., Puelma, C., Cortes, G., Garreaud, R., McPhee, J., and Ayala, A.: The CAMELS-CL dataset: catchment attributes and meteorology for large sample studies – Chile dataset, Hydrol. Earth Syst. Sci., 22, 5817–5846, https://doi.org/10.5194/hess-22-5817-2018, 2018. 

Andersen, R. C.: Undersøgelse af DMI's Nedbørsdata til Anvendelse for Hydrologiske Formål, Danish Meteorological Institute, Tekniske rapport 21–40, https://www.dmi.dk/fileadmin/Rapporter/2021/Undersoegelser_af_DMI_s_nedboersdata_til_anvendelse_for_hydrologiske_formaal.pdf (last access: 9 April 2025), 2021. 

Andersson, J. C. M., Pechlivanidis, I. G., Gustafsson, D., Donnelly, C., and Arheimer, B.: Key factors for improving large-scale hydrological model performance, Eur. Water, 49, 77–88, 2015. 

Bathelemy, R., Brigode, P., Andréassian, V., Perrin, C., Moron, V., Gaucherel, C., Tric, E., and Boisson, D.: Simbi: historical hydro-meteorological time series and signatures for 24 catchments in Haiti, Earth Syst. Sci. Data, 16, 2073–2098, https://doi.org/10.5194/essd-16-2073-2024, 2024. 

Büttner, G., Feranec, J., Jaffrain, G., Mari, L., Maucha, G., and Soukup, T.: The CORINE land cover 2000 project, EARSeL eProceed., 3, 331–346, 2004. 

Chagas, V. B. P., Chaffe, P. L. B., Addor, N., Fan, F. M., Fleischmann, A. S., Paiva, R. C. D., and Siqueira, V. A.: CAMELS-BR: hydrometeorological time series and landscape attributes for 897 catchments in Brazil, Earth Syst. Sci. Data, 12, 2075–2096, https://doi.org/10.5194/essd-12-2075-2020, 2020. 

Clerc-Schwarzenbach, F., Selleri, G., Neri, M., Toth, E., van Meerveld, I., and Seibert, J.: Large-sample hydrology – a few camels or a whole caravan?, Hydrol. Earth Syst. Sci., 28, 4219–4237, https://doi.org/10.5194/hess-28-4219-2024, 2024. 

Climate Data Agency: Denmark's Elevation Model – Surface.9Climate Data Agency [data set], https://dataforsyningen.dk/data/928 (last access: 30 May 2023), 2022. 

Coxon, G., Addor, N., Bloomfield, J. P., Freer, J., Fry, M., Hannaford, J., Howden, N. J. K., Lane, R., Lewis, M., Robinson, E. L., Wagener, T., and Woods, R.: CAMELS-GB: hydrometeorological time series and landscape attributes for 671 catchments in Great Britain, Earth Syst. Sci. Data, 12, 2459–2483, https://doi.org/10.5194/essd-12-2459-2020, 2020. 

Danapour, M., Fienen, M. N., Højberg, A. L., Jensen, K. H., and Stisen, S.: Multi-Constrained Catchment Scale Optimization of Groundwater Abstraction Using Linear Programming., Ground Water, 59, 503–516, https://doi.org/10.1111/gwat.13083, 2021. 

Danish Environmental Protection Agency: Lakes and watercourses, https://en.lbst.dk/water/lakes-and-watercourses (last access: 24 May 2023), 2023. 

Da Silva, J. S., Calmant, S., Seyler, F., Rotunno Filho, O. C., Cochonneau, G., and Mansur, W. J.: Water levels in the Amazon basin derived from the ERS 2 and ENVISAT radar altimetry missions, Remote Sens. Environ., 114, 2160–2181, 2010. 

DCE: The Surface Water Database (overfladevandsdatabasen ODA), https://odaforalle.au.dk (last access: 12 November 2020), 2020. 

Demir, I., Xiang, Z., Demiray, B., and Sit, M.: WaterBench-Iowa: a large-scale benchmark dataset for data-driven streamflow forecasting, Earth Syst. Sci. Data, 14, 5605–5616, https://doi.org/10.5194/essd-14-5605-2022, 2022. 

DHI: MIKE SHE User Guide and Reference Manual, https://manuals.mikepoweredbydhi.help/latest/Water_Resources/MIKE_SHE_Print.pdf (last access: 1 November 2022), 2020. 

DMI: Climate Data API, https://opendatadocs.dmi.govcloud.dk/en/Data/Climate_Data (last access: 2 July 2024), 2024. 

Duque, C., Nilsson, B., and Engesgaard, P.: Groundwater–surface water interaction in Denmark, Wiley Interdiscip. Rev. Water, 10, 1–23, https://doi.org/10.1002/wat2.1664, 2023. 

Euser, T., Winsemius, H. C., Hrachowitz, M., Fenicia, F., Uhlenbrook, S., and Savenije, H. H. G.: A framework to assess the realism of model structures using hydrological signatures, Hydrol. Earth Syst. Sci., 17, 1893–1912, https://doi.org/10.5194/hess-17-1893-2013, 2013. 

Fowler, K. J. A., Acharya, S. C., Addor, N., Chou, C., and Peel, M. C.: CAMELS-AUS: Hydrometeorological time series and landscape attributes for 222 catchments in Australia, Earth Syst. Sci. Data, 13, 3847–3867, https://doi.org/10.5194/essd-13-3847-2021, 2021. 

Frame, J., Kratzert, F., Raney, A., Rahman, M., Salas, F., and Nearing, G.: Post-Processing the National Water Model with Long Short-Term Memory Networks for Streamflow Predictions and Model Diagnostics, JAWRA J. Am. Water Resour. Assoc., 57, 885–905, https://doi.org/10.1111/1752-1688.12964, 2021. 

Gnann, S. J., Coxon, G., Woods, R. A., Howden, N. J. K., and McMillan, H. K.: TOSSH: A toolbox for streamflow signatures in hydrology, Environ. Model. Softw., 138, 104983, https://doi.org/10.1016/j.envsoft.2021.104983, 2021. 

GRDC (Global Runoff Data Centre): GRDC Major River Basins, GRDC, 2nd rev. ed., Koblenz, Federal Institute of Hydrology (BfG), https://grdc.bafg.de/products/basin_layers/major_rivers/ (last access: 10 April 2025), 2020. 

Hansen, M. and Pjetursson, B.: Free, online Danish shallow geological data, Geus Bull., 23, 53–56, 2011. 

Hao, Z., Jin, J., Xia, R., Tian, S., Yang, W., Liu, Q., Zhu, M., Ma, T., Jing, C., and Zhang, Y.: CCAM: China Catchment Attributes and Meteorology dataset, Earth Syst. Sci. Data, 13, 5591–5616, https://doi.org/10.5194/essd-13-5591-2021, 2021. 

Helgason, H. B. and Nijssen, B.: LamaH-Ice: LArge-SaMple DAta for Hydrology and Environmental Sciences for Iceland, Earth Syst. Sci. Data, 16, 2741–2771, https://doi.org/10.5194/essd-16-2741-2024, 2024. 

Henriksen, H. J., Troldborg, L., Højberg, A. L., and Refsgaard, J. C.: Assessment of exploitable groundwater resources of Denmark by use of ensemble resource indicators and a numerical groundwater–surface water model, J. Hydrol., 348, 224–240, https://doi.org/10.1016/j.jhydrol.2007.09.056, 2008. 

Henriksen, H. J., Voutchkova, D., Troldborg, L., Ondracek, M., Schullehner, J., and Hansen, B.: National Vandressource Model 20 National Vandressource Model Beregning af udnyttelsesgrader, afsænkning, Geological Survey of Denmark and Greenland (GEUS), https://data.geus.dk/pure-pdf/GEUS-R_2019_32_web.pdf (last access: 9 April 2025), 2019. 

Henriksen, H. J., Kragh, S. J., Gotfredsen, J., Ondracek, M., van Til, M., Jakobsen, A., Schneider, R. J. M., Koch, J., Troldborg, L., Rasmussen, P., Pasten-Zapata, E., and Stisen, S.: Udvikling af landsdækkende modelberegninger af terrænnære hydrologiske forhold i 100m grid ved anvendelse af DK-modellen: Dokumentationsrapport vedr. modelleverancer til Hydrologisk Informations- og Prognosesystem, Udarbejdet som en del af Den Fællesoffen, GEUS, https://doi.org/10.22008/gpub/38113, 2021. 

Henriksen, H. J., Ondracek, M., and Troldborg, L.: Vandressourceopgørelse – datarapport. Baggrundsrapport til Miljøstyrelsens samlede afrapportering omkring forvaltning af fremtidens drikkevandsressource. Metode, resultater, usikkerheder og forventede klimapåvirkninger, GEUS, https://doi.org/10.22008/gpub/34675, 2023. 

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., and Schepers, D.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, 2020. 

Hiederer, R.: Mapping soil properties for Europe-spatial representation of soil database attributes, Publ. Off. Eur. Union, https://doi.org/10.2788/94128, 2013. 

Höge, M., Kauzlaric, M., Siber, R., Schönenberger, U., Horton, P., Schwanbeck, J., Floriancic, M. G., Viviroli, D., Wilhelm, S., Sikorska-Senoner, A. E., Addor, N., Brunner, M., Pool, S., Zappa, M., and Fenicia, F.: CAMELS-CH: hydro-meteorological time series and landscape attributes for 331 catchments in hydrologic Switzerland, Earth Syst. Sci. Data, 15, 5755–5784, https://doi.org/10.5194/essd-15-5755-2023, 2023. 

Højberg, A. L., Troldborg, L., Stisen, S., Christensen, B. B. S., and Henriksen, H. J.: Stakeholder driven update and improvement of a national water resources model, Environ. Model. Softw., 40, 202–213, https://doi.org/10.1016/j.envsoft.2012.09.010, 2013. 

Højberg, A. L., Hoffmann, C. C., Thodsen, H., Børgesen, C. D., Tornbjerg, H., Nordstrøm, B. O., Troldborg, L., Kjeldgaard, A., Holm, H., Audet, J., Ellermann, T., Christensen, J. H., Bach, E. O., and Pedersen, B. F.: National kvælstofmodel – version 2020, Metoderapport. De nationale geologiske undersøgelser for Danmark og Grønland, GEUS Specialrapport, 2021. 

Jehn, F. U., Bestian, K., Breuer, L., Kraft, P., and Houska, T.: Using hydrological and climatic catchment clusters to explore drivers of catchment behavior, Hydrol. Earth Syst. Sci., 24, 1081–1100, https://doi.org/10.5194/hess-24-1081-2020, 2020. 

Jupiter: GEUS National Well Database, Jupiter, https://www.geus.dk/produkter-ydelser-og-faciliteter/data-og-kort/national-boringsdatabase-jupiter (last access: 24 May 2024), 2023. 

Kibler, K. M., Biswas, R. K., and Lucas, A. M. J.: Hydrologic data as a human right? Equitable access to information as a resource for disaster risk reduction in transboundary river basins, Water Policy, 16, 36–58, https://doi.org/10.2166/wp.2014.307, 2014. 

Klingler, C., Schulz, K., and Herrnegger, M.: LamaH-CE: LArge-SaMple DAta for Hydrology and Environmental Sciences for Central Europe, Earth Syst. Sci. Data, 13, 4529–4565, https://doi.org/10.5194/essd-13-4529-2021, 2021. 

Koch, J. and Schneider, R.: Long short-term memory networks enhance rainfall-runoff modelling at the national scale of Denmark, GEUS Bull., 49, 1–7, https://doi.org/10.34194/geusb.v49.8292, 2022. 

Koch, J., Gotfredsen, J., Schneider, R., Troldborg, L., Stisen, S., and Henriksen, H. J.: High Resolution Water Table Modeling of the Shallow Groundwater Using a Knowledge-Guided Gradient Boosting Decision Tree Model, Front. Water, 3, 1–14, https://doi.org/10.3389/frwa.2021.701726, 2021. 

Koch, J., Liu, J., Stisen, S., Troldborg, L., Højberg, A. L., Thodsen, H., Hansen, M. F. T., and Schneider, R. J. M.: CAMELS-DK: Hydrometeorological Time Series and Landscape Attributes for 3330 Catchments in Denmark, GEUS Dataverse [data set], https://doi.org/10.22008/FK2/AZXSYP, 2024. 

Konapala, G., Kao, S. C., Painter, S. L., and Lu, D.: Machine learning assisted hybrid models can improve streamflow simulation in diverse catchments across the conterminous US, Environ. Res. Lett., 15, 104022, https://doi.org/10.1088/1748-9326/aba927, 2020. 

Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, https://doi.org/10.5194/hess-22-6005-2018, 2018. 

Kratzert, F., Klotz, D., Herrnegger, M., Sampson, A. K., Hochreiter, S., and Nearing, G. S.: Toward Improved Predictions in Ungauged Basins: Exploiting the Power of Machine Learning, Water Resour. Res., 55, 11344–11354, https://doi.org/10.1029/2019WR026065, 2019. 

Kratzert, F., Klotz, D., Hochreiter, S., and Nearing, G. S.: A note on leveraging synergy in multiple meteorological data sets with deep learning for rainfall–runoff modeling, Hydrol. Earth Syst. Sci., 25, 2685–2703, https://doi.org/10.5194/hess-25-2685-2021, 2021. 

Kratzert, F., Nearing, G., Addor, N., Erickson, T., Gauch, M., Gilon, O., Gudmundsson, L., Hassidim, A., Klotz, D., Nevo, S., Shalev, G., and Matias, Y.: Caravan – A global community dataset for large-sample hydrology, Sci. Data, 10, 1–11, https://doi.org/10.1038/s41597-023-01975-w, 2023. 

Ladson, A. R., Brown, R., Neal, B., and Nathan, R.: A standard approach to baseflow separation using the Lyne and Hollick filter, Australas. J. Water Resour., 17, 25–34, https://doi.org/10.7158/13241583.2013.11465417, 2013. 

Lehner, B., Verdin, K., and Jarvis, A.: New global hydrography derived from spaceborne elevation data, EOS T. Am. Geophys. Un., 89, 93–94, 2008. 

Levin, G.: Basemap04: Documentation of the data and method for the elaboration of a land use and land cover map for Denmark, Aarhus University, DCE – Danish Centre for Environment and Energy, 77 pp., Technical Report No. 252, http://dce2.au.dk/pub/TR252.pdf (last access: 9 April 2025), 2022. 

Liu, J., Koch, J., Stisen, S., Troldborg, L., and Schneider, R. J. M.: A national-scale hybrid model for enhanced streamflow estimation – consolidating a physically based hydrological model with long short-term memory (LSTM) networks, Hydrol. Earth Syst. Sci., 28, 2871–2893, https://doi.org/10.5194/hess-28-2871-2024, 2024a. 

Liu, J., Bian, Y., Lawson, K., and Shen, C.: Probing the limit of hydrologic predictability with the Transformer network, J. Hydrol., 637, 131389, https://doi.org/10.1016/j.jhydrol.2024.131389, 2024b. 

Ma, K., Feng, D., Lawson, K., Tsai, W., Liang, C., Huang, X., Sharma, A., and Shen, C.: Transferring Hydrologic Data Across Continents – Leveraging Data-Rich Regions to Improve Hydrologic Prediction in Data-Sparse Regions, Water Resour. Res., 57, e2020WR028600, https://doi.org/10.1029/2020WR028600, 2021. 

Mahmood, H., Schneider, R. J. M., Frederiksen, R. R., Christiansen, A. V., and Stisen, S.: Using jointly calibrated fine-scale drain models across Denmark to assess the influence of physical variables on spatial drain flow patterns, J. Hydrol. Reg. Stud., 46, 101353, https://doi.org/10.1016/j.ejrh.2023.101353, 2023. 

Mai, J., Shen, H., Tolson, B. A., Gaborit, É., Arsenault, R., Craig, J. R., Fortin, V., Fry, L. M., Gauch, M., Klotz, D., Kratzert, F., O'Brien, N., Princz, D. G., Rasiya Koya, S., Roy, T., Seglenieks, F., Shrestha, N. K., Temgoua, A. G. T., Vionnet, V., and Waddell, J. W.: The Great Lakes Runoff Intercomparison Project Phase 4: the Great Lakes (GRIP-GL), Hydrol. Earth Syst. Sci., 26, 3537–3572, https://doi.org/10.5194/hess-26-3537-2022, 2022. 

Martinsen, G., Bessiere, H., Caballero, Y., Koch, J., Collados-Lara, A. J., Mansour, M., Sallasmaa, O., Pulido-Velazquez, D., Williams, N. H., Zaadnoordijk, W. J., and Stisen, S.: Developing a pan-European high-resolution groundwater recharge map – Combining satellite data and national survey data using machine learning, Sci. Total Environ., 822, 153464, https://doi.org/10.1016/j.scitotenv.2022.153464, 2022. 

McMillan, H.: Linking hydrologic signatures to hydrologic processes: A review, Hydrol. Process., 34, 1393–1409, 2020. 

McMillan, H. K., Westerberg, I. K., and Krueger, T.: Hydrological data uncertainty and its implications, Wiley Interdiscip. Rev. Water, 5, 1–14, https://doi.org/10.1002/WAT2.1319, 2018. 

Meyer Oliveira, A., van Meerveld, H. J., Vis, M., and Seibert, J.: Assessment of the Value of Remotely Sensed Surface Water Extent Data for the Calibration of a Lumped Hydrological Model, Water Resour. Res., 59, 1–19, https://doi.org/10.1029/2023WR034875, 2023. 

Møller, A. B., Beucher, A., Iversen, B. V., and Greve, M. H.: Geoderma Predicting arti fi cially drained areas by means of a selective model ensemble, Geoderma, 320, 30–42, https://doi.org/10.1016/j.geoderma.2018.01.018, 2018a. 

Møller, A. B., Beucher, A., Iversen, B. V, and Greve, M. H.: Predicting artificially drained areas by means of a selective model ensemble, Geoderma, 320, 30–42, 2018b. 

Nearing, G., Cohen, D., Dube, V., Gauch, M., Gilon, O., Harrigan, S., Hassidim, A., Klotz, D., Kratzert, F., Metzger, A., Nevo, S., Pappenberger, F., Prudhomme, C., Shalev, G., Shenzis, S., Tekalign, T. Y., Weitzner, D., and Matias, Y.: Global prediction of extreme floods in ungauged watersheds, Nature, 627, 559–563, https://doi.org/10.1038/s41586-024-07145-1, 2024. 

Newman, A. J., Clark, M. P., Sampson, K., Wood, A., Hay, L. E., Bock, A., Viger, R. J., Blodgett, D., Brekke, L., Arnold, J. R., Hopson, T., and Duan, Q.: Development of a large-sample watershed-scale hydrometeorological data set for the contiguous USA: data set characteristics and assessment of regional variability in hydrologic model performance, Hydrol. Earth Syst. Sci., 19, 209–223, https://doi.org/10.5194/hess-19-209-2015, 2015. 

Ondracek, M.: Raster geodatabase “GeoKomp” i geotiff format, GEUS Dataverse [data set], https://doi.org/10.22008/FK2/UP1PBJ/E3WYBX (last access: 26 March 2024), 2023. 

Pedersen, S. A. S., Hermansen, B., Nathan, C., and Tougaard, L.: Digitalt kort over Danmarks jordarter 1:200.000, version 2. Geologisk kort over de overfladenære jordarter i Danmark, GEUS, https://doi.org/10.22008/gpub/28464, 2011a. 

Pedersen, S. A. S., Hermansen, B., Nathan, C., and Tougaard, L.: Jordartskort over Danmark 1:200 000, GEUS Dataverse [data set], https://doi.org/10.22008/FK2/AAEEMN, 2011b. 

Refsgaard, J. C., Stisen, S., and Koch, J.: Hydrological process knowledge in catchment modelling – Lessons and perspectives from 60 years development, Hydrol. Process., 36, 1–20, https://doi.org/10.1002/hyp.14463, 2022. 

Sawicz, K., Wagener, T., Sivapalan, M., Troch, P. A., and Carrillo, G.: Catchment classification: empirical analysis of hydrologic similarity based on catchment function in the eastern USA, Hydrol. Earth Syst. Sci., 15, 2895–2911, https://doi.org/10.5194/hess-15-2895-2011, 2011. 

Scharling, M.: Klimagrid Danmark – Nedbør, lufttemperatur og potentiel fordampning 20 × 20 & 40 × 40 km – Metodebeskrivelse, Danish Meteorol. Inst., https://www.dmi.dk/fileadmin/user_upload/Rapporter/TR/1999/tr99-12.pdf (last access: 9 April 2025), 1999a. 

Scharling, M.: Klimagrid Danmark Nedbør 10 × 10 km (ver. 2) – Metodebeskrivelse, Danish Meteorol. Inst., 15–17, https://www.dmi.dk/fileadmin/user_upload/Rapporter/TR/1999/tr99-15.pdf (last access: 9 April 2025), 1999b. 

Schneider, R., Koch, J., Troldborg, L., Henriksen, H. J., and Stisen, S.: Machine-learning-based downscaling of modelled climate change impacts on groundwater table depth, Hydrol. Earth Syst. Sci., 26, 5859–5877, https://doi.org/10.5194/hess-26-5859-2022, 2022. 

Seidenfaden, I. K., Sonnenborg, T. O., Stisen, S., and Kidmose, J.: Quantification of climate change sensitivity of shallow and deep groundwater in Denmark, J. Hydrol. Reg. Stud., 41, 101100, https://doi.org/10.1016/j.ejrh.2022.101100, 2022. 

Singh, R., Archfield, S. A., and Wagener, T.: Identifying dominant controls on hydrologic parameter transfer from gauged to ungauged catchments – A comparative hydrology approach, J. Hydrol., 517, 985–996, https://doi.org/10.1016/j.jhydrol.2014.06.030, 2014. 

Soltani, M., Bjerre, E., Koch, J., and Stisen, S.: Integrating remote sensing data in optimization of a national water resources model to improve the spatial pattern performance of evapotranspiration, J. Hydrol., 603, 127026, https://doi.org/10.1016/j.jhydrol.2021.127026, 2021. 

Stisen, S., Sonnenborg, T. O., Højberg, A. L., Troldborg, L., and Refsgaard, J. C.: Evaluation of Climate Input Biases and Water Balance Issues Using a Coupled Surface-Subsurface Model, Vadose Zone J., 10, 37–53, https://doi.org/10.2136/vzj2010.0001, 2011. 

Stisen, S., Ondracek, M., Troldborg, L., Schneider, R. J. M., and Til, M. J. van: National Vandressource Model, Modelopstilling og kalibrering af DK-model 2019, GEUS, Copenhagen, https://doi.org/10.22008/gpub/32631, 2020.  

Tang, S., Sun, F., Liu, W., Wang, H., Feng, Y., and Li, Z.: Optimal Postprocessing Strategies With LSTM for Global Streamflow Prediction in Ungauged Basins, Water Resour. Res., 59, 1–16, https://doi.org/10.1029/2022WR034352, 2023. 

Tegegn, Z., Abebe, A., and Agide, Z.: Understanding Catchments' Hydrologic Response Similarity of Upper Blue Nile (Abay) basin through catchment classification, Model. Earth Syst. Environ., 8, 3305–3323, https://doi.org/10.1007/s40808-021-01298-y, 2022. 

Tóth, B., Weynants, M., Pásztor, L., and Hengl, T.: 3D soil hydraulic database of Europe at 250 m resolution, Hydrol. Process., 31, 2662–2666, https://doi.org/10.1002/hyp.11203, 2017. 

Tshimanga, R. M., Bola, G. B., Kabuya, P. M., Nkaba, L., Neal, J., Hawker, L., Trigg, M. A., Bates, P. D., Hughes, D. A., and Laraque, A.: Towards a framework of catchment classification for hydrologic predictions and water resources management in the ungauged basin of the Congo River: An a priori approach, in: Congo Basin Hydrology, Climate, and Biogeochemistry: A Foundation for the Future, edited by: Tshimanga, R. M., Moukandi N'kaya, G. D., and Alsdorf, D., American Geophysical Union, 469–498, https://doi.org/10.1002/9781119657002.ch24, 2022. 

Van Kraalingen, D. W. G. and Stol, W.: Evapotranspiration modules for crop growth simulation. Implementation of the algorithms from Penman, Makkink and Priestley-Taylor, AB-DLO, https://edepot.wur.nl/4413 (last access: 9 April 2025), 1997. 

Van Loon, A. F.: Hydrological drought explained, Wiley Interdiscip. Rev. Water, 2, 359–392, https://doi.org/10.1002/WAT2.1085, 2015. 

Wilbrand, K., Taormina, R., ten Veldhuis, M.-C., Visser, M., Hrachowitz, M., Nuttall, J., and Dahm, R.: Predicting streamflow with LSTM networks using global datasets, Front. Water, 5, 1166124, https://doi.org/10.3389/frwa.2023.1166124, 2023. 

Wilkinson, M. D., Dumontier, M., Aalbersberg, Ij. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J. W., da Silva Santos, L. B., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., Gonzalez-Beltran, A., Gray, A. J. G., Groth, P., Goble, C., Grethe, J. S., Heringa, J., t Hoen, P. A. C., Hooft, R., Kuhn, T., Kok, R., Kok, J., Lusher, S. J., Martone, M. E., Mons, A., Packer, A. L., Persson, B., Rocca-Serra, P., Roos, M., van Schaik, R., Sansone, S. A., Schultes, E., Sengstag, T., Slater, T., Strawn, G., Swertz, M. A., Thompson, M., Van Der Lei, J., Van Mulligen, E., Velterop, J., Waagmeester, A., Wittenburg, P., Wolstencroft, K., Zhao, J., and Mons, B.: Comment: The FAIR Guiding Principles for scientific data management and stewardship, Sci. Data, 3, 1–9, https://doi.org/10.1038/sdata.2016.18, 2016. 

Yin, H., Guo, Z., Zhang, X., Chen, J., and Zhang, Y.: RR-Former: Rainfall-runoff modeling based on Transformer, J. Hydrol., 609, 127781, https://doi.org/10.1016/j.jhydrol.2022.127781, 2022. 

Download
Short summary
We developed a CAMELS-style dataset in Denmark, which contains hydrometeorological time series and landscape attributes for 3330 catchments (304 gauged). Many catchments in CAMELS-DK are small and at low elevations. The dataset provides information on groundwater characteristics and dynamics, as well as quantities related to the human impact on the hydrological system in Denmark. The dataset is especially relevant for developing data-driven and hybrid physically informed modeling frameworks.
Share
Altmetrics
Final-revised paper
Preprint