the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
CAMELS-LUX: Highly Resolved Hydro-Meteorological and Atmospheric Data for Physiographically Characterized Catchments around Luxembourg
Abstract. Harmonized large-sample datasets have become a central pillar of hydrological research, particularly in the machinelearning era, where data-based algorithms and machine-learning techniques are gaining increasing importance in daily life. The CAMELS-LUX dataset (Catchment Attributes and MEteorology for Large-sample Studies – LUXembourg) described here covers 56 nested catchments (0.46 km2 – 4256.62 km2) that contribute to the Luxembourgish stream network. While Luxembourg has a relatively homogeneous climate, the physiography varies significantly on a small scale making it a suitable study area for investigating different hydrological processes, such as runoff generation or groundwater recharge. The CAMELS-LUX dataset contains hydrological observations, meteorological data, and atmospheric reanalysis data from 2004–2021. Moreover, comprehensive physiographic catchment characteristics are provided that incorporate geology classes, land use classes, and a range of topographic indices. CAMELS-LUX is distinctive as the first dataset in the CAMELS series that offers data at three different temporal resolutions: daily, hourly, and in a 15-minute time step. Furthermore, CAMELS-LUX includes a series of flash floods in 2016 and 2018 as well as major large floods in 2010 and 2021. The extensive information contained in CAMELS-LUX is instrumental in advancing our understanding of varying discharge behaviour within Luxembourg and beyond. The CAMELS-LUX dataset has been utilized to develop and train a Long-Short-Term-Memory (LSTM) model, that simulates discharge time series, providing a benchmark for subsequent hydrological modelling efforts in the area. The model based on this dataset sufficiently reproduces hydrological rainfall-runoff dependencies and can be applied to simulate discharge in sparsely gauged basins for approximation. The CAMELS-LUX dataset is available on zenodo: https://doi.org/10.5281/zenodo.13846619 (Nijzink et al., 2024).
- Preprint
(1439 KB) - Metadata XML
-
Supplement
(52 KB) - BibTeX
- EndNote
Status: final response (author comments only)
-
CC1: 'Comment on essd-2024-482', Ather Abbas, 28 Jun 2025
-
AC2: 'Reply on CC1', Judith Nijzink, 30 Nov 2025
Dear Ather Abbas,
Thank you for using the CAMELS-LUX dataset and pointing us to this inconsistency! Having checked this issue, duplicates seem to be limited to 29 hours on 15-16 April 2006 in basin 16. We have corrected these duplicates in the new, updated version of CAMELS-LUX: https://doi.org/10.5281/zenodo.17621594 .
Citation: https://doi.org/10.5194/essd-2024-482-AC2
-
AC2: 'Reply on CC1', Judith Nijzink, 30 Nov 2025
-
RC1: 'Comment on essd-2024-482', Franziska Clerc-Schwarzenbach, 10 Jul 2025
Dear authors, please find my review of the CAMELS-LUX manuscript and data set in the attachment.
Kind regards, Franziska Clerc-Schwarzenbach
- AC1: 'Reply on RC1', Judith Nijzink, 26 Oct 2025
-
RC2: 'Comment on essd-2024-482', Anonymous Referee #2, 08 Jan 2026
This manuscript introduces a CAMELS dataset for 56 nested catchments in Luxemberg. The datasets are in suitable format and easy to interpret. This paper is a first attempt among CAMELS datasets providing atmospheric data and the authors also tried to indicate and focus on two flash floods within the study period. The manuscript lies well within the scope of ESSD. However, there are certain comments required to be addressed before the publication.
Major comments:
- After the introduction, the manuscript should first provide the description of the study area. Section 3 (regional context, climate and hydrology) should be moved ahead of the data description. Additionally, the current Figure 4, which displays the spatial distribution of the 56 nested catchments should be placed at the beginning of the manuscript as Figure 1 to give readers early contextual understanding of the study area.
- Line 71: The flags 0 and 1 are not mentioned anywhere in supplement S2. What does flag 2, 3 and 4 in the supplement S2 signify?
- Table B1, which contains catchment names is not cited anywhere in the manuscript. Please cite this table appropriately. Additionally, the catchment name ‘White Ernz’ in line 187 was not found in the list of Table 2. This needs clarification or correction.
- Sections 4.2, 4.3 and 4.4 lack explanations on how these indices are varying spatially across Luxemberg and what are the potential reasons behind such variation.
- What does n stand for in equation 9? Please provide a clear definition.
- As it is mentioned that CAMELS-LUX includes a series of flash floods that occurred in 2016 and 2018, add more details on what difference can be observed in the parameters of the affected catchment quantitatively. What were the distinct observations from the data during this period which strongly indicates flash floods? Section 6.2 describes atmospheric parameters characterizing thunderstorms, however, it is not clear from the paragraph that the increase/ decrease in the parameters mentioned refers to which catchments.
- Line 318: Provide numbers/ ranges showing the increase/decrease in the parameters such as specific humidity, q and total column water vapour.
Similarly, in line 322 provide a range by how much did CAPE and K index have increased.
- The manuscript would benefit from a section outlining dataset limitations and possible directions for future enhancements.
Comments on data:
- Naming of the time series files of each catchment is same in all the three folders of time scales ‘15 min’, ‘hourly’ and ‘daily’. This can be confusing. It is not possible to simultaneously open the csv files of the same catchment for the 15 min, hourly and daily scale because of the same file name. Therefore, it is suggested to distinguish the files names of catchments for the three different time scales.
- In many of the randomly picked time series csv files, Qflag was noticed to be zero throughout the column. Please check if this value is constantly zero everywhere. If yes, then what is the purpose of this parameter in the time series?
- In the shapefile of catchments, add the name of catchment and gauge_id columns. Although, grid code is already mentioned, it would be more convenient if instead of grid code, gauge_ids are mentioned.
Minor comments:
- Line 271: The manuscript references equation 15, but no such equation is present.
- Line 319: Please correct ‘TCVW’ to ‘TCWV’.
Citation: https://doi.org/10.5194/essd-2024-482-RC2 -
AC3: 'Reply on RC2', Judith Nijzink, 16 Jan 2026
The comment was uploaded in the form of a supplement: https://essd.copernicus.org/preprints/essd-2024-482/essd-2024-482-AC3-supplement.pdf
Data sets
CAMELS-LUX: Highly Resolved Hydro-Meteorological and Atmospheric Data for Physiographically Characterized Catchments around Luxembourg Judith Nijzink, Ralf Loritz, Laurent Gourdol, Davide Zoccatelli, Jean François Iffly, and Laurent Pfister https://doi.org/10.5281/zenodo.13846619
Viewed
| HTML | XML | Total | Supplement | BibTeX | EndNote | |
|---|---|---|---|---|---|---|
| 1,551 | 202 | 39 | 1,792 | 81 | 55 | 72 |
- HTML: 1,551
- PDF: 202
- XML: 39
- Total: 1,792
- Supplement: 81
- BibTeX: 55
- EndNote: 72
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Dear Authors,
I would like to point a small bug in the 15Min file for catchment ID_16. It contains duplicate rows/indices.
Thank you for presenting this data.