Articles | Volume 17, issue 1
https://doi.org/10.5194/essd-17-293-2025
https://doi.org/10.5194/essd-17-293-2025
Data description paper
 | 
30 Jan 2025
Data description paper |  | 30 Jan 2025

HERA: a high-resolution pan-European hydrological reanalysis (1951–2020)

Aloïs Tilloy, Dominik Paprotny, Stefania Grimaldi, Goncalo Gomes, Alessandra Bianchi, Stefan Lange, Hylke Beck, Cinzia Mazzetti, and Luc Feyen
Abstract

Since 1950, anthropogenic activities have altered the climate, land cover, soil properties, channel morphologies, and water management in the river basins of Europe. This has resulted in significant changes in hydrological conditions. The availability of consistent estimates of river flow at the global and continental levels is a necessity for assessing changes in the hydrological cycle. To overcome limitations posed by observations (incomplete records, inhomogeneous spatial coverage), we simulate river discharge for Europe for the period 1951–2020 using a state-of-the-art hydrological modelling approach. We use the new European set-up of the OS LISFLOOD model, running at 1 arcmin (≈1.8km) with 6-hourly time steps. The hydrological model is forced by climate reanalysis data (ERA5-Land) that are bias-corrected and downscaled to the model resolution with gridded weather observations. The model also incorporates 72 surface field maps representing catchment morphology, vegetation, soil properties, land use, water demand, lakes, and reservoirs. Inputs related to human activities are evolving through time to emulate societal changes. The resulting Hydrological European ReAnalysis (HERA) provides 6-hourly river discharge for 282 521 river pixels with an upstream area >100km2. We assess its skill using 2448 river gauging stations distributed across Europe. Overall, HERA delivers satisfying results (median KGE=0.55), despite a general underestimation of observed mean discharges (mean bias=-13.1%), and demonstrates a capacity to reproduce statistics of observed extreme flows. The performance of HERA increases through time and with catchment size, and it varies in space depending on reservoir influence and model calibration. The fine spatial and temporal resolution results in an enhanced performance compared to previous hydrological reanalysis based on OS LISFLOOD for small- to medium-scale catchments (100–10 000 km2). HERA is the first publicly available long-term, high-resolution hydrological reanalysis for Europe. Despite its limitations, HERA enables the analysis of hydrological dynamics related to extremes, human influences, and climate change at a continental scale while maintaining local relevance. It also creates the opportunity to study these dynamics in ungauged catchments across Europe. The HERA hydrological reanalysis and its climate and dynamic socio-economic inputs are available via the JRC data catalogue: https://doi.org/10.2905/a605a675-9444-4017-8b34-d66be5b18c95 (Tilloy et al., 2024).

1 Introduction

In the last century, Europe has experienced a growth in its population, economy, and urbanized area (Li et al., 2021; Paprotny and Mengel, 2023). Recent decades have also witnessed a rapid rise in global air temperature that is attributable to anthropogenic activities (IPCC, 2023). These evolving conditions have significantly changed flows in European streams and rivers (Barker et al., 2019; Gudmundsson et al., 2021; Vicente-Serrano et al., 2019; Wang et al., 2024), leading to multiple challenges for hydrological sciences related, for example, to long-term variability, climate change, extremes, or human alterations of the water cycle (Blöschl et al., 2019a). In order to assess the impacts of these changes, hydrologists need consistent, reliable, and long hydrological series. Observations, despite continuous improvements (Blöschl et al., 2019b; Ekolu et al., 2022), can hamper the analysis of pan-European long-term trends due to sparse spatial distributions in some regions and temporal discontinuities. One option for overcoming these limitations is to rely on a suite of models (climate, hydrological, and land use) to simulate past hydrological conditions and interpret changing dynamics in the hydrological cycle in connection with rapidly changing human systems (e.g. Richards and Gutierrez-Arellano, 2022). This article introduces the Hydrological European ReAnalysis (HERA) for the period 1951–2020, providing consistent estimates of river flow for European rivers at high spatial and temporal resolution.

Hydrological models are essential tools for understanding and characterizing processes related to the water cycle (e.g. flood and drought forecasting). In the past 3 decades, there have been efforts to develop models that are able to simulate hydrological processes at a large scale (continental to global scales). A myriad of these global hydrological models (GHMs), differing in their conceptualization, now exist (Beck et al., 2017; Sood and Smakhtin, 2015; Kauffeldt et al., 2016; Prudhomme et al., 2011). The nature of GHMs implies that they are usually run at a coarse spatial resolution (e.g. 0.5°), limiting their relevance for local and regional water resource problems (Sood and Smakhtin, 2015). Nonetheless, the development of GHMs has been fuelled by continuous improvements in remote sensing technologies and processing power (Yang et al., 2021). Remote sensing technologies provide high-resolution input for hydrological models, such as land use and vegetation properties. The advancements in computational capabilities have allowed us to refine the spatial and temporal scale of hydrological models, enabling a more accurate representation of surface and sub-surface processes and reducing modelling uncertainties (Wood et al., 2011). In this context, HERA falls within a global effort towards the development of hyper-resolution (1 km and below) land surface and hydrological models at the continental (Hoch et al., 2023; O'Neill et al., 2021) and global (Hanasaki et al., 2022) scales.

A key hindrance to simulating past river flows has been the availability of meteorological inputs for hydrological models. Of the potential inputs, climate reanalysis offers several advantages: temporal coverage (typically spanning several decades) and a large number of variables (e.g. precipitation, wind speed, or temperature) that are physically consistent with a homogeneous spatio-temporal resolution. The reanalysis data are outputs of climate models calibrated on observed data worldwide (Brönnimann et al., 2018). Here we use ERA5-Land, the land component of ERA5 (Muñoz-Sabater et al., 2021). One main advantage of ERA5-Land compared to ERA5 is its horizontal resolution, which is 9 km globally, compared to 31 km in ERA5. This enhanced resolution is obtained by downscaling meteorological variables from ERA5. The temporal resolution is hourly as in ERA5. Nonetheless, reanalysis data are obtained from short-term model forecasts and can be affected by forecast errors (Pfahl and Wernli, 2012). Variables produced in ERA5 are averages over grid cells. This averaging combined with the relatively coarse resolution of ERA5/ERA5-Land often smooths local extremes (Donat et al., 2014; Tilloy et al., 2022). To tackle this issue, we downscale and bias-correct ERA5-Land with a gridded observational dataset, EMO-1 (Thiemig et al., 2022) (Sect. 2.2).

In the context of the European Flood Awareness System (EFAS), an operational system for European flood monitoring and forecasting (https://www.efas.eu, last access: 2 December 2024), there have been recent efforts to develop more detailed surface fields (e.g. land use or vegetation) (Choulga et al., 2024) and observational climate inputs (Thiemig et al., 2022) at a spatial resolution of 1 arcmin (1, 0.0167°, typically 1.5–3 km2 over Europe). These developments have come with improvements in the OS LISFLOOD hydrological model underpinning EFAS. OS LISFLOOD is a spatially distributed grid-based hydrological and channel routing model which was initially developed for flood forecasting and flood risk assessment (Burek et al., 2013). However, it is also able to model effects of land use change, climate change, and river regulation measures and has been used in a wide range of hydrological applications, such as mapping populations under water stress in relation to how much water is reserved for the environment (Vanham et al., 2021) and projecting droughts in view of climate change (Cammalleri et al., 2020b). It is also used in the generation of the GLOFAS-ERA5 hydrological reanalysis (Harrigan et al., 2020).

Therefore, this article brings together improvements from diverse fields (i.e. remote sensing, climate modelling, machine learning, and hydrology) to generate a state-of-the-art hydrological reanalysis for a European domain that covers EU27 countries, the UK, Switzerland, Iceland, Norway, and the Balkan countries (Serbia, Montenegro, Bosnia-Herzegovina, Kosovo, North Macedonia, and Albania) over the past 70 years. HERA aims to reproduce as accurately as possible the evolution of the hydrological landscape of Europe by using the latest development of OS LISFLOOD (improvements in processing speed, spatial and temporal resolution, and calibration), which is also used in the generation of the latest EFAS v5.0 reanalysis (1991–2022) (Decremer et al., 2023) (Sect. 2.1). Climate inputs are derived from ERA5-Land and are bias-corrected and downscaled to 1 arcmin to improve the representation of extremes (Sect. 2.2). We generated dynamic socio-economic inputs (water demand, land use, and reservoir maps) to capture the effect of human activities on the water cycle (Sect. 2.3). These developments make this dataset the first publicly available long-term pan-European hydrological reanalysis that takes into account the evolving socio-economic conditions that have altered the hydrological cycle since 1951. In Sect. 3, we assess the performance of HERA against observations from 2448 river gauges in Europe.

2 Method

The modelling framework developed to generate the HERA dataset is presented in a flowchart in Fig. 1. The framework is organized around the OS LISFLOOD hydrological model that is used to simulate river discharge. For this run, we use calibrated parameters for the European setting of OS LISFLOOD developed by ECMWF in the context of the EFAS v5.0 calibration (CEMS-Flood online documentation, 2023). We first introduce OS LISFLOOD and its calibration procedure (Sect. 2.1). Figure 1 also displays the main input of OS LISFLOOD: high-resolution climate inputs (Sect. 2.2) and state-of-the-art static (Sect. 2.3.1) and dynamic socio-economic maps (Sect. 2.3).

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f01

Figure 1Flowchart of the framework employed in the generation of HERA. The numbers relate to the section in which each component of the framework is presented.

2.1 Hydrological modelling

2.1.1 The OS LISFLOOD model

Here, we simulate sub-daily continuous streamflow time series over Europe by means of the OS LISFLOOD model (Burek et al., 2013; Van Der Knijff et al., 2008). This is a spatially distributed, semi-physical rainfall-runoff model combined with a routing module for river channels (Dottori et al., 2022). The model has been developed by the Joint Research Centre (JRC) since the late 1990s and is used operationally for large-scale flood forecasting in EFAS and the Global Flood Awareness System (GLOFAS). OS LISFLOOD has also been used in drought monitoring (Cammalleri et al., 2020a, 2017) to assess the effect of flood adaptation measures, environmental flow protection, or climate change (Mentaschi et al., 2020; Vanham et al., 2022). Since 2019, the model has been open-source and available on GitHub along with a set of auxiliary tools (https://github.com/ec-jrc/lisflood-code, last access: 2 December 2024). OS LISFLOOD is composed of the following main components:

  • three soil layers (superficial, upper, lower) for water balance modelling;

  • sub-models for the simulation of groundwater and subsurface flow (using two parallel interconnected reservoirs);

  • a sub-model for the routing of surface runoff to the nearest river channel; and

  • a sub-model for the routing of channel flow.

Other processes, e.g. snowmelt, infiltration, rainfall interception, leaf drainage, evaporation and water uptake by vegetation, surface runoff, and exchange of soil moisture between soil layers, are also simulated by the model (European Commission, Joint Research Centre (JRC), 2025a). OS LISFLOOD is also able to model lakes and reservoirs.

In this work, we use the latest version of OS LISFLOOD (v4.1.2, January 2023), which includes upgrades compared to previous versions in the hydrological routines and improvements in the modelling of water abstraction for anthropogenic use. Moreover, OS LISFLOOD v4.1.2 benefits from improvements in the management of large inputs and computational performance. Figure 2 displays the domain for which data were retained in HERA. This comprises 42 European countries and excludes non-EU countries of the former Soviet Union, countries in northern Africa and the Middle East, and Türkiye, which are included in the EFAS domain. Moreover, HERA uses the same domain as the Historical Analysis of Natural Hazards in Europe (HANZE) database (Paprotny and Mengel, 2023; Paprotny et al., 2023). We run the model using the 1 grid in EFAS v5.0 (Decremer et al., 2023). The temporal resolution of the simulation is 6-hourly, which has been the standard for EFAS since 2020. Due to the size and spatial resolution of our domain, combined with the 6-hourly time steps, we divide the simulations into 71-year chunks based on calendar year starting on 3 January 1950. To estimate the initial model state, we performed a 71-year pre-run. In particular, we used the pre-run to initialize the soil and upper groundwater zone storages and to derive the average inflow into the lower zone and discharge, which represent the theoretical steady-state storage. Due to the rapidly evolving socio-economic conditions in catchments of Europe, we change the input socio-economic maps at the start of every new calendar year of the simulation (Sect. 2.4). This differs from the standard EFAS settings, which assume static land use and a reservoir network, and only varies the water demand values. At the start of every calendar year, the model is initialized with the state variables from the last time step of the previous year (warm start). As water volumes at the first time step in the channels are not known, the model sets a conventional initial volume (OS LISFLOOD uses half-bankfull), leading to unrealistic initial discharge in some catchments. We therefore removed the first simulation year (1950) from the final dataset. Further, we only retained simulations for river pixels with an upstream area greater than 100 km2, resulting in simulations in the 282 521 river pixels displayed in Fig. 2.

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f02

Figure 2River network (rivers with an upstream area >100km2) on which discharge data have been generated. The HERA domain (in which data are provided) is confined by the red-bordered area.

2.1.2 Model calibration

In this work, we also take advantage of the new EFAS v5.0 calibration that was completed in December 2022 by ECMWF. The calibration was performed using the EMO-1 meteorological dataset (Thiemig et al., 2022) over the period 1990–2021, with a focus on high flows. The modified Kling–Gupta efficiency (KGE; Gupta et al., 2009; Kling et al., 2012) was used as a skill metric. Discharge data at 1903 stations, identified through a selection process based on several criteria (CEMS-Flood online documentation, 2023), were used to calibrate the OS LISFLOOD model over Europe. Sub-daily data are always preferred when available (994 of the 1903 stations). For stations where only daily observations were available, the 6-hourly discharge simulations were first aggregated to daily steps (daily mean) before evaluating the objective function. The calibration was performed at the catchment level, with the 1903 selected stations covering 69.6 % of the HERA domain. A map showing the calibrated catchments is provided in Fig. S1 in the Supplement. The calibration was performed on 14 parameters that influence the modelling of snowmelt, water infiltration into the soil, surface water flow, groundwater flow, and lake and reservoir dynamics. A list of the calibration parameters is provided in Table S1 in the Supplement. Parameter values were identified using the Distributed Evolutionary Algorithm for Python (DEAP, Fortin et al., 2012) within a physically realistic range. The calibration protocol went from head catchments to downstream catchments in a top–down manner, prescribing physical dependencies between upstream and downstream catchments within the same basin.

Coastal and endorheic catchments with drainage areas smaller than 150 km2, representing 6.5 % of the HERA domain, are modelled with default parameter values. Parameter values for the other ungauged catchments were estimated by parameter regionalization. These catchments are mostly located near the coastlines, with a high concentration in southern Italy and Greece, and represent 23.9 % of the HERA domain. The parameter regionalization here consists of transferring parameter values (except for the ones linked to reservoirs and lakes) from a calibrated catchment to an ungauged catchment. Catchments are matched according to climatic and geographical similarities (Beck et al., 2016). We discuss the impact of calibration on the skill of HERA in Sect. 3.1.1. For more information on the calibration of EFAS v5.0, we refer the reader to the online documentation of the Copernicus Emergency Management Service for floods (CEMS-Flood online documentation, 2023).

2.2 Climate inputs: bias-adjusted climate reanalysis data

To force the OS LISFLOOD hydrological model, we used a bias-adjusted and downscaled climate dataset based on the ERA5-Land climate reanalysis (Muñoz-Sabater et al., 2021). The main steps involved in the preparation of the climate inputs are summarized in Fig. 3. The following variables are retrieved from ERA5-Land at an hourly temporal resolution for 1950–2020:

  • total precipitation (tp);

  • mean temperature (ta);

  • mean zonal and meridional wind speeds (u,v);

  • mean dew point temperature (td); and

  • total surface solar radiation downwards (ssrd).

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f03

Figure 3Climate input pre-processing scheme, including temporal aggregation, bias adjustment, statistical downscaling, and processing of evapotranspiration.

Download

Precipitation and temperature data were aggregated to 6-hourly resolution and the other variables to daily resolution (Fig. 3). All the variables were averaged, except for precipitation, which was summed to reach the target temporal resolution. Minimum and maximum daily temperatures were also calculated, while dew point temperature was converted into relative humidity and actual vapour pressure.

Our setting of OS LISFLOOD requires meteorological data with a 1 resolution. To downscale ERA5-Land data from 0.1°=6 to 1, we performed statistical downscaling and bias adjustment using ISIMIP3BASD v3.0.0 (Lange, 2019; Lange et al., 2024; Frieler et al., 2024). The ISIMIP3BASD method was initially developed for phase 3 of the Inter-Sectoral Impact Model Intercomparison Project (ISIMIP) and aims to provide robust bias adjustment of extreme values, preservation of trends across quantiles, and a clearer separation of bias adjustment and statistical downscaling compared to its predecessors (Lange, 2019). We used the new EMO-1 gridded observational dataset (1 version of EMO-5; Thiemig et al., 2022) developed for the operational EFAS v5.0 as the high-resolution reference dataset. EMO-1 covers the period 1990–2020 and has also been used directly as a climate input in the calibration (Sect. 2.1.2). We used 1990–2020 as the training period for the algorithm since both datasets overlap for this period. The trained algorithm is then applied to ERA5-Land to produce high-resolution data for both the training period and 1950–1989, when high-resolution data comparable to EMO-1 are not available. The resulting climate data consistently cover 1950–2020. The ISIMIP3BASD method is applied to the following variables:

  • daily mean near-surface relative humidity (hurs), obtained from actual vapour pressure (vp);

  • daily and 6-hourly total precipitation (pr);

  • daily total surface downwelling short-wave radiation (rsds);

  • daily mean near-surface wind speed (ws);

  • daily and 6-hourly mean near-surface air temperature (tas);

  • diurnal near-surface air temperature range (tasrange=tasmax-tasmin); and

  • diurnal near-surface air temperature skewness (tasskew=(tas-tasmin)/tasrange).

Here, tasmin and tasmax are the daily near-surface air temperature minimum and maximum, respectively.

Version 3.0.0 of ISIMIP3BASD differs technically from version 2.5.0, which was used to produce the climate forcing data for phase 3b of the Inter-Sectoral Impact Model Intercomparison Project (ISIMIP3b; Frieler et al., 2024), yet both versions produce the same results, and we apply version 3.0.0 using the same climate-variable-specific parameter settings as for the ISIMIP3b data production (Lange et al., 2024; Frieler et al., 2024). ISIMIP3BASD was designed for daily data, but it is applied here to bias-adjust and statistically downscale sub-daily (6-hourly pr and tas) data as if these are daily values. For the bias adjustment, a parametric trend-preserving quantile mapping method was applied to pr, sfcwind, tas, and tasrange, while non-parametric quantile mapping was applied to hurs, rsds, and tasskew. The bias adjustment was done at the spatial resolution of ERA5-Land, 6, using spatially aggregated EMO-1 data (spatial averaging). Data resulting from the bias-adjustment were then statistically downscaled to 1 spatial resolution by using an algorithm based on the multivariate bias correction (MBCn) bias-adjustment method (Cannon, 2018) (Fig. 3). The downscaling method is conservative in the sense that the 1 output data would be identical to the 6 input data in case the former are spatially aggregated back to 6 resolution.

Finally, potential evapotranspiration (et0), potential open-water evapotranspiration (e0), and potential bare soil evapotranspiration (es0) are computed with bias-adjusted and downscaled data at the pixel level using an approach based on the Penman–Monteith equation with the LISVAP model (LISVAP online documentation, 2023).

2.3 Surface field maps

OS LISFLOOD requires a set of surface field maps. Depending on the model set-up, it can ingest up to 108 surface fields divided in six categories:

  • (i)

    catchment morphology and river networks;

  • (ii)

    vegetation cover types and properties;

  • (iii)

    soil properties;

  • (iv)

    land use;

  • (v)

    water demand; and

  • (vi)

    lake and reservoir information.

The first three categories, hereafter referred to as static maps, were taken directly from the CEMS_SurfaceFields_2022 open-source dataset of the Copernicus Emergency Management Service developed for the European domain at 1 arcmin resolution, which can be found in the JRC Data Catalogue (Choulga et al., 2024). The last three categories were derived from CEMS_SurfaceFields_2022 and modified to take into account socio-economic changes (hereafter referred to as dynamic socio-economic maps). This section briefly presents each of the map categories, with an emphasis on dynamic socio-economic maps, which are original to this work.

2.3.1 Static maps

Static maps include surface fields of morphology and channel shapes (14 maps), vegetation properties (18 maps), and soil properties (29 maps).

Morphology and river network information were used directly for the computation of snowmelt, temperature scaling, river routing, and open-water evapotranspiration. Morphological information was derived from elevation and includes the elevation gradient, the within-grid standard deviation of elevation, and Manning's roughness coefficient. Maps representing channel shapes and river networks provide information on the grid cell area (which varies with latitude as the grid projection is WGS84), local drainage direction, upstream area, and channel dimensions. All the morphologies and river network maps were derived from Multi-Error-Removed Improved-Terrain Digital Elevation Model v.1.0.3 (MERIT DEM) (Yamazaki et al., 2019) and Catchment-based Macro-scale Floodplain (CaMa-Flood) Global River Hydrodynamics Model v4.0 maps (Yamazaki, 2023).

Vegetation cover types and property maps are involved in the computation of precipitation interception, evaporation, transpiration, surface runoff, and root water uptake. These properties are described by four variables: crop coefficients (transpiration), crop groups (water uptake), Manning roughness (surface runoff), and leaf area index (interception and evaporation). Each of these variables was mapped for three different land cover types: forest, irrigated, and other. Further, maps of planting and harvesting days for rice, which has specific water demands, are also available. Vegetation properties were derived from several data sources, including the Copernicus Global Land Service (CGLS) leaf area index (LAI) at 1 km (Copernicus, 2021), the Spatial Production Allocation Model (SPAM) Global Spatially-Disaggregated Crop Production Statistics Data for 2010 (Yu et al., 2020; International Food Policy Research Institute, 2019), and the Food and Agriculture Organization (FAO) of the United Nations Irrigation and Drainage Paper No. 56 (Allen et al., 1998).

Soil properties refer to physical characteristics of the soil and aim to describe the water dynamics through a vertical soil profile. In OS LISFLOOD, the soil profile is composed of three layers: superficial (0–5 cm), upper (5–varying (30–50) cm), and lower soil layers. For each layer, variables representing soil hydraulic properties (e.g. soil moisture content or pore size index) are provided. Similarly to vegetation property maps, variables were mapped for two categories of land cover, “forest” and “other”. Soil properties were derived from the International Soil Reference and Information Centre (ISRIC) global gridded SoilGrids dataset (release 2017) available at 250 m (Hengl et al., 2014), which is based on more than 150 000 observation sites and covariate data.

A table summarizing all the static and dynamic surface field maps used to produce HERA is provided in Table S2 in the Supplement. For more details on these surface field maps, their production, and the input datasets used, we refer the reader to Choulga et al. (2024).

2.3.2 Dynamic land use

OS LISFLOOD includes six land use classes as inputs: rice, other irrigated land, forest, sealed surfaces, open water, and other (non-irrigated agriculture, non-forest natural, pervious artificial); these land use classes are mostly based on the CLC-Refined 2006 dataset of Batista e Silva et al. (2013) in the default setting. Of the hydrological processes, interception, evapotranspiration, infiltration, and surface runoff respond differently to each land use type. With the aim of better representing complex rainfall-runoff processes, OS LISFLOOD accounts for the sub-grid variability in land use. Therefore, the spatial distribution of each land use class is defined as a percentage of the whole represented area of a given pixel (European Commission, Joint Research Centre (JRC), 2025a). The magnitude of the variation of hydrological responses is tied to the magnitudes of the changes in land cover. De Roo et al. (2001), for instance, investigated the effects of land use changes on floods in two European catchments and identified different results depending on the magnitude of the land cover change. While such changes tend to have a limited impact on river discharge, they can locally increase flood magnitude (Merz et al., 2021; Sajikumar and Remya, 2015; Van Lanen et al., 2013; Van Loon, 2015). Here we modified the grid cell fractions of each land use class using HANZE-Exposure land use maps at 100 m resolution (Paprotny and Mengel, 2023) for 42 countries in the study area. In the remaining part of the domain, we used coarser, 5 resolution maps from HYDE 3.2 (Klein Goldewijk et al., 2017) to modify the 2006 values. The temporal evolution of the land area of each class is displayed in Fig. 5a. There has been a strong increase in sealed surfaces (+40%), while for the other relevant land use classes the changes are less than 10 %, with more land occupied by irrigated agriculture (except rice), water surfaces (due to reservoir construction), and forests.

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f04

Figure 4Variation in socio-economic inputs in the hydrological model, averaged over the entire EFAS domain: (a) land area by use category, 1950=100; (b) number of existing reservoirs; (c) water demand by sector in millimetres per grid cell per year; and (d) shares of land use between the different classes in 2020.

Download

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f05

Figure 5HERA hydrological skill for the 2448 selected stations in terms of (a) KGE and its three components: (b) Pearson correlation, (c) bias ratio, and (d) variability ratio. In panel (a), the green dashed vertical line represents the benchmark KGE value (−0.41). The red vertical line represents the ideal values and the blue dot represents the median for all the stations.

Download

2.3.3 Dynamic water abstraction

Human water use, representing water withdrawal from the natural environment (e.g. rivers, reservoirs, or groundwater) for human needs, is grouped into four main sectors: livestock, domestic, manufacturing industry, and energy production. In OS LISFLOOD, water use is supplied by surface water bodies and groundwater, depending on the sector (Choulga et al., 2024). A considerable increase in water abstraction in a region can diminish surface water resources within the same area. The model also accounts for groundwater abstraction for human use, except for flooded irrigation and cooling processes. Increased groundwater abstraction can locally reduce (or halt) baseflow. To derive monthly historic sectoral water withdrawal maps, we followed the methodology of Huang et al. (2018) and used the FAO AQUASTAT sectoral water withdrawal data (Food and Agriculture Organisation, 2023) as a starting point. These data were subsequently spatially and temporally disaggregated using a variety of datasets. These include the Global Human Settlement Layer (Schiavina et al., 2019; Florczyk et al., 2019) for population estimates, the Global Change Analysis Model (GCAM; Calvin et al., 2019) for regional water withdrawal and electricity consumption, and the Gridded Livestock of the World (GLW; Gilbert et al., 2018) for livestock distribution. Additional datasets included Multi-Source Weather (MSWX; Beck et al., 2022) for air temperature data, United States Geological Survey (USGS) water withdrawal estimates, and Vassolo and Döll (2005) industrial and thermoelectric withdrawal maps. More information on the water demand and input datasets used is provided in Choulga et al. (2024).

We extrapolated the water withdrawal maps to the period 1950–1978 using annual gridded 0.5° data from ISIMIP 3a (Frieler et al., 2024; Wada et al., 2016) that were downscaled to 1 resolution using historical population data from HANZE (Paprotny and Mengel, 2023) and HYDE 3.2 (Klein Goldewijk et al., 2017) for the other parts of the domain. More precisely, the ratio between EFAS high-resolution water demand maps and the ISIMIP 3a dataset for 1979 was used to adjust the water withdrawal data in each grid cell. Intra-annual (monthly) cycling of water use in the energy and domestic sectors was estimated for 1950–1978 using the same approach as for 1979–2020, informed by temperature data from our input meteorological dataset (Sect. 2.3.1). Livestock water use was assumed to be constant before 1979. Water demand and use for irrigation were computed directly by the hydrological model based on land use data and available water. The evolution of water use by sectors between 1950 and 2020 is displayed in Fig. 5c and Table S4 in the Supplement. The total water use peaked in 1990 after more than doubling since the 1950s and before declining due to a drop in demand from the manufacturing and energy sectors. Nonetheless, there are usually much stronger trends at the country or catchment levels.

2.3.4 Dynamic reservoir maps

Reservoir maps contain locations and identifiers of reservoirs and are linked to tables containing metadata on storage capacity, construction year, and a set of values associated with reservoir operation rules. Normal reservoir outflow rates were further adjusted through the model calibration (Sect. 2.1.2). The year of construction for each reservoir was taken from the EFAS reservoir database HANZE (Paprotny and Mengel, 2023), the Global Reservoir and Dam Database (GRanD) v1.3 (Lehner et al., 2011), or additional manual research for reservoirs not covered by the three datasets. The reservoir maps are updated every simulation year (1 January) by adding newly built reservoirs. When a reservoir is added, it is considered empty and fills up according to its associated metadata. Figure 5b shows the evolution of the number of reservoirs in Europe during the period 1950–2020. The number of reservoirs in the model increased 6-fold from 244 in 1950 to 1419 in 2020, though few were built after the late 1980s.

3 Results

3.1 Technical validation

We evaluated our hydrological reanalysis by comparison against a dataset of daily river discharge observations from 3442 stations across Europe. Of the data obtained, 60 % were from the Global Runoff Data Centre (GRDC) and 40 % from national public datasets of France, Norway, Poland, Spain, Sweden, and the UK. Furthermore, this dataset was compiled independently of the one used in the EFAS calibration (Sect. 2.1.2). The stations' record durations vary between 1 and 71 years. The selection of the stations used for validation is based on several criteria:

  • Spatial matching. To link stations to their corresponding river pixel, we scanned the nine modelled pixels around the river gauge location. When information on the upstream area was available (for 60 % of the stations), we retained the pixel with the upstream area closest to the reported one. For pixels without information on the upstream area, we retained the one with the simulated mean discharge (Qmean) closest to the observed one. For more accurate spatial matching, we used the available OS LISFLOOD coordinates from the EFAS calibration (1026 stations). A total of 546 stations did not match OS LISFLOOD river pixels, mostly due to their upstream area being lower than 100 km2.

  • Upstream area verification. The spatial matching selected the closest upstream area for stations where we have information on catchment area. It is however possible that the reported catchment differs greatly from its matched pixel upstream area. We removed stations where the difference between the pixel and observed upstream areas was larger than 50 % (51 stations).

  • Mean discharge comparison. For some stations, the ratio between the observed and simulated Qmean was suspicious. This could be due to an erroneous spatial match (i.e. matching of a river with a station on a tributary). As uncertainty grows with smaller streams, we decided to remove those with a suspicious Qmean ratio (rQmean>6 or rQmean>3 if Qmean,obs>10m3 s−1) (49 stations).

  • Manual check. A manual verification was performed on 66 stations with KGE<-0.41. Each station and its matching pixel were checked individually, resulting in the removal of 13 more stations due to wrong spatial matching, erroneous station location, and doubtful observations. The corresponding river pixel was set manually for eight stations. Manually checked stations and the reason for their exclusion or inclusion are provided in Table S5 in the Supplement.

  • Finally, we removed stations with a record length shorter than 30 years (334 stations). This enabled a meaningful comparison between different locations in the validation process.

This procedure resulted in the selection of 2448 river stations across Europe, with an upstream area ranging from 100 to 785 421 km2. Of these stations, more than half (1507) have an upstream area of less than 1000 km2, and one-fifth (498) have an upstream area of less than 200 km2.

HERA comes at a sub-daily resolution (6-hourly), but the performance could only be evaluated at the daily time step of the observational dataset. Discharge data from HERA were therefore aggregated (daily mean) for the technical validation. We expect the performance to be slightly higher at the daily scale, as the temporal aggregation tends to increase the correlation between observed and modelled discharge. The performance was assessed using the KGE on discharge data (Gupta et al., 2009; Kling et al., 2012). KGE was used as the standard performance metric in EFAS and GLOFAS (Harrigan et al., 2020; Cammalleri et al., 2020a) as well as in other hydrological model assessments (Lin et al., 2019; Harrigan et al., 2020; Beck et al., 2017) and has three components: correlation, bias errors, and variability errors:

(1)KGE=1-(r-1)2+(β-1)2+(γ-1)2,(2)β=μsμo,(3)γ=σsμsσoμo,

where r is the Pearson correlation coefficient between the simulated (s) and observed (o) flow, β is the bias ratio, γ is the variability ratio, μ is the mean discharge, and σ is the discharge standard deviation. KGE and its three components are dimensionless, with an optimal value of 1. It is important to note here that KGE values should not be interpreted like the more traditional Nash–Sutcliffe efficiency (NSE; Nash and Sutcliffe, 1970). Indeed, for KGE the mean flow benchmark has a value of KGE=1-2=-0.41. Any value above −0.41 therefore exceeds the benchmark (Knoben et al., 2019), meaning that the model performs better than simply taking the mean.

In Sect. 3.1.1, we assessed the model performance across space, time (1951–2020), and catchment size in order to identify the strengths and weaknesses of HERA. Despite covering many aspects of the performance of hydrological models, KGE mainly focuses on mean values and gives a higher weight to high extremes compared to low ones. As this dataset also aims for use in long-term analysis of hydrological extremes, we evaluated how well high and low extremes are reproduced, including their timing and seasonality.

3.1.1 Hydrological performance

Here we quantified the overall performance of HERA in terms of KGE as well as the decomposition of this indicator into its three components: correlation, bias, and variability. Figure 5 displays the distribution of KGE and its three components across the 2448 validation stations. We obtained KGE>-0.41 for 2411 (98.5 %) of them, meaning that the reanalysis is skilful for these stations (Fig. 5a). The median KGE across all the catchments is 0.55, while the mean is 0.46, although this value varies widely across the catchments (Figs. 5a and 6a). The mean correlation value is relatively high (r¯=0.69), with 90 % of the stations having r>0.5 (Fig. 5b). From Fig. 5c and d, we can observe that there is a tendency to slightly underestimate flows (β¯=-13.1%) and flow variability (γ¯=-14.2%). The bias ranges between 0.8 and 1.2 (0.5–1.5) in 50 % (91 %) of the river gauges, which is considered very good for hydrological reanalysis (Harrigan et al., 2020; Alfieri et al., 2020; Lin et al., 2019; Yang et al., 2021).

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f06

Figure 6KGE and its three components: (b) Pearson correlation, (c), bias ratio, and (d) variability ratio at the 2448 river gauges considered in the validation of HERA. Point sizes are proportional to catchment sizes.

Figure 6 shows the spatial performance of the model in terms of KGE and its components. The highest skill can be observed in central and north-western Europe. The vast majority of stations in the UK, Germany, France, Austria, and Switzerland (which together account for 51 % of all 2448 stations) exhibit a good (>0.5) to very good (>0.75) KGE. On the other hand, performance is relatively poor in Spain, Cyprus, Scandinavia, and northern Poland. Factors that can explain the poor performances in southern Europe include the combination of arid climates and the strong influence of lakes and reservoirs (Fig. 7c). Dry catchments where precipitation events are separated by long dry spells are in general very difficult to model (Cantoni et al., 2022). In Scandinavia, the negative bias (Fig. 6c) could be linked to an underestimation of precipitation and snowmelt in Scandinavian mountains (Beck et al., 2017, 2020). Figure 6d presents the variability ratio of simulated to observed flow. Overall, our reanalysis exhibits lower variability than observations, with 83 % of the catchments having a variability ratio below 1. The underestimation of variability was also found in the EFAS v5.0 run, although it is more pronounced in HERA. This could be explained by the different meteorological forcings used in the two runs.

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f07

Figure 7Boxplot of the HERA KGE according to different classifications of the 2448 river stations used in the validation: (a) time, (b) catchment area, and (c) reservoir impacts. The numbers inside the boxplot represent the number of river gauges for each category, while the colours of the boxplot represent the median performance of the group from low (light blue) to high (dark blue).

Download

We validate HERA on stations with a wide range of catchment areas (mean upstream area of 7615 km2), which has an impact on OS LISFLOOD performance (Harrigan et al., 2020). The set of 2448 validation stations includes stations that were used in the calibration process (596) as well as stations in uncalibrated catchments (36) (Fig. S1). In Fig. 7, we break down the performance of the reanalysis according to the different attributes of each catchment: time (Fig. 7a), catchment area (Fig. 7b), reservoir impact (Fig. 7c), and calibration status (Fig. 7d).

Overall, the skill of HERA shows a slight increase through time of 21 % in KGEmed between the 1950s and the 2010s. The skill increases between 1951 and 1980 and then stabilizes from 1981 to 2020, though the results are influenced by changes in the gauge data availability over time. This could also be driven by improved climate inputs. Figure 7b shows that the model skill increases with catchment size, from KGEmed values of 0.44 (inter-quartile range IQR 0.25–0.59) for the 498 smallest catchments (<200km2) to 0.77 (IQR 0.68–0.84) for the 28 largest catchments (>100 000 km2). Such patterns have already been observed at global scales (Harrigan et al., 2020). It is important to note here that the majority of stations used in this validation (62 %) have an upstream area below 1000 km2, and the median upstream area of the 2448 stations is 583 km2. This is half of the median upstream area of the 1903 stations used in the calibration of EFAS v5.0 (CEMS-Flood online documentation, 2023).

We also divided stations according to reservoir influence. From the 1420 reservoirs active in 2020 (which represent the maximum amount over the considered time window), we estimated the impact of reservoirs on streamflow at the grid cell level. This was done by computing the ratio (c; –) of reservoir volume to mean discharge (Nilsson et al., 2005) at every grid cell. The ratio has been computed with the accuflux function from PCRaster and compares the upstream cumulative reservoir capacity (m3) and the cell-specific annual volume of annual streamflow (m3) (Zajac et al., 2017). This ratio varies between 0 and 1608 downstream of Embalse de Finisterre in central Spain. Most of the river grid cells highly impacted by reservoirs are found in southern Europe, particularly in Spain and Bulgaria. Figure 7c highlights the influence of reservoirs on the skill of the reanalysis. The river cells affected (medium and high, c>0.5) only represent 6 % of the stations and grid cells in the domain (Fig. 2). The median skill is lowest for highly impacted (c>1) stations with KGEmed=0.24, whereas minimally impacted stations have a KGEmed of 0.55. This highlights the difficulty of large-scale hydrological models such as OS LISFLOOD in accurately simulating reservoir outflows (Zajac et al., 2017).

Finally, we investigated the influence of calibration on model skill. In Fig. 7d, river gauges are divided into four groups according to their calibration status. As displayed in Fig. S1, 83 % of the stations considered in the validation fall into the domain calibrated for EFAS v5.0. We find better performance for calibrated stations (KGEmed=0.64) and comparable skill for stations within the calibrated domain (KGEmed=0.52) and stations benefitting from the parameter regionalization (KGEmed=0.47). The performance is much worse for catchments with default parameters, which here are limited to small (<150km2) coastal and endorheic catchments.

3.1.2 Reproduction of extremes

Large-scale hydrological models forced by climate reanalysis often fail to reproduce extreme hydrological event characteristics, in part due to the coarse spatial and temporal resolution (Brunner et al., 2021; McClean et al., 2023). Here, we analyse how well HERA reproduces different flow quantiles (q05, median, and q95) through the Pearson correlation coefficient and the coefficient of determination (R2) (Fig. 8) for the 2448 considered catchments. The ability to capture annual maxima and minima as well as their seasonality is also assessed (Fig. 9).

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f08

Figure 8Scatterplot of observed and simulated river flow quantiles (m3 s−1): (a) 5th quantile, (b) median (q50), and (c) 95th quantile (q95) for the 2448 river gauges.

Download

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f09

Figure 9Assessment of the ability of HERA to reproduce the timing of annual maximum and minimum flows. (a) Violin plot of errors in the mean day of occurrence of annual maxima (daily discharge) and minima (30 d averaged discharge) computed with circular statistics. Inside each violin plot, boxplots display the median and 1st and 3rd quantiles. (b) Difference between the modelled and observed mean annual maximum dates (a positive value means a later occurrence in HERA). (c) Difference between the modelled and observed mean annual minimum dates (a positive value means a later occurrence in HERA).

Figure 8 displays scatterplots of observed and simulated quantiles. Each point represents 1 of the 2448 stations. We observe that low (5th quantile: Q05) and median (Q50) flows are generally well represented with R2>0.99 (Fig. 8a and b), especially for higher discharge values. However, despite this generally good agreement, there is a more pronounced deviation of simulated values from observations for lower flow values, expressed by a higher dispersion for Q05. These deviations can be attributed to bias in climate inputs (McClean et al., 2023), the hydrological model (Feyen and Dankers, 2009), errors in flow measurements (especially for Q05) (Despax, 2016; Tomkins, 2014), and anthropogenic impacts on low- and median-flow regimes (Brunner, 2021) that are not accurately represented in the model (Fig. 8c). The number of stations with large deviations in the reproduction of high-flow statistics (Q95) is minor compared to Q05 and Q50. Nonetheless, despite a relatively high R2 (0.99), there is a general underestimation in the simulated values (Fig. 8c), which is common for large-scale hydrological models. Similarly to low and median flows, errors in high-flow statistics can be due to biases and smoothing of extremes in climate inputs and errors in the hydrological modelling. Uncertainty associated with flow measurements also plays a major role in high flows, as river discharges are usually not measured directly during floods (Despax, 2016). Finally, the spatial and temporal resolution of the model can affect its ability to reproduce high flows, particularly for flash floods in small catchments.

We also assessed the ability of the reanalysis to reproduce the timing of annual maxima and minima of discharge as well as their overall seasonality. As the daily temporal scale is not the most relevant factor when it comes to drought analysis with discharge data (Hannaford and Marsh, 2006; Kohn et al., 2019), annual minima were computed from 30 d moving average flows. Figure 9a displays the mismatch in the mean day of occurrence computed with circular statistics following Berghuijs et al. (2019). We observe that the median error in the mean day is very close to 0 for both maxima (median=0.1,IQR=-12–18) and minima (median=-1,IQR=-28–41) but with a much higher dispersion for annual minima compared to annual maxima. The higher dispersion for low flows is due to the slow-onset nature of these events (Brunner, 2021). Figure 9b shows the difference in timing between simulated and observed annual maxima across the 2448 considered stations. Differences in timing are smaller over the Atlantic coast, though a particularly high lag (simulated maxima delayed by 30 d or more in HERA) is observed over Poland and central Spain. For low flows (Fig. 9c), delays in central Europe are more than 30 d, while in Scandinavia the timing can be up to several months too early. This can be explained by the high hybridity of river regimes (several high- and low-flow seasons) in these regions, which may be captured with varying accuracy in HERA.

In addition to the validation protocol presented in this section, we compared the reported performances of HERA with other recent hydrological datasets and carried out a comparison between HERA and another recent hydrological simulation done with the grid-based conceptual mesoscale Hydrological Model (mHM) (Kumar et al., 2013; Samaniego et al., 2010, 2019; Thober et al., 2019) for Europe for the period 1960–2010. More details on the comparison are provided in Figs. S3–S6 in the Supplement.

3.2 Usage notes

HERA brings together several improvements (climate, scale, and socio-economic dynamics) to better simulate river discharge in catchments of Europe over the past 70 years. Despite still covering a relatively short period of time compared to human history on Earth, these 70 years capture a very intense period of climate and socio-economic change, often called the Anthropocene, and offers multiple research opportunities:

  • assessment of long-term trends in European river regimes;

  • provision of benchmark data for “data-poor” areas;

  • generation of catalogues of flood and drought events;

  • identification of spatial and temporal correlations between European catchments;

  • identification of changes in hydrological extreme characteristics (frequency, magnitude, and timing);

  • combination with other data products for compound hazard analysis; and

  • provision of scenarios for flood inundation simulations.

In this section, we briefly present possible usage of the data, addressing changes in regime for diverse rivers across Europe (Fig. 10).

https://essd.copernicus.org/articles/17/293/2025/essd-17-293-2025-f10

Figure 10Changes in flow regime between 1951–1981 (blue) and 1990–2020 (red) for six diverse European rivers: (a) Ardèche, (b) Schelde, (c) Ebro, (d) Rhône, (e) Danube, and (f) Vistula. The regime is computed here as the 30 d moving average. Shaded coloured areas represent the IQR of discharge for every day of the year. The grey-shaded area represents the absolute difference between the two regimes corresponding to different periods.

Download

Figure 10 displays hydrological regimes, represented here as the mean of a 30 d average moving window over a given period, for six European rivers. These rivers differ in terms of hydrological regimes, with three main regimes represented:

  • a Mediterranean pluvial regime for the Ardèche (a), with its recognizable high flows in autumn;

  • a pluvial or oceanic regime for the Schelde in Ghent (b) and the Ebro in Zaragoza (c); and

  • a nival regime for the upper Rhône in Lyon (d), the Danube in Vienna (e), and the Vistula in Warsaw (f).

These six rivers also vary in terms of catchment area, geographical location (France, Austria, Poland, Belgium, or Spain), climate (Mediterranean, continental, oceanic, or alpine), and geomorphological conditions. For each river, the flow regimes for 1951–1981 (in blue, first 30 years of HERA) and 1990–2020 (in red, last 30 years of HERA) are shown. By comparing the two regimes, one can observe diverging patterns of changes between these rivers. For the two pluvial rivers, the Schelde and the Ebro (Fig. 10b and c), we observe opposite patterns of change: the Schelde saw an increase in its average discharge throughout the year, while the Ebro experienced a downward shift in its regime. For the upper Rhône and Danube (Fig. 10d and e), which are influenced by snowmelt in their upper catchments, we see lower and earlier flow peaks in spring and summer. The Vistula (Fig. 10f) saw an overall increase in flow throughout the year. Finally, the Ardèche (Fig. 10a) has seen reduced flow throughout the year, with a notable decrease in late winter which can be associated with the reduction in snowfall in the Massif Central, where the Ardèche has its upper waters (François et al., 2023). The timing of the autumn peak seems to have shifted slightly towards earlier dates, in agreement with a recent study on trends in Mediterranean floods (Tramblay et al., 2023).

4 Discussion

Recent developments in diverse fields, including climate, hydrology, remote sensing, and computational sciences, have made the generation of high-resolution reanalysis products possible (Aerts et al., 2022; Hanasaki et al., 2022; Hoch et al., 2023). In this context, HERA brings discharge data for all European rivers with upstream areas larger than 100 km2 for the period 1951–2020. With its refined spatial and temporal resolution, HERA represents hydrological processes in Europe with more detail than previous publicly available hydrological reanalysis products (Harrigan et al., 2020; Schellekens et al., 2017). Calibrating hydrological models can significantly improve river flow simulation (Beck et al., 2017; Kauffeldt et al., 2016). Parameters in 93.5 % of the HERA domain were adjusted with a calibration process (Sect. 2.1.2) or by parameter regionalization (Beck et al., 2016). This is a very high calibration coverage for a GHM (Beck et al., 2017), which can be explained by the relatively high coverage in river gauging stations in Europe.

Table 1Description of the HERA dataset.

Download Print Version | Download XLSX

Table 2List of inputs and outputs of OS LISFLOOD provided in the HERA database (link here).

Download Print Version | Download XLSX

It is difficult to compare HERA with other recent hydrological reanalyses such as GLOFAS-ERA5 (Harrigan et al., 2020) and GRFR (Yang et al., 2021), for several reasons: (i) spatial coverage (global vs. continental), (ii) spatial resolution (0.25°, 0.05°, 0.0167°), (iii) temporal coverage, and (iv) dynamic vs. static socio-economic conditions. However, we provide a short summary of the reported performances of HERA, GLOFAS-ERA5, GRFR, and a European-scale hydrological simulation with the mHM model (EUmHM) in Table S6 in the Supplement. While the reported performances of HERA are higher than its global counterparts, they are very close to the performance of EUmHM. In a more detailed comparison with EUmHM over 515 European river gauges (Figs. S3–S6), we show that HERA generally outperforms the EUmHM run in terms of KGE (Fig. S4), but both models exhibit strengths and weaknesses, spatially (Fig. S5) and in terms of the components of KGE (Fig. S6). Differences in performances between the HERA and EUmHM runs can be attributed to the many different features in the two runs, such as meteorological forcing, resolution, calibration, and flow routing within the hydrological model. Conversely, HERA shares a great number of features with the EFAS v5.0 reanalysis (Decremer et al., 2023), with a slightly lower performance (not shown here). Nonetheless, EFAS v5.0 only covers the period 1990–2022 and assumes static socio-economic conditions (land use, water abstraction, and reservoirs).

Similarly to the other aforementioned hydrological reanalyses, HERA exhibits reduced performance in cold and semi-arid catchments. This could be related to deficiencies in the representation of snow processes within OS LISFLOOD or the underestimation of precipitation at northern latitudes (Beck et al., 2017, 2020). Semi-arid environments are notoriously challenging areas for hydrological models due to the highly non-linear rainfall-runoff response and lower precipitation data quality (Cantoni et al., 2022). GHMs tend to represent runoff poorly in small- to medium-sized catchments (10–10 000 km2) (Harrigan et al., 2020; Sood and Smakhtin, 2015), and nearly 90 % of the catchments used in the validation of HERA (Sect. 3.1) are small- to medium-sized catchments. The drop in performance with smaller catchment areas in HERA remains, however, moderate compared to that of the GLOFAS-ERA5 global hydrological reanalysis (Harrigan et al., 2020). The presence of reservoirs also influences the performances of the reanalyses. While including reservoirs in the hydrological modelling has a positive impact on model performance (Zajac et al., 2017), there is still a high level of uncertainty regarding the operating rules of each reservoir. Moreover, the 1422 reservoirs used to generate HERA most likely represent just fractions, mainly the largest ones, of all the operational reservoirs in the modelled domain (Speckhann et al., 2021). In summary, the main strength of HERA lies in its relatively low bias in comparison to the other hydrological datasets considered here (Table S6 and Fig. S6), while its performances are hampered by its underestimation of variability.

HERA is generated through hydrological modelling, which brings a range of uncertainties that can be divided into four categories: (i) model inputs, (ii) model structure, (iii) parameter values, and (iv) observations. It remains challenging to quantify these uncertainties. However, the quality of the inputs and in particular the climate inputs is often an important factor in uncertainty (Beck et al., 2017; Sood and Smakhtin, 2015). Despite efforts in bias correction and downscaling of the climate input, it seems that, on average, HERA slightly underestimates river discharges, with a more pronounced bias for high flows. As reported in other studies, negative biases can be related to an underestimation of precipitation in the climate inputs, in particular for extreme events (McClean et al., 2023; Mahto and Mishra, 2019), at high latitudes, and in (semi-)arid catchments (Beck et al., 2016; Sood and Smakhtin, 2015; Hirpa et al., 2018). Model structure can also play an important role, as shown in Fig. S6, where EUmHM is the best model in terms of correlation, while HERA exhibits a smaller bias ratio. This can be the result of different choices made in the main equation behind the two models, resulting in different responses to forcings and calibration. The large impact of model selection on streamflow and trend estimates is now increasingly acknowledged (Karlsson et al., 2016; Clark et al., 2016). Calibration generally improves streamflow simulations (Hirpa et al., 2018), and HERA also shows better performance for the stations used in the calibration process (Fig. 7d). The negative biases and variability ratios can be related to the different meteorological forcing (EMO-1) used in the calibration, although an underestimation of the variability was also found in the EFAS v5.0 run (which is forced by EMO-1). The method, parameters, and skill metrics used for calibration further affect the uncertainties. Despite its qualities, the skill metric used for the calibration presented in Sect. 2.1.2 (KGE) is known to result in an underestimation of variability (Brunner et al., 2021) and to place more weight on high values (Garcia et al., 2017).

This could partly explain the reduced performances in reproducing the extreme low flows observed in Figs. 8 and 9. Other uncertainties can arise from surface field maps (Sect. 2.3) and measurement of river discharges (instruments and rating curves). With sparser gauging and more complex hydraulic conditions for high and low flows, uncertainty rises (Despax, 2016).

5 Code and data availability

The HERA hydrological reanalysis and its climate and dynamic socio-economic inputs are available in the JRC data catalogue: https://doi.org/10.2905/a605a675-9444-4017-8b34-d66be5b18c95 (Tilloy et al., 2024). Table 1 provides a brief description of the dataset, and Table 2 gives a general overview of the content of the dataset.

The dataset consists of three distinct folders that are described here and in Table 2:

  • Climate inputs. This is a folder containing the climate forcing for the OS LISFLOOD hydrological model. Out of the five variables provided, three are at daily temporal resolution, i.e. potential evapotranspiration, potential evaporation, and potential evaporation from bare soil (obtained with LISVAP; LISVAP online documentation, 2023), while two have a 6-hourly time step, precipitation, and temperature. The spatial resolution of the climate inputs is 1. The files are in NetCDF format, with one file per year per variable for a total of 355 files (2.3 TB of data).

  • Socio-economic inputs. This is a folder containing the dynamic surface field maps (Sect. 2.3), divided into three categories: land use, reservoirs, and water demand. The land use sub-folder contains 426 yearly files (4.6 GB) of land use fraction maps for all six land use classes. The reservoir sub-folder contains 71 yearly files (3.6 GB) of reservoir locations and identifiers. Reservoirs are added or discarded from the simulation every year according to their construction or destruction data. Finally, the water demand sub-folders contain four files (3.9 GB) representing water demand for the considered sectors (Sect. 2.3.3). Each file contains monthly maps of water abstraction for a given sector. All socio-economic inputs are provided in NetCDF format.

  • River discharge. This folder contains river discharge NetCDF files for each year at a 6-hourly time step for all European rivers with an upstream area greater than 100 km2 (2.3 GB per file, 166 GB total).

All the data share the same projection (WGS84) of grid and spatial resolution (1). Static surface field maps were retrieved directly from the OS LISFLOOD static and parameter maps for Europe (2024) dataset, which were developed in the context of the new EFAS deployment (Decremer et al., 2023). It is important to note that HERA simulates discharge in a slightly smaller domain than the original EFAS domain, and the mask used for HERA is also provided in the dataset.

The LISFLOOD OS hydrological model used in this research is released as open-source software and is available at https://ec-jrc.github.io/lisflood/ (European Commission, Joint Research Centre (JRC), 2025a). Version 4.1.2 of the code was used in this study (https://github.com/ec-jrc/lisflood-code/tree/v4.1.2; European Commission, Joint Research Centre (JRC), 2025b). The R and Python scripts used to assess the performances of HERA against observed discharge and to generate the figures of this article are available in this GitHub repository (https://github.com/Alowis/HERA, last access: 20 January 2025) or from the following Zenodo repository: https://doi.org/10.5281/zenodo.14718275 (Alowis, 2025).

6 Conclusions

Despite the limitations discussed above, HERA represents a state-of-the-art, high-resolution, and long-term hydrological reanalysis for Europe in the form of homogeneous river flow data generated with the OS LISFLOOD model. To our knowledge, no other publicly available hydrological reanalysis currently provides discharge data at similar scales and spatio-temporal coverage for Europe. The inclusion of dynamic socio-economic conditions provides a more realistic reanalysis of river flows in heavily managed European catchments. The increased spatial resolution improves the performance due to a better representation of hydrological processes and inputs required to simulate them, including the river network (Hoch et al., 2023; Thober et al., 2019). HERA advances the reanalysis of extreme hydrological events, notably by the sub-daily temporal resolution and high-resolution bias-corrected climate input. The magnitude and seasonality of extremes are fairly reproduced, even if biases exist in some regions (e.g. central Poland or southern Spain). The dataset covers 70 years and is therefore suited for the analysis of long-term trends of several hydrological signatures. The modelling framework developed here further forms a basis for creating alternative (counter-factual) time series of river discharges where climatic or socio-economic conditions can be kept static, enabling the attribution of changes in hydrological regimes across Europe (Kreibich et al., 2019; Sauer et al., 2021; Scussolini et al., 2024).

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/essd-17-293-2025-supplement.

Author contributions

AT: conceptualization, data curation, formal analysis, software, writing – original draft preparation. DP: conceptualization, methodology, formal analysis, writing – original draft preparation. SG: methodology, software, supervision. CM: methodology, software. GG: software. AB: visualization. SL: conceptualization, methodology, writing – review and editing. HB: conceptualization, methodology, writing – review and editing. LF: conceptualization, methodology, supervision, writing – review and editing.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The authors wish to thank Larisa Tarasova, Oldrich Rakovec, and Rahini Kumar for kindly agreeing to share outputs of their mHM European run, enabling a comparison with HERA. We also wish to thank Pratik Mishra for proofreading the manuscript. Dominik Paprotny was supported by the German Research Foundation (Deutsche Forschungsgemeinschaft – DFG) through the project “Decomposition of flood losses by environmental and economic drivers” (FloodDrivers), grant no. 449175973.

Financial support

Dominik Paprotny has been supported by the Deutsche Forschungsgemeinschaft (grant no. 449175973).

Review statement

This paper was edited by Lukas Gudmundsson and reviewed by three anonymous referees.

References

Aerts, J. P. M., Hut, R. W., van de Giesen, N. C., Drost, N., van Verseveld, W. J., Weerts, A. H., and Hazenberg, P.: Large-sample assessment of varying spatial resolution on the streamflow estimates of the wflow_sbm hydrological model, Hydrol. Earth Syst. Sci., 26, 4407–4430, https://doi.org/10.5194/hess-26-4407-2022, 2022. 

Alfieri, L., Lorini, V., Hirpa, F. A., Harrigan, S., Zsoter, E., Prudhomme, C., and Salamon, P.: A global streamflow reanalysis for 1980–2018, J. Hydrol. X, 6, 100049, https://doi.org/10.1016/j.hydroa.2019.100049, 2020. 

Allen, R. G., Pereira, L. S., Raes, D., and Smith, M.: FAO irrigation and drainage paper, ISBN 92-5-104219-5, 1998. 

Alowis: Alowis/HERA: Scripts associated with HERA, a high resolution pan-European hydrological reanalysis, v1.0.0, Zenodo [code], https://doi.org/10.5281/zenodo.14718275, 2025. 

Barker, L. J., Hannaford, J., Parry, S., Smith, K. A., Tanguy, M., and Prudhomme, C.: Historic hydrological droughts 1891–2015: systematic characterisation for a diverse set of catchments across the UK, Hydrol. Earth Syst. Sci., 23, 4583–4602, https://doi.org/10.5194/hess-23-4583-2019, 2019. 

Batista e Silva, F., Lavalle, C., and Koomen, E.: A procedure to obtain a refined European land use/cover map, Journal of Land Use Science, 8, 255–283, https://doi.org/10.1080/1747423X.2012.667450, 2013. 

Beck, H. E., van Dijk, A. I. J. M., de Roo, A., Miralles, D. G., McVicar, T. R., Schellekens, J., and Bruijnzeel, L. A.: Global-scale regionalization of hydrologic model parameters, Water Resour. Res., 52, 3599–3622, https://doi.org/10.1002/2015WR018247, 2016. 

Beck, H. E., van Dijk, A. I. J. M., de Roo, A., Dutra, E., Fink, G., Orth, R., and Schellekens, J.: Global evaluation of runoff from 10 state-of-the-art hydrological models, Hydrol. Earth Syst. Sci., 21, 2881–2903, https://doi.org/10.5194/hess-21-2881-2017, 2017. 

Beck, H. E., Wood, E. F., Mcvicar, T. R., Zambrano-Bigiarini, M., Alvarez-Garreton, C., Baez-Villanueva, O. M., Sheffield, J., and Karger, D. N.: Bias correction of global high-resolution precipitation climatologies using streamflow observations from 9372 catchments, J. Climate, 33, 1299–1315, https://doi.org/10.1175/JCLI-D-19-0332.1, 2020. 

Beck, H. E., van Dijk, A. I. J. M., Larraondo, P. R., McVicar, T. R., Pan, M., Dutra, E., and Miralles, D. G.: MSWX: global 3-hourly 0.1° bias-corrected meteorological data including near-real-time updates and forecast ensembles, B. Am. Meteorol. Soc., 103, E710–E732, https://doi.org/10.1175/BAMS-D-21-0145.1, 2022. 

Berghuijs, W. R., Harrigan, S., Molnar, P., Slater, L. J., and Kirchner, J. W.: The relative importance of different flood-generating mechanisms across Europe, Water Resour. Res., 55, 4582–4593, https://doi.org/10.1029/2019WR024841, 2019. 

Blöschl, G., Bierkens, M. F. P., Chambel, A., et al.: Twenty-three unsolved problems in hydrology (UPH) – a community perspective, Hydrolog. Sci. J., 64, 1141–1158, https://doi.org/10.1080/02626667.2019.1620507, 2019a. 

Blöschl, G., Hall, J., Viglione, A., Perdigão, R. A. P., Parajka, J., Merz, B., Lun, D., Arheimer, B., Aronica, G. T., Bilibashi, A., Boháč, M., Bonacci, O., Borga, M., Čanjevac, I., Castellarin, A., Chirico, G. B., Claps, P., Frolova, N., Ganora, D., Gorbachova, L., Gül, A., Hannaford, J., Harrigan, S., Kireeva, M., Kiss, A., Kjeldsen, T. R., Kohnová, S., Koskela, J. J., Ledvinka, O., Macdonald, N., Mavrova-Guirguinova, M., Mediero, L., Merz, R., Molnar, P., Montanari, A., Murphy, C., Osuch, M., Ovcharuk, V., Radevski, I., Salinas, J. L., Sauquet, E., Šraj, M., Szolgay, J., Volpi, E., Wilson, D., Zaimi, K., and Živković, N.: Changing climate both increases and decreases European river floods, Nature, 573, 108–111, https://doi.org/10.1038/s41586-019-1495-6, 2019b. 

Brönnimann, S., Allan, R., Atkinson, C., Buizza, R., Bulygina, O., Dahlgren, P., Dee, D., Dunn, R., Gomes, P., John, V. O., Jourdain, S., Haimberger, L., Hersbach, H., Kennedy, J., Poli, P., Pulliainen, J., Rayner, N., Saunders, R., Schulz, J., Sterin, A., Stickler, A., Titchner, H., Valente, M. A., Ventura, C., and Wilkinson, C.: Observations for reanalyses, B. Am. Meteorol. Soc., 99, 1851–1866, https://doi.org/10.1175/BAMS-D-17-0229.1, 2018. 

Brunner, M. I.: Reservoir regulation affects droughts and floods at local and regional scales, Environ. Res. Lett., 16, 124016, https://doi.org/10.1088/1748-9326/ac36f6, 2021. 

Brunner, M. I., Melsen, L. A., Wood, A. W., Rakovec, O., Mizukami, N., Knoben, W. J. M., and Clark, M. P.: Flood spatial coherence, triggers, and performance in hydrological simulations: large-sample evaluation of four streamflow-calibrated models, Hydrol. Earth Syst. Sci., 25, 105–119, https://doi.org/10.5194/hess-25-105-2021, 2021. 

Burek, P., van der Knijff, J., and De Roo, A.: LISFLOOD – Distributed water balance and flood simulation model – revised user manual 2013, EU publications, https://doi.org/10.2788/24719, 2013. 

Calvin, K., Patel, P., Clarke, L., Asrar, G., Bond-Lamberty, B., Cui, R. Y., Di Vittorio, A., Dorheim, K., Edmonds, J., Hartin, C., Hejazi, M., Horowitz, R., Iyer, G., Kyle, P., Kim, S., Link, R., McJeon, H., Smith, S. J., Snyder, A., Waldhoff, S., and Wise, M.: GCAM v5.1: representing the linkages between energy, water, land, climate, and economic systems, Geosci. Model Dev., 12, 677–698, https://doi.org/10.5194/gmd-12-677-2019, 2019. 

Cammalleri, C., Vogt, J., and Salamon, P.: Development of an operational low-flow index for hydrological drought monitoring over Europe, Hydrolog. Sci. J., 62, 346–358, https://doi.org/10.1080/02626667.2016.1240869, 2017. 

Cammalleri, C., Barbosa, P., and Vogt, J.: Evaluating simulated daily discharge for operational hydrological drought monitoring in the Global Drought Observatory (GDO), Hydrolog. Sci. J., 65, 1316–1325, https://doi.org/10.1080/02626667.2020.1747623, 2020a. 

Cammalleri, C., Naumann, G., Mentaschi, L., Bisselink, B., Gelati, E., De Roo, A., and Feyen, L.: Diverging hydrological drought traits over Europe with global warming, Hydrol. Earth Syst. Sci., 24, 5919–5935, https://doi.org/10.5194/hess-24-5919-2020, 2020b. 

Cannon, A. J.: Multivariate quantile mapping bias correction: an N-dimensional probability density function transform for climate model simulations of multiple variables, Clim. Dynam., 50, 31–49, https://doi.org/10.1007/s00382-017-3580-6, 2018. 

Cantoni, E., Tramblay, Y., Grimaldi, S., Salamon, P., Dakhlaoui, H., Dezetter, A., and Thiemig, V.: Hydrological performance of the ERA5 reanalysis for flood modeling in Tunisia with the LISFLOOD and GR4J models, J. Hydrol., 42, 101169, https://doi.org/10.1016/j.ejrh.2022.101169, 2022. 

CEMS-Flood online documentation: https://confluence.ecmwf.int/display/CEMS/CEMS-Flood, last access: 14 December 2023. 

Choulga, M., Moschini, F., Mazzetti, C., Grimaldi, S., Disperati, J., Beck, H., Salamon, P., and Prudhomme, C.: Technical note: Surface fields for global environmental modelling, Hydrol. Earth Syst. Sci., 28, 2991–3036, https://doi.org/10.5194/hess-28-2991-2024, 2024. 

Clark, M. P., Wilby, R. L., Gutmann, E. D., Vano, J. A., Gangopadhyay, S., Wood, A. W., Fowler, H. J., Prudhomme, C., Arnold, J. R., and Brekke, L. D.: Characterizing Uncertainty of the Hydrologic Impacts of Climate Change, Curr. Clim. Change Rep., 2, 55–64, https://doi.org/10.1007/s40641-016-0034-x, 2016. 

Copernicus: Copernicus Global Land Service – Leaf Area Index, Copernicus [data set], https://land.copernicus.eu/en/products/vegetation (last access: 20 January 2025), 2021. 

Decremer, D., Mazzetti, C., Carton, C., Gomes, G., Russo, C., Ramos, A., Grimaldi, S., Disperati, J., Ziese, M., Garcia Sanchez, R., Jacobson, T., Salamon, P., and Prudhomme, C.: EFAS v5.0 hydrological reanalysis, Joint Research Centre Data Catalogue [data set], http://data.europa.eu/89h/76b4b9de-a5c6-4344-8d88-c4bed7752ce3 (last access: 9 January 2025), 2023. 

De Roo, A., Odijk, M., Schmuck, G., Koster, E., and Lucieer, A.: Assessing the effects of land use changes on floods in the meuse and oder catchment, Phys. Chem. Earth Pt. B, 26, 593–599, https://doi.org/10.1016/S1464-1909(01)00054-5, 2001. 

Despax, A.: Incertitude des mesures de débit des cours d'eau au courantomètre. Amélioration des méthodes analytiques et apports des essais interlaboratoires, These de doctorat, Université Grenoble Alpes (ComUE), https://theses.hal.science/tel-01496704 (last access: 20 January 2025), 2016. 

Donat, M. G., Sillmann, J., Wild, S., Alexander, L. V., Lippmann, T., and Zwiers, F. W.: Consistency of temperature and precipitation extremes across various global gridded in situ and reanalysis datasets, J. Climate, 27, 5019–5035, https://doi.org/10.1175/JCLI-D-13-00405.1, 2014. 

Dottori, F., Alfieri, L., Bianchi, A., Skoien, J., and Salamon, P.: A new dataset of river flood hazard maps for Europe and the Mediterranean Basin, Earth Syst. Sci. Data, 14, 1549–1569, https://doi.org/10.5194/essd-14-1549-2022, 2022. 

Ekolu, J., Dieppois, B., Sidibe, M., Eden, J. M., Tramblay, Y., Villarini, G., Peña-Angulo, D., Mahé, G., Paturel, J.-E., Onyutha, C., and van de Wiel, M.: Long-term variability in hydrological droughts and floods in sub-Saharan Africa: new perspectives from a 65 year daily streamflow dataset, J. Hydrol., 613, 128359, https://doi.org/10.1016/j.jhydrol.2022.128359, 2022. 

European Commission, Joint Research Centre (JRC): Open Source Lisflood, European Commission, Joint Research Centre (JRC) [code], https://ec-jrc.github.io/lisflood/, last access: 12 January 2025a. 

European Commission, Joint Research Centre (JRC): OS LISFLOOD v4.1.2 release, European Commission, Joint Research Centre (JRC) [code], https://github.com/ec-jrc/lisflood-code/tree/v4.1.2, last access: 20 January 2025b. 

Feyen, L. and Dankers, R.: Impact of global warming on streamflow drought in Europe, J. Geophys. Res., 114, D17116, https://doi.org/10.1029/2008JD011438, 2009. 

Florczyk, A. J., Corbane, C., Ehrlich, D., Freire, S., Kemper, T., Maffenini, L., Melchiorri, M., Pesaresi, M., Politis, P., Schiavina, M., Sabo, F., and Zanchetta, L.: GHSL Data Package 2019 – Technical report by the Joint Research Centre (JRC), European Union, 38 pp, https://doi.org/10.2760/0726, 2019. 

Food and Agriculture Organisation: AQUASTAT, https://www.fao.org/aquastat/en/ (last access: 13 November 2024), 2023. 

Fortin, F.-A., De Rainville, F.-M., Gardner, M.-A., Parizeau, M., and Gagné, C.: DEAP: Evolutionary algorithms made easy, J. Mach. Learn. Res., 13, 2171–2175, 2012. 

François, H., Samacoïts, R., Bird, D. N., Köberl, J., Prettenthaler, F., and Morin, S.: Climate change exacerbates snow-water-energy challenges for European ski tourism, Nat. Clim. Change, 13, 935–942, https://doi.org/10.1038/s41558-023-01759-5, 2023. 

Frieler, K., Volkholz, J., Lange, S., Schewe, J., Mengel, M., del Rocío Rivas López, M., Otto, C., Reyer, C. P. O., Karger, D. N., Malle, J. T., Treu, S., Menz, C., Blanchard, J. L., Harrison, C. S., Petrik, C. M., Eddy, T. D., Ortega-Cisneros, K., Novaglio, C., Rousseau, Y., Watson, R. A., Stock, C., Liu, X., Heneghan, R., Tittensor, D., Maury, O., Büchner, M., Vogt, T., Wang, T., Sun, F., Sauer, I. J., Koch, J., Vanderkelen, I., Jägermeyr, J., Müller, C., Rabin, S., Klar, J., Vega del Valle, I. D., Lasslop, G., Chadburn, S., Burke, E., Gallego-Sala, A., Smith, N., Chang, J., Hantson, S., Burton, C., Gädeke, A., Li, F., Gosling, S. N., Müller Schmied, H., Hattermann, F., Wang, J., Yao, F., Hickler, T., Marcé, R., Pierson, D., Thiery, W., Mercado-Bettín, D., Ladwig, R., Ayala-Zamora, A. I., Forrest, M., and Bechtold, M.: Scenario setup and forcing data for impact model evaluation and impact attribution within the third round of the Inter-Sectoral Impact Model Intercomparison Project (ISIMIP3a), Geosci. Model Dev., 17, 1–51, https://doi.org/10.5194/gmd-17-1-2024, 2024. 

Garcia, F., Folton, N., and Oudin, L.: Which objective function to calibrate rainfall–runoff models for low-flow index simulations?, Hydrolog. Sci. J., 62, 1149–1166, https://doi.org/10.1080/02626667.2017.1308511, 2017. 

Gilbert, M., Nicolas, G., Cinardi, G., Van Boeckel, T. P., Vanwambeke, S. O., Wint, G. R. W., and Robinson, T. P.: Global distribution data for cattle, buffaloes, horses, sheep, goats, pigs, chickens and ducks in 2010, Sci Data, 5, 180227, https://doi.org/10.1038/sdata.2018.227, 2018. 

Gudmundsson, L., Boulange, J., Do, H. X., Gosling, S. N., Grillakis, M. G., Koutroulis, A. G., Leonard, M., Liu, J., Müller Schmied, H., Papadimitriou, L., Pokhrel, Y., Seneviratne, S. I., Satoh, Y., Thiery, W., Westra, S., Zhang, X., and Zhao, F.: Globally observed trends in mean and extreme river flow attributed to climate change, Science, 371, 1159–1162, https://doi.org/10.1126/science.aba3996, 2021. 

Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: implications for improving hydrological modelling, J. Hydrol., 377, 80–91, https://doi.org/10.1016/j.jhydrol.2009.08.003, 2009. 

Hanasaki, N., Matsuda, H., Fujiwara, M., Hirabayashi, Y., Seto, S., Kanae, S., and Oki, T.: Toward hyper-resolution global hydrological models including human activities: application to Kyushu island, Japan, Hydrol. Earth Syst. Sci., 26, 1953–1975, https://doi.org/10.5194/hess-26-1953-2022, 2022. 

Hannaford, J. and Marsh, T.: An assessment of trends in UK runoff and low flows using a network of undisturbed catchments, Int. J. Climatol., 26, 1237–1253, https://doi.org/10.1002/joc.1303, 2006. 

Harrigan, S., Zsoter, E., Alfieri, L., Prudhomme, C., Salamon, P., Wetterhall, F., Barnard, C., Cloke, H., and Pappenberger, F.: GloFAS-ERA5 operational global river discharge reanalysis 1979–present, Earth Syst. Sci. Data, 12, 2043–2060, https://doi.org/10.5194/essd-12-2043-2020, 2020. 

Hengl, T., Jesus, J. M. de, MacMillan, R. A., Batjes, N. H., Heuvelink, G. B. M., Ribeiro, E., Samuel-Rosa, A., Kempen, B., Leenaars, J. G. B., Walsh, M. G., and Gonzalez, M. R.: SoilGrids1km – global soil information based on automated mapping, PLOS ONE, 9, e105992, https://doi.org/10.1371/journal.pone.0105992, 2014. 

Hirpa, F. A., Salamon, P., Beck, H. E., Lorini, V., Alfieri, L., Zsoter, E., and Dadson, S. J.: Calibration of the Global Flood Awareness System (GloFAS) using daily streamflow data, J. Hydrol., 566, 595–606, https://doi.org/10.1016/j.jhydrol.2018.09.052, 2018. 

Hoch, J. M., Sutanudjaja, E. H., Wanders, N., van Beek, R. L. P. H., and Bierkens, M. F. P.: Hyper-resolution PCR-GLOBWB: opportunities and challenges from refining model spatial resolution to 1 km over the European continent, Hydrol. Earth Syst. Sci., 27, 1383–1401, https://doi.org/10.5194/hess-27-1383-2023, 2023. 

Huang, Z., Hejazi, M., Li, X., Tang, Q., Vernon, C., Leng, G., Liu, Y., Döll, P., Eisner, S., Gerten, D., Hanasaki, N., and Wada, Y.: Reconstruction of global gridded monthly sectoral water withdrawals for 1971–2010 and analysis of their spatiotemporal patterns, Hydrol. Earth Syst. Sci., 22, 2117–2133, https://doi.org/10.5194/hess-22-2117-2018, 2018. 

International Food Policy Research Institute: Global Spatially-Disaggregated Crop Production Statistics Data for 2010 Version 2.0, International Food Policy Research Institute [data set], https://doi.org/10.7910/DVN/PRFF8V, 2019. 

IPCC: Chapter 3: Human Influence on the Climate System, in: Climate Change 2021 – The Physical Science Basis: Working Group I Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, Cambridge University Press, https://doi.org/10.1017/9781009157896, 2023. 

Karlsson, I. B., Sonnenborg, T. O., Refsgaard, J. C., Trolle, D., Børgesen, C. D., Olesen, J. E., Jeppesen, E., and Jensen, K. H.: Combined effects of climate models, hydrological model structures and land use scenarios on hydrological impacts of climate change, J. Hydrol., 535, 301–317, https://doi.org/10.1016/j.jhydrol.2016.01.069, 2016. 

Kauffeldt, A., Wetterhall, F., Pappenberger, F., Salamon, P., and Thielen, J.: Technical review of large-scale hydrological models for implementation in operational flood forecasting schemes on continental level, Environ. Modell. Softw., 75, 68–76, https://doi.org/10.1016/j.envsoft.2015.09.009, 2016. 

Klein Goldewijk, K., Beusen, A., Doelman, J., and Stehfest, E.: Anthropogenic land use estimates for the Holocene – HYDE 3.2, Earth Syst. Sci. Data, 9, 927–953, https://doi.org/10.5194/essd-9-927-2017, 2017. 

Kling, H., Fuchs, M., and Paulin, M.: Runoff conditions in the upper Danube basin under an ensemble of climate change scenarios, J. Hydrol., 424/425, 264–277, https://doi.org/10.1016/j.jhydrol.2012.01.011, 2012. 

Knoben, W. J. M., Freer, J. E., and Woods, R. A.: Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores, Hydrol. Earth Syst. Sci., 23, 4323–4331, https://doi.org/10.5194/hess-23-4323-2019, 2019. 

Kohn, I., Stahl, K., and Stölzle, M.: Low Flow Events – a review in the context of climate change in Switzerland. Hydro-CH2018 Project, https://doi.org/10.6094/UNIFR/150448, 2019. 

Kreibich, H., Blauhut, V., Aerts, J. C. J. H., Bouwer, L. M., Van Lanen, H. A. J., Mejia, A., Mens, M., and Van Loon, A. F.: How to improve attribution of changes in drought and flood impacts, Hydrolog. Sci. J., 64, 1–18, https://doi.org/10.1080/02626667.2018.1558367, 2019. 

Kumar, R., Samaniego, L., and Attinger, S.: Implications of distributed hydrologic model parameterization on water fluxes at multiple scales and locations, Water Resour. Res., 49, 360–379, https://doi.org/10.1029/2012WR012195, 2013. 

Lange, S.: Trend-preserving bias adjustment and statistical downscaling with ISIMIP3BASD (v1.0), Geosci. Model Dev., 12, 3055–3070, https://doi.org/10.5194/gmd-12-3055-2019, 2019. 

Lange, S., Quesada-Chacón, D., Büchner, M.: Secondary ISIMIP3b bias-adjusted atmospheric climate input data (v1.4), ISIMIP Repository, https://doi.org/10.48364/ISIMIP.581124.4, 2024. 

Lehner, B., Liermann, C. R., Revenga, C., Vörösmarty, C., Fekete, B., Crouzet, P., Döll, P., Endejan, M., Frenken, K., Magome, J., Nilsson, C., Robertson, J. C., Rödel, R., Sindorf, N., and Wisser, D.: High-resolution mapping of the world's reservoirs and dams for sustainable river-flow management, Front. Ecol. Environ, 9, 494–502, https://doi.org/10.1890/100125, 2011. 

Li, X., Zhou, Y., Hejazi, M., Wise, M., Vernon, C., Iyer, G., and Chen, W.: Global urban growth between 1870 and 2100 from integrated high resolution mapped data and urban dynamic modeling, Communications Earth and Environment, 2, 1–10, https://doi.org/10.1038/s43247-021-00273-w, 2021. 

Lin, P., Pan, M., Beck, H. E., Yang, Y., Yamazaki, D., Frasson, R., David, C. H., Durand, M., Pavelsky, T. M., Allen, G. H., Gleason, C. J., and Wood, E. F.: Global Reconstruction of Naturalized River Flows at 2.94 Million Reaches, Water Resour. Res., 55, 6499–6516, https://doi.org/10.1029/2019WR025287, 2019. 

LISFLOOD static and parameter maps for Europe: http://data.europa.eu/89h/f572c443-7466-4adf-87aa-c0847a169f23, last access: 11 January 2024. 

LISVAP online documentation: https://ec-jrc.github.io/lisflood-lisvap/, last access: 15 December 2023. 

Mahto, S. S. and Mishra, V.: Does ERA-5 Outperform Other Reanalysis Products for Hydrologic Applications in India?, J. Geophys. Res.-Atmos., 124, 9423–9441, https://doi.org/10.1029/2019JD031155, 2019. 

McClean, F., Dawson, R., and Kilsby, C.: Intercomparison of global reanalysis precipitation for flood risk modelling, Hydrol. Earth Syst. Sci., 27, 331–347, https://doi.org/10.5194/hess-27-331-2023, 2023. 

Mentaschi, L., Alfieri, L., Dottori, F., Cammalleri, C., Bisselink, B., Roo, A. D., and Feyen, L.: Independence of future changes of river runoff in Europe from the pathway to global warming, Climate, 8, 22, https://doi.org/10.3390/cli8020022, 2020. 

Merz, B., Blöschl, G., Vorogushyn, S., Dottori, F., Aerts, J. C. J. H., Bates, P., Bertola, M., Kemter, M., Kreibich, H., Lall, U., and Macdonald, E.: Causes, impacts and patterns of disastrous river floods, Nat. Rev. Earth Environ., 2, 592–609, https://doi.org/10.1038/s43017-021-00195-3, 2021. 

Muñoz-Sabater, J., Dutra, E., Agustí-Panareda, A., Albergel, C., Arduini, G., Balsamo, G., Boussetta, S., Choulga, M., Harrigan, S., Hersbach, H., Martens, B., Miralles, D. G., Piles, M., Rodríguez-Fernández, N. J., Zsoter, E., Buontempo, C., and Thépaut, J.-N.: ERA5-Land: a state-of-the-art global reanalysis dataset for land applications, Earth Syst. Sci. Data, 13, 4349–4383, https://doi.org/10.5194/essd-13-4349-2021, 2021. 

Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual models part I – A discussion of principles, J. Hydrol., 10, 282–290, https://doi.org/10.1016/0022-1694(70)90255-6, 1970. 

Nilsson, C., Reidy, C. A., Dynesius, M., and Revenga, C.: Fragmentation and flow regulation of the world's large river systems, Science, 308, 405–408, https://doi.org/10.1126/science.1107887, 2005. 

O'Neill, M. M. F., Tijerina, D. T., Condon, L. E., and Maxwell, R. M.: Assessment of the ParFlow–CLM CONUS 1.0 integrated hydrologic model: evaluation of hyper-resolution water balance components across the contiguous United States, Geosci. Model Dev., 14, 7223–7254, https://doi.org/10.5194/gmd-14-7223-2021, 2021. 

Paprotny, D. and Mengel, M.: Population, land use and economic exposure estimates for Europe at 100 m resolution from 1870 to 2020, Sci Data, 10, 372, https://doi.org/10.1038/s41597-023-02282-0, 2023. 

Paprotny, D., Terefenko, P., and Śledziowski, J.: HANZE v2.1: an improved database of flood impacts in Europe from 1870 to 2020, Earth Syst. Sci. Data, 16, 5145–5170, https://doi.org/10.5194/essd-16-5145-2024, 2024. 

Pfahl, S. and Wernli, H.: Quantifying the relevance of atmospheric blocking for co-located temperature extremes in the Northern Hemisphere on (sub-)daily time scales, Geophys. Res. Lett., 39, L12807, https://doi.org/10.1029/2012GL052261, 2012. 

Prudhomme, C., Parry, S., Hannaford, J., Clark, D. B., Hagemann, S., and Voss, F.: How well do large-scale models reproduce regional hydrological extremes in Europe?, J. Hydrometeorol., 12, 1181–1204, https://doi.org/10.1175/2011JHM1387.1, 2011. 

Richards, N. and Gutierrez-Arellano, C.: Effects of community-based water management decisions at catchment scale, an interdisciplinary approach: the case of the Great Ruaha River Catchment, Tanzania, Water Practice and Technology, 17, 598–611, https://doi.org/10.2166/wpt.2022.010, 2022. 

Sajikumar, N. and Remya, R. S.: Impact of land cover and land use change on runoff characteristics, J. Environ. Manage., 161, 460–468, https://doi.org/10.1016/j.jenvman.2014.12.041, 2015. 

Samaniego, L., Kumar, R., and Attinger, S.: Multiscale parameter regionalization of a grid-based hydrologic model at the mesoscale, Water Resour. Res., 46, W05523, https://doi.org/10.1029/2008WR007327, 2010. 

Samaniego, L., Thober, S., Wanders, N., Pan, M., Rakovec, O., Sheffield, J., Wood, E. F., Prudhomme, C., Rees, G., Houghton-Carr, H., Fry, M., Smith, K., Watts, G., Hisdal, H., Estrela, T., Buontempo, C., Marx, A., and Kumar, R.: Hydrological forecasts and projections for improved decision-making in the water sector in Europe, B. Am. Meteorol. Soc., 100, 2451–2472, https://doi.org/10.1175/BAMS-D-17-0274.1, 2019. 

Sauer, I. J., Reese, R., Otto, C., Geiger, T., Willner, S. N., Guillod, B. P., Bresch, D. N., and Frieler, K.: Climate signals in river flood damages emerge under sound regional disaggregation, Nat. Commun., 12, 2128, https://doi.org/10.1038/s41467-021-22153-9, 2021. 

Schellekens, J., Dutra, E., Weiland, F. S., Minvielle, M., Calvet, J.-C., Decharme, B., Eisner, S., Fink, G., Flörke, M., Peßenteiner, S., van Beek, R., Polcher, J., Beck, H., Orth, R., Calton, B., Burke, S., Dorigo, W., Weedon, G. P., and Delft, H.: A global water resources ensemble of hydrological models: the eartH2Observe Tier-1 dataset, 2017. 

Schiavina, M., Freire, S., and MacManus, K.: GHS-POP R2019A – GHS population grid multitemporal (1975–1990–2000–2015) (R2019A), Copernicus [data set], https://ghsl.jrc.ec.europa.eu/ghs_pop2019.php (last access: 7 January 2024), 2019. 

Scussolini, P., Luu, L. N., Philip, S., Berghuijs, W. R., Eilander, D., Aerts, J. C. J. H., Kew, S. F., van Oldenborgh, G. J., Toonen, W. H. J., Volkholz, J., and Coumou, D.: Challenges in the attribution of river flood events, WIRES Clim. Change, 15, e874, https://doi.org/10.1002/wcc.874, 2024. 

Sood, A. and Smakhtin, V.: Global hydrological models: a review, Hydrolog. Sci. J., 60, 549–565, https://doi.org/10.1080/02626667.2014.950580, 2015. 

Speckhann, G. A., Kreibich, H., and Merz, B.: Inventory of dams in Germany, Earth Syst. Sci. Data, 13, 731–740, https://doi.org/10.5194/essd-13-731-2021, 2021. 

Thiemig, V., Gomes, G. N., Skøien, J. O., Ziese, M., Rauthe-Schöch, A., Rustemeier, E., Rehfeldt, K., Walawender, J. P., Kolbe, C., Pichon, D., Schweim, C., and Salamon, P.: EMO-5: a high-resolution multi-variable gridded meteorological dataset for Europe, Earth Syst. Sci. Data, 14, 3249–3272, https://doi.org/10.5194/essd-14-3249-2022, 2022. 

Thober, S., Cuntz, M., Kelbling, M., Kumar, R., Mai, J., and Samaniego, L.: The multiscale routing model mRM v1.0: simple river routing at resolutions from 1 to 50 km, Geosci. Model Dev., 12, 2501–2521, https://doi.org/10.5194/gmd-12-2501-2019, 2019. 

Tilloy, A., Malamud, B. D., and Joly-Laugel, A.: A methodology for the spatiotemporal identification of compound hazards: wind and precipitation extremes in Great Britain (1979–2019), Earth Syst. Dynam., 13, 993–1020, https://doi.org/10.5194/esd-13-993-2022, 2022. 

Tilloy, A., Paprotny, D., Feyen, L., Grimaldi, S., Gomes, G., Beck, H., Lange, S., and Bianchi, A.: HERA: a high-resolution pan-European hydrological reanalysis (1951–2020), European Commission, Joint Research Centre (JRC) [data set], https://doi.org/10.2905/a605a675-9444-4017-8b34-d66be5b18c95, 2024. 

Tomkins, K. M.: Uncertainty in streamflow rating curves: methods, controls and consequences, Hydrol. Process., 28, 464–481, https://doi.org/10.1002/hyp.9567, 2014. 

Tramblay, Y., Arnaud, P., Artigue, G., Lang, M., Paquet, E., Neppel, L., and Sauquet, E.: Changes in Mediterranean flood processes and seasonality, Hydrol. Earth Syst. Sci., 27, 2973–2987, https://doi.org/10.5194/hess-27-2973-2023, 2023. 

Van Der Knijff, J. M., Younis, J., and De Roo, A. P. J.: LISFLOOD: a GIS-based distributed model for river basin scale water balance and flood simulation, Int. J. Geogr. Inf. Sci., 24, 189–212, https://doi.org/10.1080/13658810802549154, 2008. 

Van Lanen, H. A. J., Wanders, N., Tallaksen, L. M., and Van Loon, A. F.: Hydrological drought across the world: impact of climate and physical catchment structure, Hydrol. Earth Syst. Sci., 17, 1715–1732, https://doi.org/10.5194/hess-17-1715-2013, 2013. 

Van Loon, A.: Hydrological drought explained, WIRES Water, 2, 359–392, https://doi.org/10.1002/wat2.1085, 2015. 

Vanham, D., Alfieri, L., Flörke, M., Grimaldi, S., Lorini, V., Roo, A. de, and Feyen, L.: The number of people exposed to water stress in relation to how much water is reserved for the environment: a global modelling study, The Lancet Planetary Health, 5, e766–e774, https://doi.org/10.1016/S2542-5196(21)00234-5, 2021. 

Vanham, D., Alfieri, L., and Feyen, L.: National water shortage for low to high environmental flow protection, Sci. Rep.-UK, 12, 3037, https://doi.org/10.1038/s41598-022-06978-y, 2022.  

Vassolo, S. and Döll, P.: Global-scale gridded estimates of thermoelectric power and manufacturing water use, Water Resour. Res., 41, W04010, https://doi.org/10.1029/2004WR003360, 2005. 

Vicente-Serrano, S. M., Peña-Gallardo, M., Hannaford, J., Murphy, C., Lorenzo-Lacruz, J., Dominguez-Castro, F., López-Moreno, J. I., Beguería, S., Noguera, I., Harrigan, S., and Vidal, J.-P.: Climate, irrigation, and land cover change explain streamflow trends in countries bordering the Northeast Atlantic, Geophys. Res. Lett., 46, 10821–10833, https://doi.org/10.1029/2019GL084084, 2019. 

Wada, Y., Flörke, M., Hanasaki, N., Eisner, S., Fischer, G., Tramberend, S., Satoh, Y., van Vliet, M. T. H., Yillia, P., Ringler, C., Burek, P., and Wiberg, D.: Modeling global water use for the 21st century: the Water Futures and Solutions (WFaS) initiative and its approaches, Geosci. Model Dev., 9, 175–222, https://doi.org/10.5194/gmd-9-175-2016, 2016. 

Wang, H., Liu, J., Klaar, M., Chen, A., Gudmundsson, L., and Holden, J.: Anthropogenic climate change has influenced global river flow seasonality, Science, 383, 1009–1014, https://doi.org/10.1126/science.adi9501, 2024. 

Wood, E. F., Roundy, J. K., Troy, T. J., van Beek, L. P. H., Bierkens, M. F. P., Blyth, E., de Roo, A., Döll, P., Ek, M., Famiglietti, J., Gochis, D., van de Giesen, N., Houser, P., Jaffé, P. R., Kollet, S., Lehner, B., Lettenmaier, D. P., Peters-Lidard, C., Sivapalan, M., Sheffield, J., Wade, A., and Whitehead, P.: Hyperresolution global land surface modeling: meeting a grand challenge for monitoring Earth's terrestrial water, Water Resour. Res., 47, 2010WR010090, https://doi.org/10.1029/2010WR010090, 2011. 

Yamazaki, D.: CaMa-Flood, http://hydro.iis.u-tokyo.ac.jp/~yamadai/cama-flood/index.html (last access: 28 November 2024), 2023. 

Yamazaki, D., Ikeshima, D., Sosa, J., Bates, P. D., Allen, G. H., and Pavelsky, T. M.: MERIT hydro: a high-resolution global hydrography map based on latest topography dataset, Water Resour. Res., 55, 5053–5073, https://doi.org/10.1029/2019WR024873, 2019. 

Yang, Y., Pan, M., Lin, P., Beck, H. E., Zeng, Z., Yamazaki, D., David, C. H., Lu, H., Yang, K., Hong, Y., and Wood, E. F.: Global reach-level 3-hourly river flood reanalysis (1980–2019), B. Am. Meteorol. Soc., 102, E2086–E2105, https://doi.org/10.1175/BAMS-D-20-0057.1, 2021. 

Yu, Q., You, L., Wood-Sichra, U., Ru, Y., Joglekar, A. K. B., Fritz, S., Xiong, W., Lu, M., Wu, W., and Yang, P.: A cultivated planet in 2010 – Part 2: The global gridded agricultural-production maps, Earth Syst. Sci. Data, 12, 3545–3572, https://doi.org/10.5194/essd-12-3545-2020, 2020. 

Zajac, Z., Revilla-Romero, B., Salamon, P., Burek, P., Hirpa, F. A., and Beck, H.: The impact of lake and reservoir parameterization on global streamflow simulation, J. Hydrol., 548, 552–568, https://doi.org/10.1016/j.jhydrol.2017.03.022, 2017. 

Download
Short summary
This article presents a reanalysis of Europe's river streamflow for the period 1951–2020. Streamflow is estimated through a state-of-the-art hydrological simulation framework benefitting from detailed information about the landscape, climate, and human activities. The resulting Hydrological European ReAnalysis (HERA) can be a valuable tool for studying hydrological dynamics, including the impacts of climate change and human activities on European water resources and flood and drought risks.
Altmetrics
Final-revised paper
Preprint