Articles | Volume 15, issue 6
Data description paper
12 Jun 2023

CHELSA-W5E5: daily 1 km meteorological forcing data for climate impact studies

Dirk Nikolaus Karger, Stefan Lange, Chantal Hari, Christopher P. O. Reyer, Olaf Conrad, Niklaus E. Zimmermann, and Katja Frieler

Current changes in the world's climate increasingly impact a wide variety of sectors globally, from agriculture and ecosystems to water and energy supply or human health. Many impacts of climate on these sectors happen at high spatio-temporal resolutions that are not covered by current global climate datasets. Here we present CHELSA-W5E5 (Karger et al., 2022): a climate forcing dataset at daily temporal resolution and 30 arcsec spatial resolution for air temperatures, precipitation rates, and downwelling shortwave solar radiation. This dataset is a spatially downscaled version of the 0.5° W5E5 dataset using the CHELSA V2 topographic downscaling algorithm. We show that the downscaling generally increases the accuracy of climate data by decreasing the bias and increasing the correlation with measurements from meteorological stations. Bias reductions are largest in topographically complex terrain. Limitations arise for minimum near-surface air temperatures in regions that are prone to cold-air pooling or at the upper extreme end of surface downwelling shortwave radiation. We further show that our topographically downscaled climate data compare well with the results of dynamical downscaling using the Weather Research and Forecasting (WRF) regional climate model, as time series from both sources are similarly well correlated to station observations. This is remarkable given the lower computational cost of the CHELSA V2 algorithm compared to WRF and similar models. Overall, we conclude that the downscaling can provide higher-resolution climate data with increased accuracy. Hence, the dataset will be of value for a wide range of climate change impact studies both at global level and for applications that cover more than one region and benefit from using a consistent dataset across these regions.

1 Introduction

With ongoing climate change, the assessment of climate change impacts on natural and social systems requires increasing attention (IPCC, 2022). Historically, a strong focus has been on the scientific exploration of climate impacts on agriculture, forestry, water management, human health, and other sectors by using climate impact models driven by historical or projected future climate data. Yet, with observed climate change impacts emerging widely already at current levels of warming (IPCC, 2022), a wide range of decision-making processes as well as business activities increasingly rely on actionable knowledge from impact models that is useful beyond the scientific community which is developing them. For example, so-called climate services are designed to support adaptation of stakeholders and their activities in response to climate change (Brasseur and Gallardo, 2016; Hewitt et al., 2012; Lourenço et al., 2016), while the attribution of climate impacts has become highly relevant for climate litigation (Mengel et al., 2021). There is also an increasing demand to quantify damage that cannot be avoided by climate mitigation or adaptation (Huber et al., 2022). These activities require highly accurate climate impact datasets at high spatio-temporal resolution. Daily temporal resolution, for example, allows extreme events such as heavy precipitation or heat waves to be captured that would not be visible at monthly resolution (Ban et al., 2021). Likewise, high spatial resolution (e.g. 30 arcsec, i.e. ∼ 1 km at the Equator) allows topographic effects in mountainous areas or patterns of climate variables with small-scale spatial variability to be captured (Gerlitz et al., 2015; Daly et al., 1994).

High-resolution climate data can typically be produced using either regional climate models for dynamical downscaling (Giorgi et al., 2009), statistical downscaling methods using large-scale predictors of the small-scale state of the atmosphere (Maraun and Widmann, 2018), or topographic downscaling methods that mainly use terrain-based predictors to increase the spatial resolution of climate data (Karger et al., 2017; Fiddes and Gruber, 2014). Regional climate models have the advantage of representing the fundamental physical, chemical, and biological processes of the climate system. While this makes them powerful tools for studying future climates, it also makes them computationally expensive, as a result of which they cannot easily be applied at the global level (Giorgi et al., 2009; Sørland et al., 2021; Schär et al., 2019). Statistical downscaling methods are based on empirical relationships between large-scale predictors and small-scale predictands (Wilby et al., 1998). These relationships are typically derived from historical observations of predictors and predictands and then applied to downscale large-scale climate projections. While this is computationally less expensive, it implies out-of-sample applications of a statistical model, which may lead to physically implausible results (Maraun et al., 2017; Lanzante et al., 2018). Lastly, topographic downscaling methods primarily use terrain-based information to add small-scale details to large-scale inputs, such as the influence of mountain ranges on precipitation patterns (Roe, 2005). Examples of such methods include Climatologies at high resolution for the Earth's land surface areas (CHELSA) (Karger et al., 2017, 2021, 2020) and the Parameter-elevation Regressions on Independent Slopes Model (PRISM) (Daly et al., 1997, 1994). Considering the out-of-sample limitation of statistical downscaling, topographic downscaling of climate projections is less problematic in comparison, at similar computational cost. 
On the downside, topographic downscaling is based on mechanistic equations which, due to their simplicity, may still introduce biases in the climate data (Karger et al., 2017, 2021). In addition, those equations are unable to represent small-scale spatial patterns that are unrelated to topography, such as small-scale convective precipitation over flat terrain (Karger et al., 2021).

All approaches have historically been challenged by computational and storage limitations if carried out at the global level (Schär et al., 2019). For example, the latest global reanalysis dataset based on the dynamic land surface model “Hydrology in the Tiled ECMWF Scheme for Surface Exchanges over Land” (HTESSEL; Balsamo et al., 2009) is only available at a resolution of ∼ 9 km, which still masks important local climate variability (Muñoz-Sabater et al., 2021). For these reasons, climate datasets at high spatial and temporal resolution usually only exist at local to regional levels, which is adequate for analyses at these levels. However, there are no global products representing temperature, solar radiation, and precipitation at both high temporal (daily) and high spatial (∼ 1 km) resolution, although these would offer considerable benefits to climate impact modelling. For example, a consistent global dataset would allow regional hydrological models to be run at various locations using consistent climate driving data so that impacts can be integrated across regions (Huang et al., 2017; Krysanova and Hattermann, 2017). Likewise, global analyses that are strongly dependent on the resolution of the data could be carried out at much finer resolution than is currently the case. For example, Shi et al. (2021) calculated how aridity velocity affects a wide range of species using climate data at 0.5° resolution, yet this resolution neglects topographic details that matter because species might benefit from topographic diversity for surviving extreme climatic conditions (Barton et al., 2019).

To address this gap in data availability and to enable tests of how beneficial such global datasets would be, the objective of this paper is to present a global climate dataset at 30 arcsec and daily resolution: CHELSA-W5E5 v1.0 (Karger et al., 2022). This dataset builds upon WFDE5 over land merged with ERA5 over the ocean (W5E5) v1.0, an observational climate dataset that has been thoroughly evaluated and intensively used in climate impact modelling (Lange, 2019; Cucchi et al., 2020). CHELSA-W5E5 v1.0 is derived from W5E5 via topographic downscaling using the CHELSA V2 algorithm (Karger et al., 2017, 2020, 2021). Through a detailed evaluation of CHELSA-W5E5 v1.0, we aim to demonstrate the added value of a kilometre-scale resolution downscaling compared to the coarse-resolution (0.5°) W5E5 data. We focus on a set of key climatic variables that are highly relevant for climate impact modelling, namely daily minimum (tasmin, in units of K), mean (tas, K), and maximum (tasmax, K) near-surface (2 m) air temperature, which are, for example, relevant for assessing heat extremes (Huber et al., 2020); daily mean precipitation rate (pr, kg m−2 s−1), which is a crucial variable for example for hydrological and vegetation models (Müller Schmied et al., 2014; Chang et al., 2017); and daily mean surface downwelling shortwave radiation (rsds; W m−2), which is for example crucial for agricultural modelling (Ruane et al., 2015, 2021). The analyses and data build on earlier efforts to downscale precipitation (Karger et al., 2017, 2020, 2021), and we focus on assessing where the new dataset improves the estimate of a climate variable by moving to a spatial resolution of 30 arcsec and what caveats have to be kept in mind when applying the data for climate impact analyses.

Here we describe the CHELSA downscaling procedure applied to W5E5 and evaluate its performance in improving the accuracy of modelled air temperatures, precipitation rates, and downwelling shortwave solar radiation. We describe the input data and give a detailed account of the downscaling procedure, which covers near-surface air temperature (tas, tasmax, tasmin), surface downwelling shortwave radiation (rsds), and precipitation (pr). We evaluate our results using observations at meteorological stations and analyse the performance of the downscaling globally, regionally, and seasonally, as well as at the extremes, and additionally compare our results with dynamically downscaled data.

2 Material and methods

To downscale the coarse-resolution W5E5 data, we used the CHELSA V2 algorithm (Karger et al., 2017). This algorithm is a topographically informed, mechanistic downscaling method. It downscales 2 m air temperatures (tas, tasmax, tasmin) based on air temperature lapse rates in the lower atmosphere, precipitation rates (pr) using orographic terrain effects, and surface downwelling shortwave radiation (rsds) using a mechanistic terrain-based downscaling. In the following we describe the input data and the downscaling procedure, which is shown in Fig. 1, in more detail.

2.1 Input data

2.1.1 W5E5

WFDE5 over land merged with ERA5 over the ocean (W5E5) v1.0 (Lange, 2019) is the observational reference climate input dataset used in the Inter-Sectoral Impact Model Intercomparison Project phase 3 (ISIMIP3, 30 December 2021). It covers the years 1979–2016 for the entire globe. The data have daily temporal and 0.5° spatial resolution. W5E5 combines the Water and Global Change (WATCH) Forcing Data methodology applied to ERA5 reanalysis data (WFDE5) v1.0 (Cucchi et al., 2020) over land with data from the latest version of the European Reanalysis (ERA5) (Hersbach et al., 2020) over the ocean. In the following we briefly describe ERA5, WFDE5, and W5E5.

The ERA5 global reanalysis (Hersbach et al., 2020) is produced at the European Centre for Medium-Range Weather Forecasts (ECMWF) as part of the EU-funded Copernicus Climate Change Service (C3S). It is the successor of ERA Interim (Dee, 2011) and in comparison benefits from 10 years of developments of the underlying weather forecast model and data assimilation system. More observations are assimilated in ERA5 than in its predecessor ERA Interim, including stratospheric sulfate aerosols. In addition, ERA5 has higher temporal and spatial resolution (hourly and 0.25° compared to 3-hourly and 0.7°).

The WFDE5 meteorological forcing dataset is a bias-adjusted version of ERA5 that covers the global land surface at hourly temporal and 0.5° spatial resolution for selected near-surface atmospheric variables (air temperature, shortwave and longwave downwelling radiation, rainfall and snowfall, specific humidity, air pressure, and wind speed). Bias adjustments were applied according to the WATCH Forcing Data methodology (Weedon et al., 2014, 2011). That means that (i) monthly mean values of daily mean temperature and the diurnal temperature range were elevation-adjusted and bias-adjusted using version 4.03 of the Climate Research Unit gridded Time Series (CRU TS) (Harris et al., 2020); (ii) pressure, humidity, and longwave radiation were aligned with the adjusted temperature; (iii) monthly mean shortwave radiation was bias-adjusted using aerosol correction factors (Cucchi et al., 2020) and CRU TS4.03 cloud cover; and (iv) rainfall and snowfall rates were bias-adjusted with respect to the monthly number of wet days using CRU TS4.03 and monthly precipitation totals using observations from either CRU TS4.03 or data from the Global Precipitation Climatology Centre (GPCC) full data product version 2018 (Schneider et al., 2018), followed by a gauge-catch correction and a correction of the snowfall-to-rainfall ratio using the adjusted temperature (Cucchi et al., 2020). Using either CRU TS4.03 or GPCCv2018 precipitation totals, two different WFDE5 precipitation datasets were produced. The variant based on GPCCv2018 was used for W5E5. Wind speed is the only variable that was not adjusted.

Lastly, W5E5 combines WFDE5 data over land with ERA5 data over the ocean to cover the whole globe at daily temporal and 0.5° spatial resolution. Here we use daily total precipitation (pr) and daily mean downwelling shortwave radiation (rsds) as well as daily mean, minimum, and maximum near-surface air temperature (tas, tasmin and tasmax, respectively) from W5E5. The daily temperature values are equal to the daily mean (for tas), minimum (for tasmin), and maximum (for tasmax) of the hourly temperature values from WFDE5 over land and ERA5 aggregated to 0.5° spatial resolution over the ocean. Similarly, W5E5 pr (rsds) is equal to the daily sum (mean) of hourly total precipitation (shortwave radiation) from WFDE5 over land and ERA5 aggregated to 0.5° spatial resolution over the ocean, with the following exception: W5E5 pr over the ocean was bias-adjusted using monthly precipitation totals from version 2.3 of the Global Precipitation Climatology Project (Adler et al., 2003). Monthly rescaling factors used for this purpose were computed following the scale-selective rescaling procedure described by Balsamo et al. (2010).

2.1.2 Global Multi-resolution Terrain Elevation Data 2010 (GMTED2010)

The Global Multi-resolution Terrain Elevation Data 2010 (GMTED2010) (Danielson and Gesch, 2011) dataset contains elevation data for the globe collected from various sources at resolutions from 7.5 to 30 arcsec. We use the 30 arcsec version of the data that represents the mean elevation of all 7.5 arcsec grid cells.

2.1.3 Land–sea mask

The CHELSA downscaling algorithm only has an effect where topography varies in space. Over the ocean, the output of the downscaling is equivalent to a simple B-spline interpolation of the input data. To reduce the size of the high-resolution dataset, we therefore applied a land–sea mask that is intended to cut out the parts over the ocean that are not affected by topography. To make sure this mask actually covers all land masses, a cell of the 30 arcsec CHELSA-W5E5 grid is considered a land grid cell if it overlaps with any of the land polygons provided by the global, self-consistent, hierarchical, high-resolution shoreline database (GSHHG) v2.3.7 (Wessel and Smith, 1996); the 30 m spatial resolution global shoreline vector (GSV) (Sayre et al., 2019); and the MODIS-based Mosaic of Antarctica data (MOA) (Scambos et al., 2007). To ensure all land pixels are covered, we additionally added a buffer of 60 arcsec width to the boundaries of each land polygon.

2.2 Downscaling procedure

2.2.1 Downscaling of near-surface air temperature (tas, tasmax, tasmin)

The CHELSA downscaling algorithm was applied day by day. The downscaling of W5E5 air temperature (tas, tasmax, tasmin) was done by using a daily mean near-surface atmospheric temperature lapse rate, $\bar{\Gamma}$, derived from ERA5, combined with differences in surface altitude between GMTED2010 and W5E5. Here, $\bar{\Gamma}$ is the daily mean of hourly lapse rates, Γ, with

$$\Gamma = \frac{t_{850} - t_{950}}{z_{850} - z_{950}}, \tag{1}$$

where t850 and t950 are ERA5 hourly air temperatures at 850 and 950 hPa, respectively, and z850 and z950 are the geopotential heights of those pressure levels multiplied by the gravitational constant (9.80665 m s−2). We then interpolated W5E5 tas, tasmax, and tasmin from their original resolution of 0.5° to the 30 arcsec resolution of GMTED2010 using a B-spline interpolation (see Karger et al., 2021, for an example of how the B-spline interpolation is implemented), resulting in an interpolated high-resolution temperature surface, tc. To include the high-resolution topography, we first interpolated the 0.5° orography from W5E5 to 30 arcsec using a B-spline interpolation, this way creating a reference elevation grid, zc, that corresponds to tc. We then used $\bar{\Gamma}$ together with zc and zh, the GMTED2010 orography at 30 arcsec, to do the topographic downscaling of tc, according to

$$t_h = t_c + \bar{\Gamma}\,(z_h - z_c), \tag{2}$$

where th is the downscaled near-surface air temperature at 30 arcsec resolution, either being tas, tasmax, or tasmin.
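The lapse-rate correction of Eqs. (1) and (2) can be illustrated for a single grid cell. The following is a minimal sketch, not the CHELSA implementation: the function names and all numerical values are hypothetical, heights are assumed to be given in metres, and the B-spline interpolation that produces tc and zc is assumed to have been done already.

```python
from statistics import fmean


def hourly_lapse_rate(t850, t950, z850, z950):
    """Eq. (1): temperature difference over height difference (K per m)."""
    return (t850 - t950) / (z850 - z950)


def downscale_temperature(t_c, z_c, z_h, gamma_mean):
    """Eq. (2): shift the interpolated temperature t_c by the daily mean
    lapse rate times the difference between the fine (z_h) and the
    interpolated coarse (z_c) orography."""
    return t_c + gamma_mean * (z_h - z_c)


# Hypothetical hourly values for one day (K and m); 24 identical hours here.
hourly = [hourly_lapse_rate(281.0, 287.5, 1500.0, 500.0) for _ in range(24)]
gamma_mean = fmean(hourly)  # daily mean of the hourly lapse rates

# A fine grid cell lying 400 m above the interpolated coarse orography.
t_h = downscale_temperature(t_c=285.0, z_c=800.0, z_h=1200.0,
                            gamma_mean=gamma_mean)
```

With these illustrative numbers the lapse rate is −6.5 K per km, so the fine grid cell ends up 2.6 K colder than the interpolated coarse value.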

Figure 1. Schematic representation of the most important calculation steps (rhombi) and input and output data (rectangles) of the CHELSA-W5E5 downscaling. The temperature lapse rate, Γ, and the clear-sky solar radiation, rsdscs, are intermediate data that are not part of the published data. Only the most important equations are indicated. Spline indicates that a B-spline interpolation is used to change the spatial resolution to a higher target resolution. Mean indicates that the mean across grid cells is used. Proj. indicates that a reprojection to another geographic projection is performed. For the respective abbreviations, see the equations in the main text.


2.2.2 Downscaling of surface downwelling shortwave radiation (rsds)

Surface downwelling shortwave radiation at 30 arcsec resolution is strongly influenced by topographic features such as aspect or terrain shadows that are less pronounced at 0.5° resolution. The CHELSA downscaling algorithm combines such geometric effects with orographic effects on cloud cover for a topographic downscaling of rsds.

Geometric effects are considered by computing 30 arcsec clear-sky radiation estimates using the methods described in Böhner and Antonic (2009) as well as Wilson and Gallant (2000). This approach assumes that the net shortwave radiation, Sn, can be expressed as

$$S_n = S_s + S_h + S_t - S_r = (S_s + S_h + S_t)(1 - r), \tag{3}$$

with Sn being the sum of all direct solar radiation received from the sun, Ss; diffuse solar radiation received from the sky's hemisphere, Sh; and radiation by reflection of surrounding land surfaces, St; minus the radiation which is reflected off the surface, Sr. Alternatively, the reflected fraction of the incoming radiation can be expressed using the dimensionless surface albedo, r. This formula for Sn is strictly only valid for a horizontal, unobstructed surface. However, topography can severely influence net shortwave solar radiation by, for example, shading. A topographically corrected Sn, $S_n^*$, is given by

$$S_n^* = (S_s^* + S_h^* + S_t)(1 - r), \tag{4}$$

where $S_s^*$ and $S_h^*$ are direct and diffuse solar radiation modified by the surrounding topography of a given 30 arcsec grid cell, and St gives the reflection from surrounding land surfaces.

2.2.3 Direct solar radiation under clear-sky conditions

Topographic direct solar radiation Ss is calculated using


where θ is the sun elevation angle, φ is the sun azimuth, λ is the latitude, δ is the solar declination angle, J is the Julian day number, ϖ is the hour angle in degrees, and the value 12 − h is equal to the distance of the given mid-hour from the true solar noon (0.5, 1.5, and 2.5 h, etc.). The angle between a plane orthogonal to the sun's rays and the terrain (solar illumination angle, γ) is calculated at time steps of 15 min using

$$\cos\gamma = \cos\beta \sin\theta + \sin\beta \cos\theta \cos(\varphi - \alpha), \tag{9}$$

where β and α are surface slope and aspect, respectively, calculated from the high-resolution orography, zh, and θ and φ define the sun position in the sky. Shadowing from topography is calculated using the horizon angle, ϕ, which is defined as the maximum angle toward any other point in a given azimuth within 10 000 m horizontal distance,

$$\phi = \max_{d \le 10\,000\ \mathrm{m}} \arctan\!\left(\frac{\Delta z(d)}{d}\right), \tag{10}$$

where d is the distance to the point with higher elevation, and Δz(d) is the associated elevation difference. Topographic direct radiation at hour h, $S_s^*(h)$, is then calculated using

$$S_s^*(h) = \varsigma(h)\,\frac{S_s(h)}{\sin\theta}\,\cos\gamma, \tag{11}$$

where ς(h) indicates if a terrain shadow is present (with ς(h)=0 representing shadow and ς(h)=1 representing no shadow), depending on h and the horizon angle; Ss(h) is the direct solar radiation on an unobstructed horizontal surface at hour h; and θ and γ also depend on h via their dependence on ϖ. The inclusion of the effect of terrain angle is done by the division using sin θ that tilts the horizontal surface to a surface that is orthogonal to the sun's rays. Multiplication by cos γ accounts for terrain. Ss(h) also depends on the structure and the composition of the atmosphere. We assume a homogenous atmosphere with a transmissivity τ of 80 % and then calculate Ss(h) following Wilson and Gallant (2000) using

$$S_s(h) = \sin\theta \; G_{SC} \, \tau^{m}, \tag{12}$$

where GSC is the solar constant defined at 1367 W m−2, and m is the optical air mass, i.e. the length of the atmospheric path traversed by the sun's rays (List, 1968). For a sun elevation angle θ > 30°, m is calculated following Linacre (1992) using

$$m = \frac{1}{\cos(90^\circ - \theta)}, \tag{13}$$

and for θ ≤ 30° the optical air mass m is determined from a vector of known values, M, after List (1968, p. 422), given in increments of 1°, where


Then m is calculated using element i in M, where i is the position of θ in M, by

$$m = M_i + (\theta - i)\,(M_{i+1} - M_i). \tag{14}$$

Daily mean topographic direct radiation, $S_s^*$, is obtained via integration over all 15 min time steps of the day,

$$S_s^* = \frac{1}{n}\sum_{h=1}^{n} S_s^*(h) = \frac{1}{n}\sum_{h=1}^{n} \varsigma(h)\,\frac{S_s(h)}{\sin\theta}\,\cos\gamma, \tag{15}$$

where n denotes the number of 15 min intervals of the day.
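The clear-sky direct radiation chain of Eqs. (9) and (11)–(13) can be sketched for a single time step as follows. This is an illustration under stated assumptions rather than the published code: function names are hypothetical, only the air-mass branch for θ > 30° is covered because the tabulated values after List (1968) are not reproduced here, and negative values of cos γ (terrain facing away from the sun) are clipped to zero, which is an assumption of this sketch.

```python
import math

G_SC = 1367.0  # solar constant in W m^-2
TAU = 0.8      # assumed atmospheric transmissivity (80 %)


def air_mass(theta_deg):
    """Eq. (13): optical air mass for sun elevation angles above 30 degrees."""
    if theta_deg <= 30.0:
        raise ValueError("tabulated values are needed for theta <= 30 degrees")
    return 1.0 / math.cos(math.radians(90.0 - theta_deg))


def direct_horizontal(theta_deg):
    """Eq. (12): direct radiation on an unobstructed horizontal surface."""
    return math.sin(math.radians(theta_deg)) * G_SC * TAU ** air_mass(theta_deg)


def cos_illumination(theta_deg, azimuth_deg, slope_deg, aspect_deg):
    """Eq. (9): cosine of the solar illumination angle gamma."""
    theta, phi = math.radians(theta_deg), math.radians(azimuth_deg)
    beta, alpha = math.radians(slope_deg), math.radians(aspect_deg)
    return (math.cos(beta) * math.sin(theta)
            + math.sin(beta) * math.cos(theta) * math.cos(phi - alpha))


def direct_topographic(theta_deg, azimuth_deg, slope_deg, aspect_deg, shadow_free):
    """Eq. (11): divide by sin(theta), multiply by cos(gamma), apply shadow flag."""
    # Clipping at zero is an assumption of this sketch (self-shaded slopes).
    cos_gamma = max(cos_illumination(theta_deg, azimuth_deg, slope_deg, aspect_deg), 0.0)
    shadow_flag = 1.0 if shadow_free else 0.0
    return (shadow_flag * direct_horizontal(theta_deg)
            / math.sin(math.radians(theta_deg)) * cos_gamma)
```

On flat terrain cos γ equals sin θ, so the topographic value reduces to the horizontal one; a slope facing the sun receives more, and a shadowed cell receives none. The daily mean of Eq. (15) would then be the average of `direct_topographic` over all 15 min time steps.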

2.2.4 Diffuse solar radiation under clear-sky conditions

Topographically corrected diffuse radiation $S_h^*$ is calculated by quantifying how much of the sky is visible from a grid cell, using

$$S_h^* = S_h \, \Psi_s, \tag{16}$$

where Ψs is based on the horizon angles ϕi in different azimuth directions Φi of the full circle originating in a focal grid cell, and Sh is the diffuse solar radiation calculated using

$$S_h = (0.271 - 0.294\,\tau^{m})\; G_{SC}\, \Psi_s, \tag{17}$$

where GSC is the solar constant defined at 1367 W m−2, and Ψs is the sky view factor defined as

$$\Psi_s = \frac{1}{N}\sum_{i=1}^{N}\left[\cos\beta \cos\phi_i + \sin\beta \cos(\Phi_i - \alpha)\,(90 - \phi_i - \sin\phi_i \cos\phi_i)\right], \tag{18}$$

with N=8 uniformly distributed directions used for an approximation of the topographic effect.

2.2.5 Shortwave downwelling solar radiation under cloudy conditions

To calculate rsds under cloudy conditions, we calculated surface cloud area fraction (clt) from atmospheric cloud fractions cl at pressure levels z from ERA5. We first calculated the windward leeward index H using the u and v wind components from ERA5 following the methods described in Karger et al. (2021). To distinguish clouds that are influenced by orography from clouds in the free atmosphere, we first adjusted the windward leeward index relative to the number of pressure levels used, so that the windward leeward index is stronger at lower pressure levels than at pressure levels that are no longer influenced by the orography. For each of the n pressure levels i we calculated the corrected windward leeward index $H_i^{\mathrm{cor1}}$ using

$$H_i^{\mathrm{cor1}} = H_i + (1 - H_i)\,\frac{i}{n - 1}. \tag{19}$$

However, this gives the highest orographic effect directly at the surface altitude z, where cloud formation is often not yet possible. We therefore additionally corrected the windward leeward index by the distance between the altitude zi of pressure level i and the cloud base height (cbh), which was B-spline-interpolated to 30 arcsec resolution, using

$$H_i^{\mathrm{cor2}} = H_i^{\mathrm{cor1}} - (1 - H_i^{\mathrm{cor1}})\,\frac{z_i - \mathrm{cbh}}{\mathrm{cbh}}. \tag{20}$$

The cloud area fraction on each pressure level i at 30 arcsec resolution, $cl_i^h$, is then given by the product of the corrected windward leeward index and a horizontal spline interpolation, S, of the coarse-grid cloud fraction $cl_i^c$:

$$cl_i^h = H_i^{\mathrm{cor2}} \, S(cl_i^c). \tag{21}$$

Cloud area fraction at the ground level then follows the maximum overlap assumption so that

$$\mathrm{clt} = \max\left(cl_1^h, \ldots, cl_n^h\right). \tag{22}$$

To include surface cloud area fraction clt in rsds we used the parametrization from Kasten and Czeplak (1980):

$$\mathrm{rsds} = S_n^* \left(1 - 0.75\,\mathrm{clt}^{3.4}\right). \tag{23}$$
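The last two steps, the maximum overlap assumption of Eq. (22) and the Kasten and Czeplak (1980) parametrization of Eq. (23), can be sketched as follows. Function names and the example values are hypothetical, and the cloud fraction is assumed to be expressed on a 0–1 scale.

```python
def total_cloud_fraction(level_fractions):
    """Eq. (22): maximum overlap, i.e. the largest fraction over all levels."""
    return max(level_fractions)


def cloudy_sky_radiation(clear_sky, clt):
    """Eq. (23): attenuate clear-sky radiation by the cloud area fraction."""
    if not 0.0 <= clt <= 1.0:
        raise ValueError("cloud area fraction must lie between 0 and 1")
    return clear_sky * (1.0 - 0.75 * clt ** 3.4)


# Hypothetical cloud fractions on three pressure levels of one grid cell.
clt = total_cloud_fraction([0.1, 0.6, 0.3])
rsds = cloudy_sky_radiation(clear_sky=300.0, clt=clt)
```

A cloud-free cell keeps its clear-sky value, whereas a fully overcast cell retains one quarter of it; the exponent of 3.4 means that small cloud fractions reduce the radiation only slightly.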

2.2.6 Downscaling of precipitation (pr)

The downscaling method for precipitation mostly follows that of Karger et al. (2021) but does not include the cloud cover correction based on satellite observations as those are not available for all years. We used the zonal and meridional wind components as well as the height of the planetary boundary layer to calculate the windward leeward index H. H, together with the height of the boundary layer following Karger et al. (2021), was used for a first approximation of the orographic precipitation intensity, pi, for the 30 arcsec resolution grid cell i. We then used a linear relationship between the input precipitation rate from W5E5, prW5E5, and pi to compute the downscaled precipitation of grid cell i, pri, according to

$$pr_i = \frac{p_i}{\frac{1}{n}\sum_{i=1}^{n} p_i}\; pr_{\mathrm{W5E5}}, \tag{24}$$

where n equals the number of 30 arcsec grid cells that fall within a 0.5° grid cell. This equation ensures that the data are to scale; i.e. the precipitation flux at 0.5° resolution is preserved. More details on the exact parametrization of the downscaling algorithm for precipitation are given in Karger et al. (2021).
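The flux-preserving rescaling of Eq. (24) can be illustrated for the fine grid cells inside one coarse grid cell. This is a minimal sketch with hypothetical names and values; the fallback for the case in which all intensities are zero is an assumption of this sketch and is not described in the text.

```python
from statistics import fmean


def downscale_precipitation(intensities, pr_coarse):
    """Eq. (24): distribute the coarse precipitation rate over the fine grid
    cells in proportion to their orographic precipitation intensity p_i."""
    mean_intensity = fmean(intensities)
    if mean_intensity == 0.0:
        # Assumption of this sketch: fall back to a uniform distribution.
        return [pr_coarse] * len(intensities)
    return [p_i / mean_intensity * pr_coarse for p_i in intensities]


# Four hypothetical fine grid cells inside one coarse cell.
pr_fine = downscale_precipitation([0.5, 1.0, 1.5, 1.0], pr_coarse=4.0)
```

Because every fine value is divided by the mean intensity, the mean of `pr_fine` equals `pr_coarse`, which is the sense in which the coarse-resolution precipitation flux is preserved.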

3 Evaluation

The evaluation of the downscaling from low (0.5°) to high (30 arcsec) resolution follows the evaluation approach outlined in Karger et al. (2021) and compares measurements at meteorological stations with data from both the low and the high spatial resolution. Since many observations at stations are already included in the W5E5 data due to the bias correction applied, we do not evaluate only the agreement with the actual measurements at the stations but focus on the difference between evaluation metrics achieved by the 0.5° data and the downscaled data. This directly evaluates the downscaling but not the forcing of the downscaling (see Karger et al., 2021). We use two observational datasets, GHCN-D (Global Historical Climatology Network Daily) and GEBA (Global Energy Balance Archive), as references for the evaluation. The evaluation is performed for daily values, seasonal values, and long-term climatological normals. The comparison to the station data is global, whereas the comparison to the dynamically downscaled data is constrained to the United States, where model output as well as a dense network of observational station data is available.

3.1 Evaluation datasets

To evaluate the performance of the downscaling algorithm we compute several test statistics at the original 0.5° resolution of the W5E5 data and the downscaled data at 30 arcsec from CHELSA-W5E5. We use observations at meteorological stations (Table 1) and compare those to W5E5 and CHELSA-W5E5 data from the corresponding 0.5° and 30 arcsec grid cells, respectively, to assess the value added by the downscaling.

Table 1. Overview of the datasets used for evaluation, the variables contained, their temporal resolution, and the number of stations used for the evaluation. tas is the daily mean 2 m air temperature, pr the daily mean precipitation, tasmax the daily maximum 2 m air temperature, tasmin the daily minimum 2 m air temperature, and rsds the shortwave downwelling radiation.


3.1.1 GHCN-D

For the evaluation of 2 m air temperatures and precipitation rates, we used observations at meteorological stations from the Global Historical Climatology Network Daily (GHCN-D) network. This dataset contains meteorological-station-based measurements from global land areas. About two-thirds of the observations are precipitation measurements only (Menne et al., 2018).

3.1.2 GEBA

The station data of the GHCN-D network do not include energy flux variables. Thus, for the validation of shortwave downwelling radiation, we used the Global Energy Balance Archive (GEBA). This database is maintained by the Institute for Climate and Atmospheric Sciences (IAC) at ETH Zurich and consists of globally measured energy fluxes at the Earth's surface (Wild et al., 2017). Its first version was implemented in 1988; it has continuously been updated ever since and mainly been improved in terms of data availability, data access, and internet appearance (Wild et al., 2017). GEBA provides observations for 15 surface energy flux components. Shortwave radiation incident at the Earth's surface (global radiation) is the most widely measured quantity available in GEBA. The various observations have been compiled to monthly mean surface energy flux data from various sources.

3.2 Evaluation using observations at meteorological stations

To show the improvement resulting from the downscaling from 0.5° to 30 arcsec we compared each variable from both CHELSA-W5E5 and W5E5 to observations from meteorological stations (Table 1). For each meteorological station, the value of the grid cell that contains the location of the station was extracted and evaluated using several evaluation metrics.

3.2.1 Evaluation metrics

Evaluation metrics include the bias, correlation coefficient, root mean squared error, and mean absolute error. The correlation is calculated based on Pearson's correlation coefficient,

$$r = \frac{\mathrm{cov}(x_{\mathrm{sim}}, x_{\mathrm{obs}})}{\sigma(x_{\mathrm{sim}})\,\sigma(x_{\mathrm{obs}})}, \tag{25}$$

where xobs represents the observed time series at a meteorological station, xsim the downscaled time series, cov the covariance, and σ the standard deviation. The root mean squared error (RMSE) is defined as

$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(x_{\mathrm{sim},i} - x_{\mathrm{obs},i}\right)^{2}}, \tag{26}$$

where n is the number of time steps of a time series. Furthermore, the mean absolute error (mae) was computed according to

$$\mathrm{mae} = \frac{1}{n}\sum_{i=1}^{n}\left|x_{\mathrm{sim},i} - x_{\mathrm{obs},i}\right|. \tag{27}$$

Finally, the relative bias was computed to investigate the average amount by which the observations are greater than the estimates of the model output data based on different resolutions by

$$\mathrm{bias} = \frac{1}{n}\sum_{i=1}^{n}\left(x_{\mathrm{obs},i} - x_{\mathrm{sim},i}\right). \tag{28}$$
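The four evaluation metrics can be written compactly for a pair of time series. The sketch below uses only the Python standard library; the function names are hypothetical, population (1/n) statistics are assumed for the covariance and standard deviation, and the bias is taken as the mean of observed minus simulated values, following the description above.

```python
import math
from statistics import fmean


def pearson_r(sim, obs):
    """Eq. (25): covariance divided by the product of standard deviations."""
    mean_sim, mean_obs = fmean(sim), fmean(obs)
    cov = fmean([(s - mean_sim) * (o - mean_obs) for s, o in zip(sim, obs)])
    sd_sim = math.sqrt(fmean([(s - mean_sim) ** 2 for s in sim]))
    sd_obs = math.sqrt(fmean([(o - mean_obs) ** 2 for o in obs]))
    return cov / (sd_sim * sd_obs)


def rmse(sim, obs):
    """Eq. (26): root of the mean squared difference."""
    return math.sqrt(fmean([(s - o) ** 2 for s, o in zip(sim, obs)]))


def mae(sim, obs):
    """Eq. (27): mean absolute difference."""
    return fmean([abs(s - o) for s, o in zip(sim, obs)])


def bias(sim, obs):
    """Eq. (28): mean of observed minus simulated values."""
    return fmean([o - s for s, o in zip(sim, obs)])


# Hypothetical series: the simulation is 1 unit too low on every day.
sim = [1.0, 2.0, 3.0, 4.0]
obs = [2.0, 3.0, 4.0, 5.0]
metrics = (pearson_r(sim, obs), rmse(sim, obs), mae(sim, obs), bias(sim, obs))
```

The example shows why several metrics are reported together: a constant offset leaves the correlation at 1 while the RMSE, mae, and bias all equal the size of the offset.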

3.2.2 Seasonal performance

To investigate if the downscaling has a similar performance throughout the year, in a first step we aggregated the daily or, in the case of rsds, the monthly data to seasonal values; for example, winter is December, January, and February; spring is March, April, and May; summer is June, July, and August; and autumn is September, October, and November. Based on the seasonally aggregated means, Taylor diagrams were used to show the performance improvements based on correlations, standard deviation, and root mean squared error. Additionally, we calculated the Pearson correlation coefficient and the absolute bias between daily modelled values of either CHELSA-W5E5 or W5E5 and daily observations from GHCN-D and aggregated them to monthly means to assess possible trends in these two performance metrics over time. In the case of rsds, monthly means instead of daily values were used.
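The seasonal aggregation described above can be sketched as a simple grouping by calendar month. The names below are hypothetical, and the input is assumed to be a sequence of (month, value) pairs for one station or grid cell.

```python
from collections import defaultdict
from statistics import fmean

# Season definition as given in the text.
SEASON_OF_MONTH = {
    12: "winter", 1: "winter", 2: "winter",
    3: "spring", 4: "spring", 5: "spring",
    6: "summer", 7: "summer", 8: "summer",
    9: "autumn", 10: "autumn", 11: "autumn",
}


def seasonal_means(records):
    """Average (month, value) pairs into one mean value per season."""
    groups = defaultdict(list)
    for month, value in records:
        groups[SEASON_OF_MONTH[month]].append(value)
    return {season: fmean(values) for season, values in groups.items()}


# Hypothetical temperatures in K for a few winter and summer months.
example = seasonal_means(
    [(12, 270.0), (1, 268.0), (2, 272.0), (7, 291.0), (8, 293.0)]
)
```

Seasons without any input records are simply absent from the result, so the caller can tell missing data apart from a computed mean.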

3.2.3 Global and regional performance

Further comparisons of W5E5 and CHELSA-W5E5 with observations from meteorological stations were done at daily resolution (monthly resolution in the case of rsds), as well as for long-term climatological normals. Additional analyses were carried out for North America (except for rsds), where both the density of meteorological stations and their quality are high. Both globally and for North America, several evaluation metrics were calculated (see Sect. 3.2.1). The main focus was on the difference in bias between CHELSA-W5E5 and W5E5 as this difference is an indicator of the value added by the downscaling algorithm.

To compare the performance spatially, we calculated the Pearson correlation between daily values from either CHELSA-W5E5 or W5E5 and the corresponding GHCN-D observations for all meteorological stations globally. We then calculated the difference in the Pearson correlation coefficient between the two datasets at each station and took the mean over all stations within each 0.5° grid cell.
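The spatial aggregation of correlation differences might look as follows. This is a sketch under stated assumptions: the function name, the tuple layout of the station records, and the alignment of the 0.5° cells to the coordinate origin are all hypothetical choices, and the difference is taken as CHELSA-W5E5 minus W5E5 so that positive values mean a higher correlation after downscaling.

```python
import math
from collections import defaultdict


def mean_r_difference_per_cell(stations, cell_size=0.5):
    """Average per-station correlation differences within regular grid cells.

    stations: iterable of (lon, lat, r_chelsa, r_w5e5) tuples.
    Returns a dict mapping (column, row) cell indices to the mean difference.
    """
    per_cell = defaultdict(list)
    for lon, lat, r_chelsa, r_w5e5 in stations:
        # Integer cell index; floor keeps negative coordinates in the right cell.
        key = (math.floor(lon / cell_size), math.floor(lat / cell_size))
        per_cell[key].append(r_chelsa - r_w5e5)
    return {key: sum(diffs) / len(diffs) for key, diffs in per_cell.items()}


# Hypothetical stations: two share one cell, the third lies in another.
cells = mean_r_difference_per_cell([
    (8.1, 47.2, 0.95, 0.90),
    (8.3, 47.4, 0.97, 0.90),
    (-105.6, 40.1, 0.80, 0.85),
])
```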

3.2.4 Evaluation at the extremes

To evaluate the performance of the downscaling at the extremes of the temperatures and precipitation rates, we defined extreme values based on quantiles over the entire time period 1979–2016. For extreme high temperatures we used the 95th percentile of tasmax. Extreme precipitation rates were defined as the 95th percentile precipitation rates on wet days (days with pr greater than 0.1 kg m⁻² d⁻¹), and for extreme cold days we used the 5th percentile of tasmin.
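The three extreme-value thresholds can be sketched as below. The percentile interpolation rule is not stated in the text, so the linear interpolation used here is an assumption, as are the function names and the example values; only the percentile levels and the wet-day threshold are taken from the description above.

```python
WET_DAY_THRESHOLD = 0.1  # kg m-2 d-1, as given in the text


def percentile(values, q):
    """Percentile q (0 to 100) with linear interpolation between ranks."""
    data = sorted(values)
    if not data:
        raise ValueError("percentile of an empty series is undefined")
    position = (len(data) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(data) - 1)
    fraction = position - lower
    return data[lower] + (data[upper] - data[lower]) * fraction


def extreme_thresholds(tasmax, tasmin, pr):
    """Thresholds for hot days, cold days and extreme wet-day precipitation."""
    wet_days = [p for p in pr if p > WET_DAY_THRESHOLD]  # strictly greater than
    return {
        "hot": percentile(tasmax, 95),
        "cold": percentile(tasmin, 5),
        "wet": percentile(wet_days, 95),
    }


# Hypothetical series; the dry days (pr <= 0.1) do not enter the wet-day percentile.
thresholds = extreme_thresholds(
    tasmax=list(range(101)),
    tasmin=list(range(101)),
    pr=[0.0, 0.05, 0.1, 1.0, 2.0, 3.0],
)
```

Restricting the precipitation percentile to wet days keeps the threshold from being pulled towards zero in dry climates, where most days carry no precipitation at all.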

3.2.5 Comparison with dynamically downscaled data

To compare the terrain-based downscaling to a more complex and computationally demanding dynamical downscaling, our evaluation includes a comparison with a simulation of the Weather Research and Forecasting model (WRF) (Skamarock et al., 2019) for the historical climate of North America (Rasmussen and Liu, 2017). The simulation was performed over a 13-year period (October 2000–September 2013) with boundary conditions from ERA Interim, at a spatial resolution of 4 km. The comparison between WRF and CHELSA-W5E5 was conducted for the variables tas and pr.

4 Results

4.1 Evaluation using observations at meteorological stations

4.1.1 Seasonal performance

The correlation of both datasets with observations at meteorological stations is very high overall (r>0.9) for all variables globally as well as for North America except for daily pr. In general, the downscaling decreased the bias, RMSE, and mae and increased the correlation for all variables except rsds (Fig. 2, Table 2). There is no obvious deviation during any of the four seasons for tas, tasmax, or tasmin, and the downscaling seems to perform equally well (Fig. 2). For pr the performance of both W5E5 and CHELSA-W5E5 is slightly higher during the northern winter months, while for rsds it is higher during northern spring and summer (Fig. 2).

Figure 2. Seasonal performance based on a comparison of global long-term seasonal normals (1979–2016) of the global topographically downscaled high-resolution (30 arcsec, i.e. ∼1 km) data (CHELSA-W5E5, orange) and the coarse (0.5°) original data (W5E5, violet) with station observations for daily mean 2 m air temperature (tas), daily minimum 2 m air temperature (tasmin), daily maximum 2 m air temperature (tasmax), precipitation (pr), and shortwave downwelling radiation (rsds), based on monthly aggregated data. Values are shown separately for the four seasons: winter (DJF), spring (MAM), summer (JJA), and autumn (SON). For the variables tas, tasmin, tasmax, and pr, the observational dataset GHCN-D was used for comparison. For rsds, the GEBA dataset was used.


4.1.2 Temporal performance

Neither CHELSA-W5E5 nor W5E5 shows any significant trend in performance when compared with observations at meteorological stations from GHCN-D (for tas, tasmin, tasmax, pr) or GEBA (for rsds) globally (Fig. 3). In general, the downscaled data show a slightly higher Pearson correlation coefficient r with observations than the coarse-resolution W5E5 data, except for rsds. The overall pattern in r is also similar between CHELSA-W5E5 and W5E5 for all variables. The absolute bias is more variable than r: it is generally lower in the downscaled data, with similar patterns, for pr, tas, and rsds (Fig. 3); it shows a mixed pattern for tasmax, with a higher absolute bias in the 1980s and after 2000 and an otherwise lower absolute bias; and it is higher throughout for tasmin.

Figure 3. Mean daily Pearson correlation r and absolute bias between GHCN-D stations and downscaled CHELSA-W5E5 (orange), as well as W5E5 (purple) calculated for each month from 1979–2016 separately for daily mean 2 m air temperature (tas), daily minimum 2 m air temperature (tasmin), daily maximum 2 m air temperature (tasmax), precipitation (pr), and shortwave downwelling radiation (rsds) globally.


4.1.3 Global and regional performance

For tas, tasmax, and pr, all error metrics (bias, mae, RMSE) decrease after downscaling, and the correlation coefficient increases (Fig. 4, Table 2). For rsds the bias is substantially reduced in the downscaled data, but the correlation coefficient is also slightly reduced (Fig. 4, Table 2). This combination of a lower correlation and a smaller bias seems to be driven by a systematic deviation of the downscaled rsds in areas with high rsds (Fig. 2). For tasmin, the pattern is opposite to rsds; i.e. the correlation coefficient increases after downscaling, but the bias increases (Table 2). This pattern for tasmin and rsds is even more pronounced when only stations in North America are used (Table 3). The reduction in bias and increase in correlations of air temperatures due to the downscaling to 30 arcsec are highest in topographically heterogeneous terrain (Fig. 4), such as the western parts of North America, whereas the topographic downscaling hardly added value in flat terrain (Fig. 5). For precipitation, the bias reduction and the increase in correlation are also highest in topographically complex terrain globally (Fig. 4) but are considerable in flat terrain as well (Figs. 4, 5).

In regions with high-quality meteorological stations, such as the continental United States, the strong reduction in bias after downscaling in topographically complex terrain is also visible for tas, tasmax, and tasmin (Fig. 6). For tasmin, in the middle of the Rocky Mountains, the bias in the downscaled data is significantly higher than for tas and tasmax, both of which show less bias in the downscaled data in this region. tasmax and tasmin both show higher bias in the downscaled data over flat terrain. For pr, the patterns are similar to those for air temperatures, except that the bias is often lower over flat terrain (Fig. 6).

Figure 4. Mean differences in Pearson's correlation coefficient r with daily observations at meteorological stations between CHELSA-W5E5 and W5E5 over the period 1979–2016. Negative values (violet) indicate areas in which the correlation with observations decreases after downscaling, while positive values (green) indicate areas in which it increases. Observations are based on GHCN-D for daily mean 2 m air temperature (tas), daily minimum 2 m air temperature (tasmin), daily maximum 2 m air temperature (tasmax), precipitation (pr), and GEBA for shortwave downwelling radiation (rsds).

Table 2. Statistical scores from the comparison between CHELSA-W5E5 and W5E5, with observations from meteorological stations for all five variables (tas is the daily mean 2 m air temperature, pr the daily mean precipitation, tasmax the daily maximum 2 m air temperature, tasmin the daily minimum 2 m air temperature, and rsds the shortwave downwelling radiation) globally. Temp. res. is the temporal resolution, bias the bias between a modelled value and a measurement at a specific time step (temp. res.) at a specific station, sd_bias the standard deviation in bias, bias_re the reduction in bias (positive values indicate an increased performance), sd_bias_re the standard deviation in bias reduction, r the Pearson correlation coefficient, mae the mean absolute error, and RMSE the root mean squared error. Normals were calculated by averaging values over the entire observation period of a station between 1979–2016. Bias, sd_bias, bias_re, sd_bias_re, r, mae, and RMSE are based on comparisons of measurements between CHELSA-W5E5, W5E5, and observations at each station at each respective time step (temp. res.). Bold values in bias_re indicate an increase in performance due to the downscaling.


Figure 5. Scatter plots comparing long-term mean observations from GHCN-D with values from W5E5 (left column, before downscaling) and CHELSA-W5E5 (right column, after downscaling). Each point represents the mean of all observations at a specific GHCN-D station in the period 1979–2016, except for downwelling shortwave solar radiation, where each point represents a specific month.


Table 3. Statistical scores from the comparison between the two simulated datasets CHELSA-W5E5 and W5E5 and observations from GHCN-D stations in North America for all five variables (tas is the daily mean 2 m air temperature, pr the daily mean precipitation, tasmax the daily maximum 2 m air temperature, tasmin the daily minimum 2 m air temperature, and rsds the shortwave downwelling radiation). Temp. res. is the temporal resolution, bias the bias between a modelled value and a measurement at a specific time step (temp. res.) at a specific station, sd_bias the standard deviation in bias, bias_re the reduction in bias (positive values indicate an increased performance), sd_bias_re the standard deviation in bias reduction, r the Pearson correlation coefficient, mae the mean absolute error, and RMSE the root mean squared error. Normals were calculated by averaging values over the entire observation period of a station between 1979–2016. Bias, sd_bias, bias_re, sd_bias_re, r, mae, and RMSE are based on comparisons of measurements between CHELSA-W5E5, W5E5, and observations at each station at each respective time step (temp. res.).


Figure 6. Mean bias of daily 2 m air temperatures and daily mean precipitation rates (from top to bottom) in North America averaged over the entire observational period of each station between 1979–2016. Left: bias between W5E5 and observations at GHCN-D meteorological stations. Middle: bias between the downscaled CHELSA-W5E5 and observations at GHCN-D meteorological stations. Right: bias reduction at each of the stations as a result of the downscaling, i.e. the changes in absolute bias between the 0.5° W5E5 and 30 arcsec CHELSA-W5E5, with negative values indicating a bias reduction and positive values indicating an increase in bias. The diameter of each dot scales with the absolute bias.

4.1.4 Extreme temperatures and precipitation

For extreme values such as the 95th percentile of daily maximum 2 m air temperature and the 5th percentile of daily minimum 2 m air temperature, the bias reduction is again strongest in topographically complex terrain (Fig. 7a, b). For extreme precipitation, the bias reduction is spatially not as coherent as for air temperature extremes, and the bias can even increase with the downscaling. Generally, the downscaling shows a higher bias reduction in topographically complex terrain, while in flat terrain the downscaling actually introduces a bias in the extremes (Fig. 7c).

Figure 7. Mean bias of the extremes in maximum and minimum daily 2 m air temperatures and precipitation rates (from top to bottom) in North America averaged over the entire observational period of each station between 1979–2016. Left: mean bias between W5E5 and observations from GHCN-D meteorological stations for extreme values in air temperature and precipitation. Middle: bias between CHELSA-W5E5 and observations from GHCN-D meteorological stations for extreme values in air temperature and precipitation. Right: absolute bias reduction after downscaling from 0.5° to 30 arcsec for extreme values in air temperature and precipitation, defined as the difference between the absolute bias of W5E5 and the absolute bias of CHELSA-W5E5.
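The absolute bias reduction defined in the caption of Fig. 7 can be written as a one-line function. The sketch below is illustrative only: the function name and the station values are hypothetical. With this definition a positive result means that the downscaled data are closer to the observations, regardless of the sign of either bias.

```python
def absolute_bias_reduction(bias_w5e5, bias_chelsa):
    """Absolute bias of the coarse data minus absolute bias of the downscaled data."""
    return abs(bias_w5e5) - abs(bias_chelsa)


# Hypothetical station A: the bias shrinks from -2.0 to 0.5, an improvement of 1.5.
improved = absolute_bias_reduction(-2.0, 0.5)

# Hypothetical station B: the bias grows from 0.3 to -0.8, a deterioration of 0.5.
worsened = absolute_bias_reduction(0.3, -0.8)
```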

4.2 Comparison with dynamically downscaled data

Both the downscaled air temperatures and the precipitation rates from CHELSA-W5E5 and WRF show relatively high congruence with observations at meteorological stations (Fig. 8). Overall, correlations are higher and biases are lower for CHELSA-W5E5 than for WRF when both models are compared to observations at GHCN-D stations over the same observational period (Table 4, Fig. 8). Correlations are nearly identical for air temperatures but slightly higher for CHELSA-W5E5 than for WRF for precipitation (Fig. 8).

Table 4. Statistical scores from the comparison of CHELSA-W5E5 and WRF with observations from GHCN-D stations in North America for daily mean 2 m air temperature (tas) and daily mean precipitation (pr). Temp. res. is the temporal resolution, bias the bias between a modelled value and a measurement at a specific time step (temp. res.) at a specific station, sd_bias the standard deviation in bias, r the Pearson correlation coefficient, mae the mean absolute error, and RMSE the root mean squared error. Normals were calculated by averaging values over the entire observation period of a station between 1979–2016. Bias, sd_bias, r, mae, and RMSE are based on comparisons of measurements between CHELSA-W5E5, WRF, and observations at each station at each respective time step (temp. res.).


Figure 8. Performance based on a comparison of global long-term monthly means of the topographically downscaled high-resolution (∼1 km) data (CHELSA-W5E5, orange) with dynamically downscaled high-resolution (4 km) data (WRF, green) over North America, for the climatic variables daily mean 2 m air temperature and daily mean precipitation. The long-term means are shown separately for the four seasons: winter (DJF), spring (MAM), summer (JJA), and autumn (SON).


5 Discussion

This paper shows that the CHELSA downscaling procedure generally increases the accuracy of the modelled air temperatures, precipitation rates, and downwelling shortwave solar radiation. While correlations between simulated and observed variables in the coarse 0.5° resolution W5E5 data are already generally greater than 0.9, the downscaling increases this correlation further and decreases the bias and errors of the data in most cases. Notable exceptions are tasmin, where the increase in correlation comes with an increase in the bias of the downscaled data, and rsds, where the reduction in bias comes with a decrease in the correlation with observations, specifically for high values of rsds.

No significant temporal trends are visible in the two performance indicators (Pearson's r, absolute bias). For the correlations with observations at meteorological stations, both the coarse and the downscaled data show similar trends. This can be mainly attributed to the already good fit between the coarse data and the observations. Additionally, as the downscaling does not change the temporal pattern, a similar correlation over time is expected. The absolute bias, however, shows deviations between the downscaled and the coarse-resolution data. While the bias for pr, tas, and rsds is generally lower in the downscaled data, tasmax shows a varying difference in bias over time and tasmin a generally higher bias in the downscaled data. This pattern might be attributed to the fact that mean daily temperature lapse rates are applied equally to all air temperatures (tas, tasmax, and tasmin), but under extreme conditions (tasmax and tasmin) these lapse rates do not necessarily reflect the observed conditions.

5.1 Air temperatures

The downscaling of the different air temperatures (tas, tasmax, tasmin) works best in topographically heterogeneous terrain, while its effect in flat terrain is much lower. This mainly comes from the relatively simple procedure applied that uses atmospheric temperature lapse rates, B-spline interpolations, and high-resolution orography alone to downscale air temperatures without any incorporation of, for example, radiation budgets or air movements. Downscaling additionally improves the representation of temperature extremes, with absolute bias reductions exceeding those for mean temperatures.

The temperature downscaling does not use a full physical scheme as is usual in dynamical downscaling routines. Although the inclusion of additional effects other than the atmospheric lapse rate correction in a downscaling procedure would give more physically realistic estimates of air temperatures, the differences from such an increase in complexity at very high resolutions are minimal in this case, as shown by the comparison with the numerically downscaled WRF data over North America. Dynamical downscaling, however, comes at a large computational cost that makes it not yet feasible for global kilometre-scale applications (Schär et al., 2019; Ban et al., 2021).

While the overall performance of W5E5 and CHELSA-W5E5 is already high (r>0.9), the W5E5 data show a lower fit with observations from GHCN-D during the spring and summer period. There are also limitations of the downscaling using mean daily lapse rates, especially for minimum 2 m daily air temperatures. The evaluation shows that downscaling tasmin with a mean daily temperature lapse rate as applied here can actually also increase the bias. In North America, this seems to happen especially in the high plateaus of the Rocky Mountains (Fig. 6), where minimum temperatures usually occur during nocturnal inversions (Whiteman, 1982), which cause positive temperature lapse rates with elevation. In this case the use of a mean daily temperature lapse rate is not representative. Since the application of a different lapse rate for minimum daily 2 m air temperature and maximum daily 2 m air temperature could lead to higher minimum than maximum temperatures, this problem cannot be solved by running the CHELSA algorithm at a daily resolution but only by increasing the temporal resolution and deriving daily maximum and minimum 2 m air temperatures from hourly downscaled air temperatures.
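Both effects described above can be illustrated with a generic lapse-rate adjustment. The sketch below is not the CHELSA implementation, which additionally relies on B-spline interpolation; the function name, the elevations, and all lapse-rate values are hypothetical and chosen only to show the sign of the effects. Lapse rates are expressed as the change in temperature per metre of elevation gain, so an inversion has a positive value.

```python
def lapse_rate_adjust(t_coarse, z_coarse, z_fine, lapse_rate):
    """Shift a coarse-cell temperature (deg C) to a fine-cell elevation (m).

    lapse_rate is dT/dz in K per m: negative when temperature falls with
    height, positive during an inversion.
    """
    return t_coarse + lapse_rate * (z_fine - z_coarse)


# Case 1: a valley floor 500 m below the coarse-cell elevation.
# A mean daily lapse rate warms the valley relative to the coarse cell ...
valley_mean = lapse_rate_adjust(-5.0, 2000.0, 1500.0, -0.0065)
# ... whereas a nocturnal inversion would make it colder, so applying the
# mean rate to tasmin yields a warm bias under cold-air pooling.
valley_inversion = lapse_rate_adjust(-5.0, 2000.0, 1500.0, 0.004)

# Case 2: a ridge 1000 m above the coarse-cell elevation, with separate
# (hypothetical) lapse rates for tasmax and tasmin.
ridge_tasmax = lapse_rate_adjust(2.0, 2000.0, 3000.0, -0.008)
ridge_tasmin = lapse_rate_adjust(-5.0, 2000.0, 3000.0, 0.004)
# Here the downscaled minimum exceeds the downscaled maximum, which is the
# physical inconsistency that rules out independent daily lapse rates.
inconsistent = ridge_tasmin > ridge_tasmax
```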

5.2 Precipitation

Downscaling also increases the correlation of precipitation with observations, although not to such a large degree as in the case of air temperatures. The coarse W5E5 data already have a high (r>0.9) correlation with observations, which is globally not much improved by the downscaling. However, the global comparison might be misleading here as the downscaling mainly affects precipitation rates at a very local scale, where it has been shown to lead to large improvements (Karger et al., 2021). Topographic downscaling using the CHELSA v2.1 algorithm for precipitation rates has been shown to create long-term mean spatial patterns of precipitation rates that are extremely similar to those produced with dynamical downscaling using WRF over topographically complex terrain (Karger et al., 2021). A clear disadvantage of the presented precipitation downscaling is that it cannot resolve convective precipitation, as only orographic effects are accounted for. While the mean bias in precipitation rates is generally decreased by the downscaling, the bias is larger during extreme precipitation events in topographically homogeneous terrain. These events are better captured by dynamical downscaling using a model such as WRF at convection-permitting resolutions.

5.3 Surface downwelling shortwave solar radiation

Surface downwelling shortwave solar radiation under clear-sky conditions is the only variable that is not directly downscaled but is fully mechanistically derived from terrain attributes. The algorithm for clear-sky solar radiation applied here captures terrain effects on solar radiation at very high spatial resolutions and has been shown to be effective in topographically complex terrain (Böhner and Antonic, 2009). Interpolations and direct downscaling are done on atmospheric cloud cover that is used to account for the amount of radiation which is absorbed and reflected by clouds. The high-resolution total cloud cover estimated by the algorithm has been shown to have monthly normals which correlate well with observations from GHCN-D (r=0.84; Brun et al., 2022), even though the algorithm does not include convective cloud formation at kilometre-scale resolutions. While the bias is substantially reduced in the mid-range of rsds values, extreme high solar radiation shows stronger deviations from observations. This might be due to the relatively simple correction applied for rsds using cloud cover or to overestimates in the atmospheric scatter, which is estimated with a bulk value of 80 %. While it is unclear which part of the downscaling is responsible for the deviation at high rsds values, this deviation shows where future developments of the downscaling should focus and where clear limitations remain.

5.4 Implications for applications

While the topographic downscaling increases the accuracy of the data, it most likely violates certain physical relationships, due to both the simplicity of the downscaling algorithm and the fact that the five variables are downscaled independently of each other. These limitations are often encountered in univariate downscaling or bias correction procedures (Zscheischler et al., 2019) and should be kept in mind when applying the output data of the downscaling in further analysis. Additionally, extreme values of rsds should be used with care, and tasmin can show large deviations in areas with cold-air pooling.

The data provided are additionally cropped by a land–sea mask that has been designed to include all 30 arcsec grid cells that overlap with a land mask, plus a buffer to account for potential spatial inaccuracies. This practically excludes all ocean surface areas. However, the algorithms applied here are solely forced by topography, and if no topography is present, the downscaling is only done by a B-spline interpolation. Since this does not add information, we excluded all areas without topography to decrease the amount of data that needs to be stored.

6 Applications for impact modelling within ISIMIP

To test whether the improvements achieved by the downscaling, here shown as improved correlations and reduced biases compared to observed climate, also matter for impact modelling, the data will be further tested within the Inter-Sectoral Impact Model Intercomparison Project (ISIMIP). To this end, a range of impact models from different sectors (e.g. hydrological models, forest models, or agricultural models) will be run at 1 km and 0.5° resolution (and essentially a range of resolutions in between, produced using the same approach as presented here for 1 km) and compared to typical observational evaluation data for these impact models, such as ecosystem productivity data from eddy-covariance towers (Reyer et al., 2020) for forest models or discharge data for hydrological models (Huang et al., 2017; Liersch et al., 2020). Moreover, the CHELSA-W5E5 dataset will be employed to bias-adjust future climate projections in the upcoming ISIMIP phase 3 at high resolution to also allow for regional applications at high spatial resolution that are still consistent with the wider ISIMIP framework.

7 Data availability

The output of the CHELSA-W5E5 model is freely available under a CC0 1.0 Universal Public Domain Dedication (CC0 1.0) license at (Karger et al., 2022).

8 Code availability

Source codes of the CHELSA model used for the downscaling are available at (Karger et al., 2023a).

Source codes of the evaluation are available at (Karger et al., 2023b).

9 Conclusions

In conclusion, our evaluation shows that the CHELSA downscaling procedure applied to W5E5 improves the accuracy of modelled air temperatures, precipitation rates, and downwelling shortwave solar radiation. The downscaling generally increased the correlation between simulated and observed variables and decreased bias and errors in most cases. However, exceptions were noted in the case of tasmin and rsds. The downscaling of air temperatures was found to work best in topographically heterogeneous terrain, with improvements in the representation of temperature extremes. The downscaling of precipitation rates was found to lead to large improvements at a very local scale, but it could not resolve convective precipitation. Additionally, the downscaling of surface downwelling shortwave solar radiation was also found to be effective in topographically complex terrain. Despite these improvements, there are still limitations connected to the downscaling procedure that should be taken into account when applying the data in climate impact studies. These include the use of mean daily lapse rates to downscale tasmin, which can actually increase the bias in the data, and the inability of the downscaling to capture convective precipitation.

Author contributions

DNK, SL, and CPOR conceived the study with input from KF and NEZ. DNK developed and conducted the downscaling. OC developed the rsds algorithm. CH and DNK conducted the validation with input from SL and CPOR. DNK wrote the first version of the manuscript with input from all co-authors, and all authors contributed significantly to further revisions.

Competing interests

The contact author has declared that none of the authors has any competing interests.


Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


This paper is based upon work undertaken as part of COST Action CA 19139 PROCLIAS, supported by the COST Association (European Cooperation in Science and Technology –, last access: 16 September 2021) and also benefitted from discussion within the Intersectoral Impact Model Intercomparison Project (ISIMIP). Dirk Nikolaus Karger and Niklaus E. Zimmermann acknowledge funding from the WSL internal grant exCHELSA, the 2019–2020 BiodivERsA joint call for research proposals, under the BiodivClim ERA-Net COFUND programme, with the funding organizations Swiss National Science Foundation SNF (project FeedBaCks, 193907) and the Swiss Data Science Center Project, SPEEDMIND and COMECO. Dirk Nikolaus Karger acknowledges funding from the ERA-Net BiodivERsA–Belmont Forum with the national funder the Swiss National Science Foundation (20BD21_184131), part of the 2018 Joint call BiodivERsA-Belmont Forum call (project “FutureWeb”), the WSL internal grant ClimEx, and the Swiss National Science Foundation (project ADOHRIS, 205530). Stefan Lange acknowledges funding from the German Research Foundation (DFG; project no. 427397136) and the German Federal Ministry of Education and Research (BMBF; grant ID 01LP1907A). We thank the GEBA data providers for making their data available. GEBA is co-funded by the Federal Office of Meteorology and Climatology MeteoSwiss within the framework of GCOS Switzerland. Funding from the EU Horizon 2020 Research and Innovation programme under grant agreement 821010 (CASCADES) supported the work of Christopher P. O. Reyer.

Financial support

This research has been supported by the European Cooperation in Science and Technology (grant no. CA 19139 PROCLIAS), the Biodiversa+ (grant nos. FeedBaCks and FutureWeb), the Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (grant nos. 193907, 205530, and 20BD21_184131), the Swiss Federal Institute for Forest, Snow and Landscape Research (grant nos. ClimEx and exCHELSA), the Deutsche Forschungsgemeinschaft (grant no. 427397136), and the Horizon 2020 (grant no. 821010 (CASCADES)).

Review statement

This paper was edited by Guanyu Huang and reviewed by Minghu Ding and one anonymous referee.


Adler, R. F., Huffman, G. J., Chang, A., Ferraro, R., Xie, P. P., Janowiak, J., Rudolf, B., Schneider, U., Curtis, S., Bolvin, D., Gruber, A., Susskind, J., Arkin, P., and Nelkin, E.: The version-2 global precipitation climatology project (GPCP) monthly precipitation analysis (1979–present), J. Hydrometeorol., 4, 1147–1167, 2003. 

Balsamo, G., Beljaars, A., Scipal, K., Viterbo, P., van den Hurk, B., Hirschi, M., and Betts, A. K.: A Revised Hydrology for the ECMWF Model: Verification from Field Site to Terrestrial Water Storage and Impact in the Integrated Forecast System, J. Hydrometeorol., 10, 623–643, 2009.

Balsamo, G., Boussetta, S., Lopez, P., and Ferranti, L.: Evaluation of ERA-Interim and ERA-Interim-GPCP-rescaled precipitation over the U.S.A., ECMWF, Shinfield Park, Reading, 2010. 

Ban, N., Caillaud, C., Coppola, E., Pichelli, E., Sobolowski, S., Adinolfi, M., Ahrens, B., Alias, A., Anders, I., Bastin, S., Belušić, D., Berthou, S., Brisson, E., Cardoso, R. M., Chan, S. C., Christensen, O. B., Fernández, J., Fita, L., Frisius, T., Gašparac, G., Giorgi, F., Goergen, K., Haugen, J. E., Hodnebrog, Ø., Kartsios, S., Katragkou, E., Kendon, E. J., Keuler, K., Lavin-Gullon, A., Lenderink, G., Leutwyler, D., Lorenz, T., Maraun, D., Mercogliano, P., Milovac, J., Panitz, H.-J., Raffa, M., Remedio, A. R., Schär, C., Soares, P. M. M., Srnec, L., Steensen, B. M., Stocchi, P., Tölle, M. H., Truhetz, H., Vergara-Temprado, J., de Vries, H., Warrach-Sagi, K., Wulfmeyer, V., and Zander, M. J.: The first multi-model ensemble of regional climate simulations at kilometer-scale resolution, part I: evaluation of precipitation, Clim. Dynam., 57, 275–302, 2021.

Barton, M. G., Clusella-Trullas, S., and Terblanche, J. S.: Spatial scale, topography and thermoregulatory behaviour interact when modelling species' thermal niches, Ecography, 42, 376–389, 2019.

Böhner, J. and Antonic, O.: Land-Surface Parameters Specific to Topo-Climatology, in: Geomorphometry: Concepts, Software, Applications, edited by: Hengl, T. and Reuter, H. I., Elsevier Science, 195–226, 2009.

Brasseur, G. P. and Gallardo, L.: Climate services: Lessons learned and future prospects, Earth's Future, 4, 79–89, 2016.

Chang, J., Ciais, P., Wang, X., Piao, S., Asrar, G., Betts, R., Chevallier, F., Dury, M., François, L., Frieler, K., Ros, A. G. C., Henrot, A.-J., Hickler, T., Ito, A., Morfopoulos, C., Munhoven, G., Nishina, K., Ostberg, S., Pan, S., Peng, S., Rafique, R., Reyer, C., Rödenbeck, C., Schaphoff, S., Steinkamp, J., Tian, H., Viovy, N., Yang, J., Zeng, N., and Zhao, F.: Benchmarking carbon fluxes of the ISIMIP2a biome models, Environ. Res. Lett., 12, 045002, 2017.

Cucchi, M., Weedon, G. P., Amici, A., Bellouin, N., Lange, S., Müller Schmied, H., Hersbach, H., and Buontempo, C.: WFDE5: bias-adjusted ERA5 reanalysis data for impact studies, Earth Syst. Sci. Data, 12, 2097–2120, 2020.

Daly, C., Neilson, R. P., and Phillips, D. L.: A Statistical-Topographic Model for Mapping Climatological Precipitation over Mountainous Terrain, J. Appl. Meteor., 33, 140–158, 1994.

Daly, C., Taylor, G. H., and Gibson, W. P.: The PRISM approach to mapping precipitation and temperature, Proc. 10th AMS Conf. on Applied Climatology, 20–23, 1997. 

Danielson, J. J. and Gesch, D. B.: Global multi-resolution terrain elevation data 2010 (GMTED2010), US Geological Survey, Open-File Report 2011–1073, 26 p., 2011. 

Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P., Beljaars, A. C., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler, M., Matricardi, M., Mcnally, A. P., Monge-Sanz, B. M., Morcrette, J. J., Park, B. K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut, J. N., and Vitart, F.: The ERA-Interim reanalysis: Configuration and performance of the data assimilation system, Q. J. Roy. Meteor. Soc., 137, 553–597, 2011.

Fiddes, J. and Gruber, S.: TopoSCALE v.1.0: downscaling gridded climate data in complex terrain, Geosci. Model Dev., 7, 387–405, 2014.

Gerlitz, L., Conrad, O., and Böhner, J.: Large-scale atmospheric forcing and topographic modification of precipitation rates over High Asia – a neural-network-based approach, Earth Syst. Dynam., 6, 61–81, 2015.

Giorgi, F., Jones, C., and Asrar, G. R.: Addressing climate information needs at the regional level: the CORDEX framework, WMO Bulletin, 58, 175–183, 2009. 

Harris, I., Osborn, T. J., Jones, P., and Lister, D.: Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset, Sci. Data, 7, 1–18, 2020.

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., Chiara, G. D., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., Rosnay, P. de, Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, 2020.

Hewitt, C., Mason, S., and Walland, D.: The Global Framework for Climate Services, Nat. Clim. Change, 2, 831–832, 2012.

Huang, S., Kumar, R., Flörke, M., Yang, T., Hundecha, Y., Kraft, P., Gao, C., Gelfan, A., Liersch, S., Lobanova, A., Strauch, M., van Ogtrop, F., Reinhardt, J., Haberlandt, U., and Krysanova, V.: Evaluation of an ensemble of regional hydrological models in 12 large-scale river basins worldwide, Climatic Change, 141, 381–397, 2017.

Huber, V., Krummenauer, L., Peña-Ortiz, C., Lange, S., Gasparrini, A., Vicedo-Cabrera, A. M., Garcia-Herrera, R., and Frieler, K.: Temperature-related excess mortality in German cities at 2 °C and higher degrees of global warming, Environ. Res., 186, 109447, 2020.

Huber, V., Ortiz, C. P., Puyol, D. G., Lange, S., and Sera, F.: Evidence of rapid adaptation integrated into projections of temperature-related excess mortality, Environ. Res. Lett., 17, 044075, 2022.

IPCC: Climate change 2022: impacts, adaptation and vulnerability, edited by: Pörtner, H.-O., Roberts, D. C., Tignor, M., Poloczanska, E. S., Mintenbeck, K., Alegría, A., Craig, M., Langsdorf, S., Löschke, S., Möller, V., Okem, A., and Rama, B., Cambridge University Press, Cambridge, UK and New York, NY, USA, 3056 pp., 2022.

Karger, D. N., Conrad, O., Böhner, J., Kawohl, T., Kreft, H., Soria-Auza, R. W., Zimmermann, N. E., Linder, H. P., and Kessler, M.: Climatologies at high resolution for the earth's land surface areas, Sci. Data, 4, 170122, 2017.

Karger, D. N., Schmatz, D. R., Dettling, G., and Zimmermann, N. E.: High-resolution monthly precipitation and temperature time series from 2006 to 2100, Sci. Data, 7, 248, 2020.

Karger, D. N., Wilson, A. M., Mahony, C., Zimmermann, N. E., and Jetz, W.: Global daily 1 km land surface precipitation based on cloud cover-informed downscaling, Sci. Data, 8, 307, 2021.

Karger, D. N., Lange, S., Hari, C., Reyer, C. P. O., and Zimmermann, N. E.: CHELSA-W5E5 v1.0: W5E5 v1.0 downscaled with CHELSA v2.0, ISIMIP Repository [data set], 2022.

Karger, D. N., Lange, S., Hari, C., Reyer, C. P. O., Conrad, O., Zimmermann, N. E., and Frieler, K.: CHELSA-W5E5: V1.0. In Earth System Science Data (V1.0), Zenodo [code], 2023a.

Karger, D. N., Lange, S., Hari, C., Reyer, C. P. O., Conrad, O., Zimmermann, N. E., and Frieler, K.: CHELSA-W5E5-validation: V1.0. In Earth System Science Data (V1.0), Zenodo [code], 2023b.

Kasten, F. and Czeplak, G.: Solar and terrestrial radiation dependent on the amount and type of cloud, Sol. Energy, 24, 177–189, 1980.

Krysanova, V. and Hattermann, F. F.: Intercomparison of climate change impacts in 12 large river basins: overview of methods and summary of results, Climatic Change, 141, 363–379, 2017.

Lange, S.: WFDE5 over land merged with ERA5 over the ocean (W5E5) (1.0), GFZ Data Services, 2019.

Lanzante, J. R., Dixon, K. W., Nath, M. J., Whitlock, C. E., and Adams-Smith, D.: Some Pitfalls in Statistical Downscaling of Future Climate, B. Am. Meteorol. Soc., 99, 791–803, 2018.

Liersch, S., Drews, M., Pilz, T., Salack, S., Sietz, D., Aich, V., Larsen, M. A. D., Gädeke, A., Halsnæs, K., Thiery, W., Huang, S., Lobanova, A., Koch, H., and Hattermann, F. F.: One simulation, different conclusions – the baseline period makes the difference!, Environ. Res. Lett., 15, 104014, 2020.

Linacre, E.: Climate Data and Resources: A reference and guide, Routledge, London, 384 pp., 1992.

List, R. J.: Smithsonian meteorological tables, sixth revised edition, Smithsonian Institution Press, City of Washington, 521 pp., 1968.

Lourenço, T. C., Swart, R., Goosen, H., and Street, R.: The rise of demand-driven climate services, Nat. Clim. Change, 6, 13–14, 2016.

Maraun, D. and Widmann, M.: Statistical Downscaling and Bias Correction for Climate Research, Cambridge University Press, Cambridge, 2018.

Maraun, D., Shepherd, T. G., Widmann, M., Zappa, G., Walton, D., Gutiérrez, J. M., Hagemann, S., Richter, I., Soares, P. M. M., Hall, A., and Mearns, L. O.: Towards process-informed bias correction of climate change simulations, Nat. Clim. Change, 7, 764–773, 2017.

Mengel, M., Treu, S., Lange, S., and Frieler, K.: ATTRICI v1.1 – counterfactual climate for impact attribution, Geosci. Model Dev., 14, 5269–5284, 2021.

Menne, M. J., Bryant, J. A., Korzeniewski, S. M., Kristy, T., Xungang, Y., Anthony, S., Ray, R., Vose, R. S., Gleason, B. E., and Houston, T. G.: Global Historical Climatology Network – Daily (GHCN-Daily), Version 3, NOAA National Climatic Data Center [data set], 2018.

Müller Schmied, H., Eisner, S., Franz, D., Wattenbach, M., Portmann, F. T., Flörke, M., and Döll, P.: Sensitivity of simulated global-scale freshwater fluxes and storages to input data, hydrological model structure, human water use and calibration, Hydrol. Earth Syst. Sci., 18, 3511–3538, 2014.

Muñoz-Sabater, J., Dutra, E., Agustí-Panareda, A., Albergel, C., Arduini, G., Balsamo, G., Boussetta, S., Choulga, M., Harrigan, S., Hersbach, H., Martens, B., Miralles, D. G., Piles, M., Rodríguez-Fernández, N. J., Zsoter, E., Buontempo, C., and Thépaut, J.-N.: ERA5-Land: a state-of-the-art global reanalysis dataset for land applications, Earth Syst. Sci. Data, 13, 4349–4383, 2021.

Rasmussen, R. and Liu, C.: High Resolution WRF Simulations of the Current and Future Climate of North America, Research Data Archive at the National Center for Atmospheric Research, Computational and Information Systems Laboratory [data set], 2017.

Reyer, C. P. O., Silveyra Gonzalez, R., Dolos, K., Hartig, F., Hauf, Y., Noack, M., Lasch-Born, P., Rötzer, T., Pretzsch, H., Meesenburg, H., Fleck, S., Wagner, M., Bolte, A., Sanders, T. G. M., Kolari, P., Mäkelä, A., Vesala, T., Mammarella, I., Pumpanen, J., Collalti, A., Trotta, C., Matteucci, G., D'Andrea, E., Foltýnová, L., Krejza, J., Ibrom, A., Pilegaard, K., Loustau, D., Bonnefond, J.-M., Berbigier, P., Picart, D., Lafont, S., Dietze, M., Cameron, D., Vieno, M., Tian, H., Palacios-Orueta, A., Cicuendez, V., Recuero, L., Wiese, K., Büchner, M., Lange, S., Volkholz, J., Kim, H., Horemans, J. A., Bohn, F., Steinkamp, J., Chikalanov, A., Weedon, G. P., Sheffield, J., Babst, F., Vega del Valle, I., Suckow, F., Martel, S., Mahnken, M., Gutsch, M., and Frieler, K.: The PROFOUND Database for evaluating vegetation models and simulating climate impacts on European forests, Earth Syst. Sci. Data, 12, 1295–1320, 2020.

Roe, G. H.: Orographic Precipitation, Annu. Rev. Earth Pl. Sc., 33, 645–671, 2005.

Ruane, A. C., Goldberg, R., and Chryssanthacopoulos, J.: Climate forcing datasets for agricultural modeling: Merged products for gap-filling and historical climate series estimation, Agr. Forest Meteorol., 200, 233–248, 2015.

Ruane, A. C., Phillips, M., Müller, C., Elliott, J., Jägermeyr, J., Arneth, A., Balkovic, J., Deryng, D., Folberth, C., Iizumi, T., Izaurralde, R. C., Khabarov, N., Lawrence, P., Liu, W., Olin, S., Pugh, T. A. M., Rosenzweig, C., Sakurai, G., Schmid, E., Sultan, B., Wang, X., de Wit, A., and Yang, H.: Strong regional influence of climatic forcing datasets on global crop model ensembles, Agr. Forest Meteorol., 300, 108313, 2021.

Sayre, R., Noble, S., Hamann, S., Smith, R., Wright, D., Breyer, S., Butler, K., Graafeiland, K. V., Frye, C., Karagulle, D., Hopkins, D., Stephens, D., Kelly, K., Basher, Z., Burton, D., Cress, J., Atkins, K., Sistine, D. P. V., Friesen, B., Allee, R., Allen, T., Aniello, P., Asaad, I., Costello, M. J., Goodin, K., Harris, P., Kavanaugh, M., Lillis, H., Manca, E., Muller-Karger, F., Nyberg, B., Parsons, R., Saarinen, J., Steiner, J., and Reed, A.: A new 30 meter resolution global shoreline vector and associated global islands database for the development of standardized ecological coastal units, J. Oper. Oceanogr., 12, S47–S56, 2019.

Scambos, T. A., Haran, T. M., Fahnestock, M. A., Painter, T. H., and Bohlander, J.: MODIS-based Mosaic of Antarctica (MOA) data sets: Continent-wide surface morphology and snow grain size, Remote Sens. Environ., 111, 242–257, 2007.

Schär, C., Fuhrer, O., Arteaga, A., Ban, N., Charpilloz, C., Di Girolamo, S., Hentgen, L., Hoefler, T., Lapillonne, X., Leutwyler, D., Osterried, K., Panosetti, D., Rüdisühli, S., Schlemmer, L., Schulthess, T., Sprenger, M., Ubbiali, S., and Wernli, H.: Kilometer-scale climate models: Prospects and challenges, B. Am. Meteorol. Soc., 101, E567–E587, 2019.

Schneider, U., Becker, A., Fingler, A., Meyer-Christoffer, A., and Ziese, M.: GPCC Full Data Monthly Product Version 2018 at 0.5°: Monthly Land-Surface Precipitation from Rain-Gauges built on GTS-based and Historical Data, DWD [data set], 2018.

Shi, H., Tian, H., Lange, S., Yang, J., Pan, S., Fu, B., and Reyer, C. P. O.: Terrestrial biodiversity threatened by increasing global aridity velocity under high-level warming, P. Natl. Acad. Sci., 118, e2015552118, 2021.

Skamarock, C., Klemp, B., Dudhia, J., Gill, O., Liu, Z., Berner, J., Wang, W., Powers, G., Duda, G., Barker, D., and Huang, X.: A Description of the Advanced Research WRF Model Version 4, OpenSky, 2019.

Sørland, S. L., Brogli, R., Pothapakula, P. K., Russo, E., Van de Walle, J., Ahrens, B., Anders, I., Bucchignani, E., Davin, E. L., Demory, M.-E., Dosio, A., Feldmann, H., Früh, B., Geyer, B., Keuler, K., Lee, D., Li, D., van Lipzig, N. P. M., Min, S.-K., Panitz, H.-J., Rockel, B., Schär, C., Steger, C., and Thiery, W.: COSMO-CLM regional climate simulations in the Coordinated Regional Climate Downscaling Experiment (CORDEX) framework: a review, Geosci. Model Dev., 14, 5125–5154, 2021.

Weedon, G. P., Gomes, S., Viterbo, P., Shuttleworth, W. J., Blyth, E., Österle, H., Adam, J. C., Bellouin, N., Boucher, O., and Best, M.: Creation of the WATCH Forcing Data and Its Use to Assess Global and Regional Reference Crop Evaporation over Land during the Twentieth Century, J. Hydrometeorol., 12, 823–848, 2011.

Weedon, G. P., Balsamo, G., Bellouin, N., Gomes, S., Best, M. J., and Viterbo, P.: The WFDEI meteorological forcing data set: WATCH Forcing Data methodology applied to ERA-Interim reanalysis data, Water Resour. Res., 50, 7505–7514, 2014.

Wessel, P. and Smith, W. H. F.: A global, self-consistent, hierarchical, high-resolution shoreline database, J. Geophys. Res.-Sol. Ea., 101, 8741–8743, 1996.

Whiteman, C. D.: Breakup of Temperature Inversions in Deep Mountain Valleys: Part I. Observations, J. Appl. Meteorol. Climatol., 21, 270–289, 1982.

Wilby, R. L., Wigley, T. M. L., Conway, D., Jones, P. D., Hewitson, B. C., Main, J., and Wilks, D. S.: Statistical downscaling of general circulation model output: A comparison of methods, Water Resour. Res., 34, 2995–3008, 1998.

Wild, M., Ohmura, A., Schär, C., Müller, G., Folini, D., Schwarz, M., Hakuba, M. Z., and Sanchez-Lorenzo, A.: The Global Energy Balance Archive (GEBA) version 2017: a database for worldwide measured surface energy fluxes, Earth Syst. Sci. Data, 9, 601–613, 2017.

Wilson, J. P. and Gallant, J. C.: Terrain Analysis: Principles and Applications, 1st Edn., Wiley, New York, 479 pp., 2000. 

Zscheischler, J., Fischer, E. M., and Lange, S.: The effect of univariate bias adjustment on multivariate hazard estimates, Earth Syst. Dynam., 10, 31–43, 2019.

Short summary
We present the first 1 km, daily, global climate dataset for climate impact studies. We show that the high-resolution data have a decreased bias and higher correlation with measurements from meteorological stations than coarser data. The dataset will be of value for a wide range of climate change impact studies both at global and regional level that benefit from using a consistent global dataset.