the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
SHEDIS-Temperature: linking temperature-related disaster impacts to subnational data on meteorology and human exposure
Sara Lindersson
Gabriele Messori
International databases of disaster impacts are crucial for advancing disaster risk research, particularly as climate change intensifies the frequency and intensity of many natural hazards – including temperature extremes. However, many widely-used disaster impact databases lack information on the physical dimension of the hazards associated with an impact, and on the exposure to such hazards. This hinders analysing drivers of severe disaster outcomes. To bridge this knowledge gap, we present SHEDIS-Temperature, a dataset that provides Subnational Hazard and Exposure information for temperature-related DISaster impact records (https://doi.org/10.7910/DVN/WNOTTC; Lindersson and Messori, 2025). This open-access dataset links temperature-related impact records from the Emergency Events Database (EM-DAT) with subnational data on their locations, associated meteorological time series, and population maps. SHEDIS-Temperature provides hazard and exposure data for 2835 subnational locations associated with 382 disaster records from 1979–2018 in 71 countries. Detailed hazard metrics, derived from 0.1° 3 hourly data, encompass absolute indicators, such as the heat stress measure apparent temperature accounting for humidity and wind speed, as well as percentile-based indicators of when and where temperatures exceeded local thresholds. Population exposure data include annual population figures for impacted subnational administrative units and person-days of exposure to threshold-exceeding temperatures. Outputs are available at grid-point level as well as zonally aggregated to administrative subdivision units, and disaster-record levels. Technical validation against a station-based dataset indicated minor systematic biases – slightly overestimated minimum and underestimated maximum temperatures – but confirmed high consistency between datasets, with correlation coefficients ≥0.9 and mean absolute errors ≤2 °C. By providing comprehensive attributes across the hazard-exposure spectrum, SHEDIS-Temperature supports interdisciplinary research on past temperature-related disasters, offering valuable insights for future risk mitigation and resilience strategies.
- Article
(8766 KB) - Full-text XML
- BibTeX
- EndNote
International databases documenting impacts from natural hazards play a central role in advancing quantitative research on disaster risk (Jones et al., 2022; UNDRR, 2022). These collections enable researchers, organisations and agencies to track how disaster impacts vary across regions and over time, including to monitor progress of disaster risk reduction globally (Aitsi-Selmi et al., 2015). Combined with additional risk-relevant information – such as estimates of the physical hazard, exposed population, and socioeconomic indicators – impact records can pinpoint factors contributing to particularly severe outcomes (Kahn, 2005; Lindersson et al., 2023; Mochizuki et al., 2014; Tselios and Tompkins, 2019; Vestby et al., 2024). Lessons learned from past events are, furthermore, increasingly important for guiding future risk mitigation and resilience efforts as climate change drives shifts in the frequency and intensity of many natural hazards (McBean and McCarthy, 1990). Hot and cold extremes are primary examples of fatal hazards under rapid change (Gallo et al., 2024; García-León et al., 2024; Gasparrini et al., 2015; IPCC, 2023; Lüthi et al., 2023; Russo et al., 2019).
The Emergency Events Database (EM-DAT; CRED and UCLouvain, 2023; Delforge et al., 2025), maintained by the Centre for Research on the Epidemiology of Disasters, is a leading open-access resource for international disaster impact data (Jones et al., 2022; Panwar and Sen, 2020). Widely used for its extensive set of national-level records of human and economic losses from major disasters, EM-DAT remains a cornerstone of empirical disaster research despite certain limitations, such as underreporting and a bias toward advanced economies (Acevedo Mejia, 2016; Green et al., 2019; Jones et al., 2022, 2023; Wirtz et al., 2014). Beyond issues with missing data, EM-DAT lacks spatiotemporal detail for impacts and the associated hazards. The physical magnitude of temperature extremes, for instance, is often missing or reduced to a single maximum or minimum air temperature, without specifics on the timing, duration or location. Furthermore, multiple meteorological factors beyond the (dry bulb) air temperature, including humidity and wind, substantially influence stress levels experienced by the human body during extreme temperatures (Cvijanovic et al., 2023). These limitations place the responsibility on users to link impact records to additional data sources when a more comprehensive risk analysis is needed. However, recent advancements in high-resolution data on disaster locations, meteorological data and population patterns present new opportunities for systematic data integration across the risk spectrum.
This article introduces SHEDIS-Temperature, an open-access dataset that provides Subnational Hazard and Exposure information for temperature-related DISaster impact records (Fig. 1). To achieve this, we integrated the open-source Geocoded Disasters extension (GDIS; Rosvold and Buhaug, 2021), which geocodes many EM-DAT records to subnational locations, with high-resolution global time-series of meteorological variables from Multi-Source Weather (MSWX; Beck et al., 2022) and population data from the Global Human Settlement Population grids (GHS-POP; European Commission, 2023; Carioli et al., 2023). SHEDIS-Temperature (Lindersson and Messori, 2025) provides hazard and exposure data for 2835 subnational locations (referred to hereafter as subdivisions) associated with 382 disaster records from 1979–2018 in 71 countries (Fig. 1).
Figure 1Introduction to SHEDIS-Temperature. (a) The 382 national-level disaster records from 1979–2018 in 71 countries that underpin the dataset. These records comprise EM-DAT records (b) that have been geocoded to administrative subdivisions by GDIS (c). SHEDIS-Temperature expands on this by providing an extensive catalogue of hazard- and exposure-related attributes for each subdivision and disno (d), as well as data on grid point level.
SHEDIS-Temperature advances a growing field of research that links disaster impact data to in-situ and satellite-derived information (Brimicombe et al., 2021; Dellmuth et al., 2021; Felbermayr and Gröschl, 2014; Kageyama and Sawada, 2022; Mester et al., 2023). Our dataset offers three primary contributions. First, it includes detailed information on the physical hazards, including both absolute and percentile-based indicators. The absolute indicators, like maximum 2 m air temperature and apparent temperature are provided as daily statistics derived from 3 hourly data. The percentile-based threshold analysis identifies if, when, where, and by how much daily temperatures exceeded local 90th and 95th percentiles, enabling more context-sensitive assessments of extreme events. Second, SHEDIS-Temperature provides data on population exposure to these extreme temperatures, detailing annual population figures for each impacted subdivision and exposure to threshold-exceeding temperatures, expressed as person-days. Third, to support diverse research needs, we present outputs at three levels: grid point, subdivision and EM-DAT record (referred to hereafter as disno, short for disaster number). Open-access source scripts enable users to further adjust the outputs if needed.
The usefulness of SHEDIS-Temperature is multifaceted. It can serve as a corroboration of EM-DAT and GDIS by cross-verifying reported impact locations against observed extreme weather events. We also anticipate that SHEDIS-Temperature can support empirical analysis of temperature-related disasters across disciplines. We consider the granularity and flexibility of the dataset to be crucial, especially since disasters often have uneven impacts – not only across countries but within them as well (Masselot et al., 2023; Yin et al., 2023). Our work is also aligned with UNDRR's call for more integrated tracking systems that capture both the origins of hazards and their impacts (UNDRR, 2022). Ultimately, systematically connecting data on hazards, exposure and impacts is essential for quantifying the social vulnerability to disasters.
SHEDIS-Temperature links temperature-related impact records to subnational data on their physical occurrence and human exposure. The dataset was constructed through three main steps: (1) sampling and geocoding, (2) data processing at the grid point level, and (3) aggregating of outputs into the final dataset (Fig. 2). All analyses were performed in R v.4.3.3 and the WGS84 coordinate system. Area-corrected calculations were applied to derive grid cell areas and polygon areas with the R packages “terra” (Hijmans, 2025) and “sf” (Pebesma, 2018), respectively. Meteorological data in NetCDF were processed with Climate Data Operators (CDO; Schulzweida, 2023).
Figure 2Flowchart illustrating the main steps of integrating data from multiple sources to derive SHEDIS-Temperature. (1) A total of 382 temperature-related impact records from EM-DAT were successfully matched to subnational locations by GDIS, which we used to identify 2835 subdivision boundaries at level 1 (province/equivalent) and level 2 (county/district/equivalent) from GADM. (2) Within these identified geographical extents, daily statistics of meteorological variables (absolute values and percentiles) were computed from MSWX. Annual population figures were also interpolated from GHS-POP, and percentile-exceeding temperature events were identified. (3) Outputs were exported at three levels: grid point, subdivision and disno.
2.1 Sampling and geocoding
SHEDIS-Temperature extends the international disaster database EM-DAT (CRED and UCLouvain, 2023), which documents national-level disaster impacts meeting at least one of the following criteria: ≥10 fatalities, ≥100 affected individuals, a declared state of emergency, and/or a request for international assistance. SHEDIS-Temperature includes records that EM-DAT classifies as heat waves and cold waves. EM-DAT defines a heat wave as a period of abnormally hot and/or unusually humid weather, while a cold wave is a period of abnormally cold weather that may be exacerbated by high winds (CRED and UCLouvain, 2024b). Both types of events are also described as typically lasting two or more days, and specific temperature thresholds vary by region (CRED and UCLouvain, 2024).
Our dataset incorporates records from 1979–2018, aligning with the temporal scope of supporting datasets – Multi-Source Weather (MSWX; Beck et al., 2022) and the Global Human Settlement Population grid (GHS-POP; European Commission, 2023; Carioli et al., 2023), which begin in 1979 and 1975 respectively, and the Geocoded Disasters (GDIS) dataset (Rosvold and Buhaug, 2021), reaching up to 2018. Limiting the dataset to four recent decades also enhances data reliability, since impact records from earlier periods are generally more uncertain and biased (CRED and UCLouvain, 2024; Gall et al., 2009). The final sample of SHEDIS-Temperature includes impact records that meet this timespan and have been geocoded to administrative subdivisions by GDIS.
The creators of GDIS geocoded EM-DAT entries (disnos) to subnational levels by matching their location description in EM-DAT to administrative subdivision names provided by the Global database of Administrative boundaries (GADM), version 3.6 (https://www.gadm.org, last access: 5 October 2023). GADM provides names and corresponding polygons of administrative subdivisions across multiple hierarchical levels, including level-1 (province/equivalent), level-2 (county/district/equivalent), and level-3 (municipality/equivalent). The creators of GDIS thus linked each disno in EM-DAT to one or more of these subdivisions in GADM, across one or more hierarchical levels – when the location description in EM-DAT was sufficient to do so.
For each disno, GDIS provides the original location description from EM-DAT along with the name, level and centroid coordinate pair for one or more matched subdivisions from GADM. However, we identified several mismatches where a disno has been linked to a subdivision in the wrong country due to shared subdivision names. To address this, we derived country-specific ISO codes directly from the GDIS-provided coordinates and used them to reconstruct the disno number. If the assigned location falls outside the expected country, the disno number does not match with the list from EM-DAT and is excluded from the final sample.
To retrieve the boundary polygons instead of the centroid coordinates, we first converted GADM polygons to centroids and then applied a nearest-neighbour approach for each administrative level separately. We then controlled for discrepancies between the original location description in EM-DAT and the matched subdivision name. We identified one mismatch where the centroid of a subdivision had been misplaced by GDIS, which we corrected manually1. After having run these consistency checks, we replaced each centroid coordinate pair (from GDIS) with its original boundary polygon (from GADM).
For the purpose of SHEDIS-Temperature, we chose to replace the level-3 impacted subdivisions (n=148, associated with 45 unique disnos) with their parent level-2 divisions – due to the relatively coarse resolution of the global supporting datasets and the wide spatial extent of temperature extremes. Moreover, GADM provides level-3 subdivisions for only certain countries, making level-1 and level-2 divisions more suitable for consistent cross-country comparisons. Duplicates, due to multiple level-3 units having been replaced with the same level-2 unit for the same disno, were removed.
To reduce file size, we simplified the polygon shapes with the R package “rmapshaper” (Teucher et al., 2024), which performs topologically aware polygon simplifications. To clarify, aside from this minor shape simplification, the subdivision geometries provided by SHEDIS-Temperature are identical to those provided by the GADM v3.6 dataset.
2.1.1 Sample of disaster records and their subnational locations
The final dataset comprises 2835 impacted subdivisions, including 2353 level-1 administrative units (province/equivalent) and 482 level-2 units (county/district/equivalent), linked to 382 distinct disaster records (disnos) across 71 countries (Fig. 1). Of these, 63 % of the subdivisions are linked to 243 cold wave records in 60 countries, while the remaining subdivisions are linked to 139 heat wave disnos in 47 countries. The majority (83 %) of impacted subdivisions are level-1 administrative units. Since several subdivisions experienced multiple events during the study period, the dataset includes 931 unique level-1 and 343 unique level-2 subdivisions (Fig. A1).
The dataset spans 1979–2018, with most disnos recorded after 2000 (Fig. 3a). A notable spike in cold wave records appears in 2012, when cold waves in Europe led to recorded disasters in 26 countries, ten of which recorded events both at the beginning and end of the year. The European heat waves of 2003 and 2007 are also evident (Fig. 3a), resulting in 15 and 11 disaster impact records, respectively. Each disno in the sample includes a reported start month from EM-DAT, collectively illustrating the seasonal variability of these hot and cold extremes across continents (Fig. 3b).
Figure 3Overview of the disaster records underpinning SHEDIS-Temperature. (a) Histogram of the temporal distribution of disaster impact records from 1979–2018, highlighting the higher number of records in the latter part of the recording period. (b) Density plots illustrating the seasonal variability of reported start months for heat wave and cold wave records across continents. The uneven number of observations per continent also highlight the geographic bias of the parent dataset, EM-DAT. (c) Boxplots showing the number of subdivisions linked to each disno, per continent and reporting period. Note the cut in the y axis for visualization purposes.
The geographic distribution of the final sample reflects a bias in the parent dataset EM-DAT, with most records originating from Europe, Asia and the Americas (Figs. 1a and 3b). This bias arises from two factors: a reporting tendency towards advanced economies and the high density of small countries in continental Europe, which leads to multiple national-level records per meteorological event.
The median number of subdivisions impacted by each disno is four for cold waves and three for heat waves, though this also varies across continents and recording periods (Fig. 3c), from six in Europe to four in the Americas, three in Asia and Africa and 1.5 in Oceania. More recent records tend to be linked to a greater number of subdivisions compared to older ones, likely reflecting increased detail in disaster reporting over time. Figure 3c also displays an outlier in the sample with 78 linked subdivisions – a heat wave disno from Turkey (disno 2000-0381-TUR) for which EM-DAT offers an unusually long list of impacted locations.
2.2 Data processing at grid point level
2.2.1 Spatiotemporal boundaries for analysis
The simplified polygons outlining the impacted subdivisions define the spatial boundaries for the subsequent analysis, which we also refer to as the geometry for analysis. The analysis period for each disno is defined by the first date of the start month and last date of the end month as reported by EM-DAT, but expanded with one week at both ends for precaution. A majority (n=175) of the disnos have reported start months that coincide with the reported end month, resulting in a roughly six weeks analysis period. One impact record (disno 2007-0673-ROU) missed a reported end month, which we then assumed to be the month following the reported start month.
We chose not to rely on the daily information from EM-DAT because only approximately half of the disnos provide start and end days. Additionally, about a quarter of the records with daily information have start- and end-days that coincide, which contradicts the very disaster definition stating that heat waves and cold waves typically last for two days or more (CRED and UCLouvain, 2024).
2.2.2 Meteorological data processing
Multi-Source Weather (MSWX; Beck et al., 2022) is a high-resolution meteorological dataset derived from hourly ERA5 reanalysis data (Hersbach et al., 2020). MSWX bias-corrects and downscales the ERA5 data using nearest-neighbour interpolation to a spatial resolution of 0.1°. It provides seamless global NetCDF files at 3 hourly intervals beginning 1 January 1979. The 3 hourly MSWX values represent averages of the 1 hourly ERA5 data (Beck et al., 2022). ERA5 is widely regarded as the most reliable reanalysis dataset available. For instance, Liu et al. (2024) recently demonstrated its consistent quality for 2 m air temperature across most regions across the globe. For this study, we used MSWX-Past data on 2 m air temperature (°C), 2 m relative humidity (%) and 10 m wind speed (m s−1).
ERA5-Land (Muñoz Sabater, 2019) is another widely used 0.1° reanalysis product based on ERA5 and could also have been used in this study. MSWX was selected over ERA5-Land primarily for practical reasons. The 3 hourly temporal resolution of MSWX, compared to the 1 hourly structure of ERA5-Land, is less computationally demanding. MSWX also provides ready-to-use variables such as 10 m wind speed and 2 m relative humidity, whereas ERA5-Land only provides wind components and variables from which relative humidity must be derived, requiring additional processing steps. Unlike ERA5-Land, which is a physically consistent reanalysis without explicit bias correction (Muñoz Sabater, 2019), MSWX applies a statistical bias correction using multiple observational datasets (Beck et al., 2022). For a subset of events, we compared maximum and minimum temperature estimates from MSWX and ERA5-Land against EM-DAT records (as described for MSWX in Sect. 3.3.3). Both datasets showed broadly similar levels of agreement with EM-DAT, with the main difference being that ERA5-Land aligned less well with minimum temperatures during cold waves. Considering these factors, MSWX was chosen as the primary dataset for this study.
Using 3 hourly values of air temperature, wind speed, and relative humidity from MSWX, we calculated apparent temperature for each grid point within the impacted subdivisions and their respective analysis period. Apparent temperature quantifies the amplification of perceived temperatures due to wind and humidity, and can thus be used as a metric for thermal stress in humans (Steadman, 1984). The model assumes that the temperature is experienced outdoors but not in direct sunlight (Buzan et al., 2015). Although radiation is sometimes included in these calculations, we used the non-radiant version for our analysis, following the methodology by Steadman (1994). These calculations were performed using the “apparentTemp” function in the R package “HeatStress” (Casanueva, 2019), which calculates apparent temperature (at, °C) using air temperature (ta, °C), relative humidity (RH, %) and wind speed (u, m s−1), following Eqs. (1) and (2).
where e is the water vapour pressure (hPa), derived from air temperature and relative humidity as
with .
The constants a, b, and c differ depending on whether the air temperature is above or below the freezing point:
For each disno and subdivision in the sample, we then compiled daily time series for all grid points within the boundary polygon and the analysis period. The following variables were included: daily mean air temperature, daily maximum air temperature (TX), daily minimum air temperature (TN), daily mean apparent temperature, daily maximum apparent temperature (ATX), and daily minimum apparent temperature (ATN). The variables TX, TN, ATX, and ATN thus represent the most extreme 3 hourly values recorded within each 24 h period.
2.2.3 Population data processing
For estimating human exposure to extreme temperatures, we used global population maps from the Global Human Settlement Layer R2023 A (GHS-POP; European Commission, 2023; Carioli et al., 2023), which combines satellite imagery and census data to generate 5 year time series from 1975–2020. The population maps, initially at a spatial resolution of 30 arcsec, were resampled to align with the MSWX 0.1° data grid using the R package “exactextractr” (Baston, 2024).
During resampling, we extracted population estimates for each 5 year time step for all grid points within the boundaries of each subdivision. These population values were simultaneously scaled with the coverage fraction, which represents the proportion of each grid point being located within the subdivision boundaries. To address minor discrepancies in population values introduced during resampling, we also scaled the resampled population cell values to ensure that the population sum across each subdivision polygon matched the original, non-resampled population sum. Finally, we generated annual population estimates for each grid point by linearly interpolating the 5 year population estimates.
2.2.4 Percentiles of air temperature
We used copies of the TX and TN times series to derive percentiles for each day of the year, with respect to the 30 year reference period 1981–2010. This reference period was selected because it best corresponds to our analysis period (1979–2018), ensuring consistency with the temporal coverage of the study. We calculated percentiles centred on a 31 d moving window (following e.g. Russo et al., 2014, 2015, 2017; Vogel et al., 2019), and thus extended the reference period to also include the last 15 d of 1980 and the first 15 d of 2011 prior to the percentile calculations. 29 February was assigned the percentile value of 28 February in leap years.
Before calculating percentiles, we linearly detrended these reference period-long copies of the TX and TN time series, to remove the potential influence of long-term trends. This is in line with the notion that a climatological period should ideally be uniform (WMO Climatological Normals, 2024). We did this by using the CDO function “detrend” (Schulzweida, 2023), which removes the long-term linear trend along the specified time dimension. This function fits a least-squares linear regression model to each grid cell and subtracts the fitted trend component from the original time series, thereby isolating short-term variability from long-term changes, following Eq. (3).
Where x(t) is the original time series, and are the estimated intercept and slope of the linear trend, respectively, and x′(t) is the detrended series. To preserve the baseline characteristics of the temperature fields, the temporal mean of the original series, , was added back to the detrended values, yielding the final series as written in Eq. (4).
This ensured that the resulting data retained their original climatological mean while excluding long-term linear trends. The outputs from Eq. (4) were then used to calculate the percentiles. Please note that the detrended time series were used only to derive percentiles, while the rest of our analysis was conducted on the original non-detrended time series to maintain consistency with the rest of the analysis (which also relied on the non-detrended meteorological data).
For heat waves, we derived the 90th and 95th percentiles of TX. For cold waves, we calculated the 10th and 5th percentiles of TN.
2.2.5 Event detection analysis
We identified heat wave and cold wave events using percentile-based threshold analysis at grid point-level (Fig. 4). Heat waves were detected when TX exceeded the 90th or 95th percentile thresholds (referred to hereafter as pct90 and pct95 events), while cold waves were identified when TN fell below the 10th or 5th percentiles (referred to hereafter as pct10 and pct05 events). The identification of percentile exceedances was performed on the original (non-detrended) time series. Moreover, the percentiles are relative to the selected reference period. Consequently, the final number of identified percentile exceedances might deviate from the specified percentile numbers.
Figure 4Percentile-based methodology for detecting heat waves at grid point level. Red numbers indicate the exceedance of the daily maximum air temperature (TX) above the smoothed percentile-based threshold, which are summed to determine the event magnitude. A minimum duration of three days is required for classification as a heat wave. If a non-qualifying day falls between threshold-surpassing days, it is reclassified as also being threshold-surpassing. The diagram represents a synthetic time series, and the y axis does not start at zero. For detecting cold waves, we use daily minimum air temperature (TN) instead.
We consider the start of an event to be the first day the temperature crossed the threshold, and its end to be the first day it no longer did. If a non-qualifying day was directly preceded and followed by threshold-surpassing days, it is treated as also being threshold-surpassing, as exemplified in Fig. 4. A minimum duration of three consecutive days was required for a sequence to be classified as a heat wave or cold wave, consistent with common definitions in the climate literature (e.g. Meehl and Tebaldi, 2004; Perkins and Alexander, 2013; Perkins-Kirkpatrick and Lewis, 2020).
We define the event duration as the number of days between its start and end. Event magnitude was calculated as the sum of temperature exceedances relative to the threshold over the event duration, following e.g. Brown (2020). Human exposure, expressed in person-days, was quantified by multiplying the population count at each grid point by the event duration. For example, if a grid point with a population count of 1000 people experienced a seven-day event (as illustrated in Fig. 4), the total event exposure would be 7000 person-days.
Event-specific metrics were stored in CSV files: with one file per disno and one row per event at the grid point level.
Figure 5Illustration of the three output levels of SHEDIS-Temperature, using the maximum duration of a 95th percentile-exceeding event in France as an example. (a) At the disno level, the spatial extent (i.e. geometry for analysis) encompasses the combined boundaries of all identified impacted subdivisions, with the maximum duration representing the highest value among the three subdivisions. (b) At the subdivision level, the spatial extent corresponds to an individual impacted subdivision, where the maximum duration reflects the longest duration at any grid point within that specific polygon. (c) At the grid-point level, events are detected and recorded at the resolution of individual pixels.
2.3 Output aggregation
The hazard- and exposure-related attributes of SHEDIS-Temperature were aggregated into output files at two spatial levels: disno level and subdivision level. At the disno level, the spatial extent is defined by the combined boundaries of all identified impacted subdivisions within a given disno (Fig. 5a). At the subdivision level, the spatial extent corresponds to the individual polygon of each subdivision (Fig. 5b). Both levels provide the same set of attributes, including:
-
Metadata attributes
-
Temperature attributes averaged over the analysis period
-
Extreme daily temperature attributes
-
Extreme 3 hourly temperature attributes
-
Hazard and exposure attributes from the percentile-based event detection analysis
Additionally, results from the percentile-based threshold analysis at the grid point level are stored in separate files for each disno, with one row per percentile-exceeding event (Fig. 5c). Where applicable, the date and location of specific attributes are also saved (e.g., the coordinate pair and date of the warmest 3 hourly air temperature recorded at the grid point level). The full set of attributes provided by SHEDIS-Temperature are provided under Sect. 4.
3.1 Global analysis of human exposure to temperature extremes
In building SHEDIS-Temperature, we have successfully quantified a wide range of hazard- and exposure-related attributes from 2835 administrative subdivisions associated with 382 records of temperature-related disasters. Distinct patterns emerge across continents regarding both extreme temperatures and the populations bearing the brunt of these extremes. For instance, North America, Europe and northern Asia stand out for having experienced very cold extremes in highly populated areas (Fig. 6a). In terms of human exposure to hot extremes, India, Pakistan and Bangladesh are notably affected by the combination of very high temperatures and large population numbers (Fig. 6c).
Figure 6Geographical distribution of human exposure to extreme temperatures. (a) The lowest apparent temperatures recorded within the analysis period for each cold wave disno in the dataset. Each dot represents a subdivision, with color indicating temperature and size representing the total population of the subdivision the year of the record. (b) Scatter plot showing the relationship between population and temperature. Each dot represents a subdivision, with color indicating the continent. (c, d) are analogous to (a, b) but depict the highest apparent temperatures in heat wave-impacted subdivisions instead. The x axes in (b, d) are logarithmic for visualization purposes.
The data gaps in Africa, the Middle East and Southeast Asia are particularly striking and highlights a broader challenge of international disaster databases. Despite these gaps, our results reveal a global pattern in which warmer administrative subdivisions also tend to be more populated (Fig. 6). This trend is particularly pronounced in subdivisions that have experienced heat wave disasters (Fig. 6d), emphasizing a critical challenge in the face of climate change. These findings also underscore the importance of integrating data across multiple risk dimensions to better identify and understand risk hotspots globally.
Turning now to the results of the event detection analysis using percentile-based thresholds. For a vast majority of the disnos in SHEDIS-Temperature, we could detect percentile exceeding events within the respective subdivisions and analysis periods. All 139 heat wave disnos in the dataset record at least one pct90 event at grid point level within the defined spatiotemporal boundaries, while 133 disnos also experience pct95 events. For over 70 % of the heat wave disnos, pct95 events cover more than half of the analysed area. Similarly, among the 243 cold wave disnos, 233 show pct10 events, and 214 also record pct05 events. For nearly 50 % of the cold wave disnos, pct05 events cover more than half of the analysed area. Taken together, these results support the reliability of EM-DAT reports and their recorded locations.
In terms of human exposure to these percentile-exceeding events, India once again emerges as particularly affected. All eleven disnos in our dataset with the highest number of person-days exposed to pct95 events are recorded in India. Thereafter follows two heat wave records from the United States (1998 and 2011) and the 2003 heat wave in Germany. Despite not being prone to the lowest absolute temperatures, India also ranks prominently for person-days exposed to pct05 cold waves, accounting for five of the top ten disnos in our sample. Other highly ranked cold wave disnos include events in China (2011), Germany, Bangladesh, France, and Poland (all in 2012).
3.2 A case study from the fatal European heat waves in 2003
As previously noted, the year 2003 stands out as one of the years with the highest number of heat wave disnos in the dataset, driven by widespread and severe European heat waves. Four of the five disnos in our sample with the highest reported fatalities in EM-DAT correspond to this event (all disnos beginning with 2003-0391): Italy (20 089 deaths), France (19 490 deaths), Spain (15 090 deaths), and Germany (9355 deaths)2. Among these, the French disno is the most severe fatal impact record for which EM-DAT also includes information on its physical magnitude (maximum temperature of 43 °C) as well as start and end dates (August 1–20). We now examine how this information aligns with the attributes provided by SHEDIS-Temperature (Fig. 7).
Figure 7Selected SHEDIS-Temperature attributes for the 2003 heat wave in France (disno 2003-0391-FRA). Panels depict the maximum 3 hourly air temperature (a, b), duration of pct95 events (c, d) and human exposure expressed in person-days (e, f). The left panels show grid point-level data, while the right panels present outputs at subdivision-level. Note that palette ranges vary across panels, and logarithmic scales are applied for person-days.
The highest 3 hourly air temperature recorded at the grid point level within the impacted subdivisions, based on MSWX data, is 41 °C on 12 August. This is slightly lower than the magnitude reported by EM-DAT, presumably due to the inherent limitations of gridded datasets, which may miss localized extreme values within individual grid cells. The highest temperatures were recorded in inland France, particularly in Haute-Vienne (Fig. 7a and b). However, there is considerable spatial variation within subdivisions, especially in the mountainous areas of the French Alps and the Pyrenees.
Almost all grid points within the analysed subdivisions recorded pct95 events during the analysis period, with a median duration of 11 d (Fig. 7c and d). The longest duration, 31 d, was recorded on the island of Corsica, while the shortest duration, lasting only a few days, occurred in coastal Brittany. Across the entire geometry, pct95 events occurred from 25 July–31 August. Regarding human exposure, the total population in these subdivisions was 43 million in 2003, according to GHS-POP data. The total number of person-days exposed to pct95 events amounted to 508 million, with the largest numbers recorded in the region of Île-de-France, which includes Paris (Fig. 7e and f).
3.3 Technical validation
3.3.1 Sample coverage and geocoding
The dataset includes approximately 80 % (382 out of 468) of the heat wave and cold wave disnos recorded in EM-DAT between 1979 and 2018, covering 243 of 293 cold waves and 139 of 175 heat waves. This coverage is limited to disnos that have been geocoded in GDIS, which primarily depends on the level of detail in the original location descriptions in EM-DAT (Rosvold and Buhaug, 2021). Additionally, a small number of subdivisions have been excluded due to geocoding errors in GDIS, and one misplaced GDIS centroid has been manually corrected (see Sect. 2.1).
Our approach verifies the consistency of GDIS with EM-DAT and eliminates dependence on GDIS-provided ISO codes, which are not completely consistent – with certain countries having been assigned multiple ISO codes in GDIS. While these steps result in partial coverage of EM-DAT disnos and may lead to occasional omissions of subdivisions, they enhance the overall reliability of our dataset by ensuring a high degree of confidence in the included cases.
3.3.2 Evaluation of temperature extremes in MSWX using E-OBS
We use meteorological data from MSWX-Past, a bias-corrected and downscaled version of ERA5. Previous studies show that ERA5 shows reduced performance in areas with sparse in situ observations, as it integrates remotely sensed and ground-based measurements (Hersbach et al., 2020; Liu et al., 2024), and MSWX likely shares this shortcoming. Accuracy is also often reduced in regions with complex terrain and high altitudes. For example, ERA5 tends to underestimate wind speeds in mountainous regions (Beck et al., 2022) which may affect the modelled apparent temperature attributes in SHEDIS-Temperature. Additionally, errors in 2 m air temperature tend to be larger in regions with complex topography and high elevations, as well as deserts and tropical rainforests (Beck et al., 2022; Liu et al., 2024). This inevitably influences the quality of MSWX. The validation study by Beck et al. (2022) showed that MSWX performs comparably to ERA5 over flat terrain and outperforms ERA5 in high-relief areas. This highlights the advantage of using a high-resolution resampled variant of ERA5 to capture local variations in temperature extremes, as exemplified in Fig. 7.
To assess the reliability of the hazard attributes in SHEDIS-Temperature, we systematically evaluated our MSWX-derived extreme temperatures with those of the Europe-wide dataset E-OBS (Cornes et al., 2018; Copernicus Climate Change Service, 2025). The E-OBS dataset is constructed from quality-controlled station data and interpolated over a regular 0.1° grid (Cornes et al., 2018). For this assessment, we used the daily ensemble mean of TX and TN from E-OBS (Copernicus Climate Change Service, 2025).
We evaluated the consistency between MSWX and E-OBS for each subdivision in our dataset fully covered by the E-OBS grid (Fig. B1 and Table 1). We evaluated the agreement for attributes that represent averages across the spatiotemporal domain (zonally averaged across the subdivision geometry and temporally averaged across the respective analysis period; mean_tn and mean_tx) as well as the most 3 hourly extreme value at grid point level (xy_min_tn and xy_max_tx) per record at subdivision level. For each variable and record, temperature errors were calculated as defined in Eq. (5).
Hence, positive values represent overestimation by MSWX in relation to E-OBS.
We then calculated, for each variable and subdivision, the mean bias error (MBE), the mean absolute error (MAE) and the root mean square error (RMSE) as defined in Eqs. (6)–(8).
Table 1 provides the results of these calculations, along with Pearson correlation coefficients (r) and Fig. 8 illustrates correlation between the datasets and the distribution of errors. In general, MSWX shows small systematic biases compared to E-OBS, with a slight overestimation of minimums and an underestimation of maximums. For extreme values (xy_min_tn and xy_max_tx), absolute errors increased, reflecting the greater challenge of reproducing local and short-lived temperature extremes. However, high correlation values seem to reproduce both average and extreme temperature patterns with high fidelity at the subdivision level, reinforcing our confidence in the attributes of SHEDIS-Temperature.
Table 1Validation statistics for MSWX against E-OBS for extreme daily temperatures. Statistics include the number of observations (n), mean bias error (MBE), mean absolute error (MAE), root mean square error (RMSE), and Pearson correlation coefficient (r). The variables mean_tn and mean_tx represent spatiotemporally averaged values across each record's domain, while xy_min_tn and xy_max_tx correspond to the most extreme TN and TX values at the grid-cell level within the domain.
Figure 8Validation of MSWX against E-OBS for extreme daily temperatures. Panels (a, b) represent minimum temperature agreement for cold wave records, and panels (c, d) show maximum temperatures for heat wave records (covered by the E-OBS grid). Scatter plots illustrate agreement between the datasets, and histograms show the distribution of errors. Each dot in the scatter plots represents a subdivision-level record. The variables mean_tn and mean_tx represent spatiotemporally averaged values across each record's domain, while xy_min_tn and xy_max_tx correspond to the most extreme TN and TX values at the grid-cell level within the domain. Grey lines in the scatter plots indicate linear trend fits, with the coefficient of determination annotated – and grey lines in the histograms indicate the mean bias error. Please note that axes intervals vary between panels.
3.3.3 Consistency of temperature extremes in MSWX and EM-DAT
We also evaluated the consistency of our most extreme 3 hourly MSWX temperatures recorded at the grid point level with the temperatures reported in EM-DAT. The latter correspond to a maximum temperature for heat waves and a minimum temperature for cold waves. This information is, however, only available for a subset of cases, specifically 94 heat wave disnos and 120 cold wave disnos in our sample (Fig. 9).
Figure 9Comparison of extreme temperatures between EM-DAT and MSWX. The scatter plots illustrate the relationship between reported temperature extremes in EM-DAT and the corresponding 3 hourly minimum (a) and maximum (b) temperatures from MSWX for 120 cold waves and 94 heat waves, respectively. Each point represents a disno, with colours indicating the continent.
Overall, the agreement between MSWX and EM-DAT is stronger for heat waves (MAE=2.6 °C) than for cold waves (MAE=8.3 °C). For heat waves, MSWX and EM-DAT show reasonable consistency, though EM-DAT values tend to be slightly higher, with an average bias of 0.81°C (Fig. 9a). This discrepancy may reflect the inability of gridded datasets like MSWX to fully capture localized temperature extremes, as previously discussed.
A few outliers are evident. For example, a heatwave in the Borno Province, Nigeria, in June 2002 records a maximum temperature of 60 °C in EM-DAT, whereas the corresponding MSWX estimate is 44 °C (Fig. 9b). To put these values into context, according to the World Meteorological Organisation the official highest registered air temperature on Earth is 56.7 °C, recorded in Death Valley in the United States (WMO Records of Weather and Climate Extremes, 2024). This casts doubts on the veridicity of the EM-DAT record, which likely echoes news reporting from the time describing temperatures reaching 55–60 °C in Maiduguri (The New Humanitarian, 2002). While the EM-DAT value may sometimes be an overestimation of the actual conditions, differences between the two datasets may also reflect challenges of global reanalysis datasets such as MSWX to capture localized extreme temperatures, as shown by our evaluation using E-OBS. For instance, MSWX will miss hot temperatures exacerbated by urban heat island effects.
The comparison for cold waves exhibits greater variability (Fig. 9a). On average, EM-DAT reports minimum temperatures 1.8 °C higher than MSWX estimates, but the spread is substantial in both directions. For instance, a group of cold wave disnos in India show a stark contrast, with MSWX minimum temperatures near −40 °C, while EM-DAT reports values slightly above 0 °C (Fig. 9a). In December 2002 (disno 2002-0818-IND), for instance, EM-DAT reports a magnitude of 5 °C across a number of regions encompassing the most northern part of India (Bihar, Uttar Pradesh, Himachal Pradesh, Rajasthan, Jharkhand, Jammu and Kashmir, Punjab, Haryana, Delhi provinces). Within these subdivisions, MSWX estimates a minimum of −42 °C, recorded in the mountainous Jammu and Kashmir region, which is climatically diverse across altitude levels and regularly experiences sub-zero temperatures in winter.
This discrepancy is likely driven by factors beyond differences in temperature measurement methods. The EM-DAT magnitude record of 5 °C for the 2002 cold wave likely comes from accounts such as “On many occasions the average temperature was less than 5 °C for consecutive days” (Samra et al., 2003, p. “Preface”). EM-DAT magnitude records may thus, in some cases, reflect prolonged conditions in areas that suffered large socioeconomic losses (e.g. agricultural damage). In contrast, SHEDIS-Temperature quantifies extremes across all grid points within the impacted subdivisions. The percentile-based event detection analysis at the grid-point level can provide users with a more spatially detailed representation of cold waves in regions with high climatic variability. Taken together, these findings highlight the need for systematic approaches to linking hazard magnitude estimates with disaster impact records.
3.3.4 Sensitivity analysis of the influence of detrending
We also conducted a sensitivity analysis to assess the influence of detrending the TX and TN time series prior to the percentile calculations, by deriving percentiles directly from the non-detrended data (see Appendix C for details). The analysis compared differences in the resulting percentiles and their downstream effects on the SHEDIS-Temperature outputs.
Overall, using detrended time series resulted in slightly less extreme percentile thresholds compared to the non-detrended series – meaning slightly milder warmer thresholds for cold wave detection and slightly cooler thresholds for heat wave detection. The influence was also slightly larger on percentiles for heat waves than for cold waves: the former showed mean differences of approximately 0.05 °C and the latter of about 0.01 °C (averaged across the spatiotemporal domains for each subdivision in our sample).
The list of disnos associated with threshold-exceeding events at grid point level remained unchanged, although the number of subdivisions threshold exceeding events changed slightly. The differences appeared in subdivisions for which small parts of the geometries experienced threshold-exceeding events. Regarding the relevant SHEDIS-Temperature attributes, using non-detrended instead of detrended data resulted in differences of less than 1 % across all related attributes, except for the person-day attributes for heat wave records, which differed by maximum 2.4 %.
SHEDIS-Temperature is publicly available from a Harvard Dataverse repository (https://doi.org/10.7910/DVN/WNOTTC; Lindersson and Messori, 2025), with replication code published on Zenodo (https://doi.org/10.5281/zenodo.17571341, Lindersson, 2025). The dataset is organized into two main folder structures: one for heat waves and one for cold waves (Fig. 10). Each folder contains four primary files, with content as outlined in Table 2. Two files (CSV and GeoPackage) contain attributes aggregated at the disno-level, with one row per disno. Two additional files (CSV and GeoPackage) contain attributes aggregated at the subdivision-level, with one row per subdivision and disno. These files do, however, also include information derived at the grid point level. The only distinction between the CSV and GeoPackage files is that the latter also contain the geometries delineating the analysis domain.
Table 2Attributes available in files beginning with “shedis”. These files are provided in both CSV and GeoPackage formats at two spatial levels: the disno level (one row per disno) and the subdivision level (one row per disno and subdivision). This table lists attributes for heat waves in files beginning with “shedis_heatwaves”, including: TX (daily maximum temperature) and ATX (daily maximum apparent temperature). For cold waves, these attributes are replaced with TN (daily minimum air temperature) and ATN (daily minimum apparent temperature). Attributes from the event detection analysis are here denoted as “pctXX”, with pct90 and pct95 applied to heat waves, and pct10 and pct05 applied to cold waves. Variables prefixed with “xy_” represent values calculated at the grid-point level, while all other hazard-related variables have been spatially averaged over the corresponding geometry. An asterisk (*) denotes attributes available exclusively in the GeoPackage format.
Figure 10Folder structure of SHEDIS-Temperature. The dataset is organized into two main folders, each corresponding to a specific hazard type. Within the “threshold-exceeding-events” directories, subfolders are labeled based on the threshold and minimum duration used for analysis. Files within these subfolders are named by their disaster identifier number (disno) from EM-DAT (e.g. “2003-0391-FRA”).
SHEDIS-Temperature also includes subfolders containing detailed outputs from the detection of threshold-exceeding events, with subfolder names specifying the threshold and minimum duration used for analysis (Fig. 10). These subfolders contain one CSV file per disno for which threshold-exceeding events were detected, with one row per subdivision, coordinate pair, and detected event. The information in these files is detailed in Table 3.
Table 3Attributes in CSV files in the folders named “threshold-exceeding-events”. The files are named according to their respective disaster identifier number (disno) from EM-DAT and contain one row per subdivision and event. Each event is identified using percentile-based thresholds at grid point-level. This table lists attributes for heat waves, including TX (daily maximum air temperature) and ATX (daily maximum apparent temperature). For cold waves, these attributes are replaced with TN (daily minimum air temperature) and ATN (daily minimum apparent temperature). Results are provided for thresholds of analysis using the 90th and 95th percentiles for heat waves and the 10th and 5th percentiles for cold waves, as explicitly indicated in the subfolder names and the “mean_pctXX”-attribute.
4.1 Usage notes
SHEDIS-Temperature provides attributes for records in the international disaster database EM-DAT, which is known to have reporting biases, with higher coverage in advanced economies such as those in Europe and North America (CRED and UCLouvain, 2024; Gall et al., 2009; Osuteye et al., 2017). Users should be mindful about this bias when comparing disaster frequencies across continents. Furthermore, EM-DAT only records major disasters that meet at least one of the following criteria: ≥10 fatalities, ≥100 affected individuals, a declared state of emergency, and/or a request for international assistance. The database's coverage is thus affected by exposure and vulnerability, as well as differing national criteria to declare a state of emergency.
The meteorological and population attributes in SHEDIS-Temperature are derived from global gridded products. As such, the results should be interpreted with caution at local scales. Consequently, SHEDIS-Temperature includes administrative subdivisions at the level-1 (province/equivalent) and level-2 (county/district/equivalent) scales, but not finer.
We reiterate that localized extreme temperature events occurring at spatial scales smaller than the grid resolution are not fully captured. The aggregation from hourly ERA5 data to the 3 hourly MSWX time steps may also obscure short-lived temperature peaks. The technical validation against the station-based E-OBS dataset indicated higher agreement for the spatiotemporally averaged extreme temperature attributes (mean_tn and mean_tx) compared to the 3 hourly grid-point attributes (xy_mean_tn and xy_mean_tx). The analysis also showed that MSWX slightly underestimates temperature extremes for both attribute types. Nonetheless, correlation coefficients exceeded 0.9 across all variables, supporting the overall reliability of the hazard metrics derived from MSWX. It should be noted, however, that the validation was performed only for air temperature over Europe, and not for apparent temperature or regions outside Europe.
Our dataset is provided at two spatial levels: the disno-level and subdivision-level. We anticipate that the disno-level data will be particularly useful for comparative analyses across countries, while the subdivision-level data will facilitate the examination of within-country variations.
The spatial boundaries of the SHEDIS-Temperature analysis are limited to the administrative subdivisions recorded as impacted locations in EM-DAT, where “impacted” refers to areas affected by socioeconomic losses. As a result, these boundaries are not meant to outline the spatial extent of the meteorological events per se. These boundaries also outline the domain for the analyses at grid point level.
For the percentile-based detection analysis, we enforce a minimum duration of three consecutive days for heat waves and cold waves, a widely used criterion in extreme temperature studies (e.g. Meehl and Tebaldi, 2004; Perkins and Alexander, 2013; Perkins-Kirkpatrick and Lewis, 2020). This is, however, more conservative than the definition by EM-DAT of two days or longer (CRED and UCLouvain, 2024). While this kind of methodological choices will always be, to some extent, arbitrary we think that the main benefit of SHEDIS-Temperature is the application of consistent methodological choices across all records to ensure comparability. Users who prefer different event detection settings can use our publicly available R-scripts (https://doi.org/10.5281/zenodo.17571341, Lindersson, 2025) to do so.
We highlight below some key practical usage points to note:
-
To link SHEDIS-Temperature with EM-DAT, users can match the disaster identifier code (“disno”) present in both datasets, but in EM-DAT currently written as “DisNo.”.
-
Users should ensure UTF-8 encoding is used when reading SHEDIS files to correctly display location names.
-
For projecting coordinate-specific CSV outputs to raster files, users should adopt the same grid as MSWX.
-
The polygons in the GeoPackage files in SHEDIS-Temperature are simplified versions of the original polygons from GADM v3.6. To access the original polygons, users may retrieve the “gadm_gid” identifiers in SHEDIS-Temperature, which correspond to “GID_1” for level-1 subdivisions and “GID_2” for level-2 subdivisions in GADM.
-
The R scripts used to generate SHEDIS-Temperature outputs are available on Zenodo (Lindersson, 2025) as R Markdown files, along with an accompanying ReadMe file.
International databases of socioeconomic disaster impacts are essential for disaster risk research, yet they display important geographic coverage biases. The data gap is particularly striking in Africa, the Middle East and Southeast Asia, and addressing it will require continued efforts from the global disaster research community. Nonetheless, it is critical to maximize the usefulness of the data that we do have available. SHEDIS-Temperature addresses this need by enriching the information about major temperature-related disasters across five continents.
By providing detailed hazard information – such as temperature thresholds, duration, and geographic distribution – and linking it to exposure data (e.g., population counts during threshold-exceeding events), SHEDIS-Temperature enables more comprehensive analyses of past temperature-related disasters. For instance, users may calculate mortality rates by combining EM-DAT's fatality numbers with the exposure information in SHEDIS-Temperature. Researchers can further combine SHEDIS-Temperature with other socioeconomic and political indicators. This type of information is essential for statistical studies of how risk varies across time and regions. Ultimately, we think that this also can enhance the understanding of social and societal vulnerabilities, revealing how exposure to extreme temperatures intersects with socioeconomic factors over time and across regions.
At first glance, the results from SHEDIS-Temperature evidence a concerning trend: more populated subdivisions tend to face higher temperatures, a pattern that will likely intensify as climate change progresses. The intersection of rising temperatures and population growth will amplify risk, particularly in regions already facing the most severe temperature-related disasters. Identifying such risk hotspots underscores the importance of collecting data across the entire disaster risk spectrum in a systematic manner, and of making the outputs accessible to an interdisciplinary set of disaster researchers.
Figure A1Spatial distribution of administrative subdivisions linked to SHEDIS-Temperature disaster records. A total of 1274 distinct subdivisions is linked to 382 disaster records in the SHEDIS-Temperature sample. Panel (a) shows the 931 distinct level-1 administrative units (provinces or equivalents) and panel (b) shows the 343 distinct level-2 administrative units (counties, districts, or equivalents). Colours indicate the number of times each subdivision is linked to a disaster record in the sample.
Figure B1Spatial distribution of administrative level 1 subdivisions included in the technical validation against the E-OBS dataset. Colours indicate the error e as defined in Eq. (5), where positive values represent overestimation by MSWX relative to E-OBS. Some subdivisions are associated with multiple records per hazard type, for which mean e values are shown. Only level 1-units are visualized to avoid spatial overlap and because they constitute the majority of the sample, although level 2-units were also included in the validation analysis.
This appendix presents the results of the sensitivity analysis evaluating the influence of detrending the TX and TN time series prior to percentile calculation. Percentiles were here derived directly from the non-detrended data, and all subsequent processing steps were repeated. The sensitivity analysis then assessed the impact of detrending on (a) the resulting percentiles, which serve as thresholds in the event detection analysis, and (b) the downstream effects on the SHEDIS-Temperature outputs.
Here we define differences between the two methods following Eq. (C1).
Hence, a positive difference indicates that the value derived from the non-detrended time series is higher than that from the detrended series, whereas a negative difference indicates that the detrended series yields a higher value.
C1 Influence on percentiles
Turning first to how detrending of TX and TN time series influences the percentile values subsequently used in the event detection analysis. Percentiles were calculated for each day of the year and grid cell, while here we present values averaged across the spatiotemporal domain of each SHEDIS-Temperature subdivision (i.e. zonally averaged across the geometry and temporally averaged over the analysis period).
Both Table C1 and Fig. C1 show that the differences are somewhat larger for percentiles based on TX (pct90 and pct95) than for those based on TN (pct10 and pct05). The mean differences and standard deviations were also greater for the 5th and 95th percentiles than for the 10th and 90th percentiles. This indicates that the detrending choice affects results for the heat wave records at the 95th percentile level the most. However, the (spatiotemporally averaged) percentile differences between the methods are generally small, with maximum deviations of –0.33 °C for pct05 and 0.42 °C for pct95.
C2 Influence on SHEDIS-Temperature outputs
Next, we assessed how the relatively small differences in percentile values between the detrended and non-detrended methods affected the subsequent SHEDIS-Temperature outputs. We first provide an overview by comparing the number of subdivisions and disnos for which threshold-exceeding events were detected using each method (Table C2). The results here refer to the event detection analysis at grid point level. The number of disnos associated with threshold-exceeding events remained unchanged, whereas the total number of subdivisions with detected threshold-exceeding events varied slightly, with a maximum difference of four subdivisions per percentile-level.
Differences in total counts do not necessarily imply that events were detected in the same subdivisions. At the 10th percentile level, seven cold-wave subdivisions were flagged as experiencing threshold-exceeding events in either the detrended or non-detrended method, but not both; this number increased to 16 subdivisions at the 5th percentile level. For heat waves, six subdivisions were flagged at pct90 and six at pct95. In general, these differences occurred in subdivisions where threshold-exceeding events covered small portions of the geometry area.
Finally, we summarize the influence of detrending on relevant subdivision-level attributes across all SHEDIS-Temperature records (Table C3). Using non-detrended time series instead of detrended series resulted in relative differences of less than 1 % for all attributes, except for the person-day attributes for pct90 and pct95, which differed by maximum 2.4 %.
Table C1summarizes the differences between the detrended and non-detrended methods. Using detrended time series resulted in slightly less extreme percentile thresholds compared to the non-detrended series: marginally warmer thresholds for cold wave detection and slightly cooler thresholds for heat wave detection. Figure C1 also illustrates these differences across the full temperature range.
Table C2Changes in the number of subdivisions with at least one threshold-exceeding event at grid point level, depending on whether detrending was applied prior to percentile calculation. The event detection analysis was conducted in all 1796 subdivisions associated with the 243 cold wave disnos, and all 1030 subdivisions associated with 132 heat wave disnos. The percentage values refer to the share of subdivisions or disnos in the sample.
Table C3Changes in relevant attributes of the SHEDIS-temperature outputs. The difference is defined as the values coming from using the non-detrended time series minus the detrended time series.
Figure C1Scatter plots of percentile differences (°C) across temperature levels. Panels (a, b) illustrate percentiles used for cold wave records (pct10 and pct05 from TN), and panels (c, d) show for heat-wave records (pct90 and pct95 from TX). Each dot represents a SHEDIS-Temperature record at the subdivision level, with percentile values averaged across its spatiotemporal domain. Differences are defined as percentile values from the non-detrended time series minus those from the detrended time series. Black lines denote linear trend fits.
SL: Conceptualization (equal); Data curation (lead); Formal analysis and validation (lead); Methodology (lead); Visualization (lead); Writing – original draft preparation (lead); Writing – review and editing (lead). GM: Conceptualization (equal); Funding Acquisition (lead); Methodology (supporting); Writing – review and editing (supporting).
The contact author has declared that neither of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
We gratefully acknowledge the data providers, including the Centre for Research on the Epidemiology of Disasters (EM-DAT), NASA Socioeconomic Data and Applications Center (GDIS), the GADM project, GloH2O (MSWX), and the European Commission Joint Research Centre (GHS-POP). AI tools, including ChatGPT (OpenAI) and Copilot (Microsoft), provided occasional assistance with R code writing and manuscript revisions, but they were not used to generate content.
This research has been supported by the Swedish Centre for Impacts of Climate Extremes (climes), the Centre of Natural Hazards and Disaster Science (CNDS), the Swedish Research Council (Vetenskapsrådet; grant no. 2022-06599), and Formas (grant no. 2023-01774).
This paper was edited by Qingxiang Li and reviewed by Aolin Jia and two anonymous referees.
Acevedo Mejia, S.: Gone with the wind: estimating hurricane and climate change costs in the Caribbean, IMF Working Papers, 2016, https://doi.org/10.5089/9781475544763.001, 2016.
Aitsi-Selmi, A., Egawa, S., Sasaki, H., Wannous, C., and Murray, V.: The Sendai Framework for Disaster Risk Reduction: Renewing the Global Commitment to People's Resilience, Health, and Well-being, International Journal of Disaster Risk Science, 6, 164–176, https://doi.org/10.1007/s13753-015-0050-9, 2015.
Baston, D.: exactextractr: Fast Extraction from Raster Datasets using Polygons, R package version 0.10.0, GitHub [code], https://github.com/isciences/exactextractr, last access: 10 October 2024.
Beck, H. E., van Dijk, A. I. J. M., Larraondo, P. R., McVicar, T. R., Pan, M., Dutra, E., and Miralles, D. G.: MSWX: Global 3-Hourly 0.1° Bias-Corrected Meteorological Data Including Near-Real-Time Updates and Forecast Ensembles, B. Am. Meteorol. Soc., 103, E710–E732, https://doi.org/10.1175/BAMS-D-21-0145.1, 2022.
Brimicombe, C., Di Napoli, C., Cornforth, R., Pappenberger, F., Petty, C., and Cloke, H. L.: Borderless Heat Hazards With Bordered Impacts, Earth's Future, 9, e2021EF002064, https://doi.org/10.1029/2021EF002064, 2021.
Brown, S. J.: Future changes in heatwave severity, duration and frequency due to climate change for the most populous cities, Weather and Climate Extremes, 30, 100278, https://doi.org/10.1016/j.wace.2020.100278, 2020.
Buzan, J. R., Oleson, K., and Huber, M.: Implementation and comparison of a suite of heat stress metrics within the Community Land Model version 4.5, Geosci. Model Dev., 8, 151–170, https://doi.org/10.5194/gmd-8-151-2015, 2015.
Carioli, A., Schiavina, M., MacManus, K. J., and Freire, S.: GHS-POP R2023A – GHS population grid multitemporal (1975–2030), European Commission, Joint Research Centre (JRC) [Dataset], https://doi.org/10.2905/2FF68A52-5B5B-4A22-8F40-C41DA8332CFE, 2023.
Casanueva A.: HeatStress (v1.0.7_zenodo), Zenodo [code], https://doi.org/10.5281/zenodo.3264930, 2019.
Copernicus Climate Change Service: E-OBS dataset, v 31.0e 0.1 deg regular grid, ensemble mean TN and TX, https://surfobs.climate.copernicus.eu/dataaccess/access_eobs.php, last access: 10 September 2025.
Cornes, R. C., van der Schrier, G., van den Besselaar, E. J. M., and Jones, P. D.: An Ensemble Version of the E-OBS Temperature and Precipitation Data Sets, J. Geophys. Res.-Atmos., 123, 9391–9409, https://doi.org/10.1029/2017JD028200, 2018.
CRED and UCLouvain: EM-DAT [data set], https://www.emdat.be, last access: 2 October 2023.
CRED and UCLouvain: EM-DAT documentation, https://doc.emdat.be/docs/, last access: 14 November 2024.
Cvijanovic, I., Mistry, M. N., Begg, J. D., Gasparrini, A., and Rodó, X.: Importance of humidity for characterization and communication of dangerous heatwave conditions, npj Climate and Atmospheric Science, 6, 33, https://doi.org/10.1038/s41612-023-00346-x, 2023.
Delforge, D., Wathelet, V., Below, R., Sofia, C. L., Tonnelier, M., van Loenhout, J. A. F., and Speybroeck, N.: EM-DAT: the Emergency Events Database, Int. J. Disaster Risk Reduct., 124, 105509, https://doi.org/10.1016/j.ijdrr.2025.105509, 2025.
Dellmuth, L. M., Bender, F. A.-M., Jönsson, A. R., Rosvold, E. L., and von Uexkull, N.: Humanitarian need drives multilateral disaster aid, P. Natl. Acad. Sci. USA, 118, e2018293118, https://doi.org/10.1073/pnas.2018293118, 2021.
European Commission: GHSL Data Package 2023, Publications Office of the European Union, Luxembourg [data set], https://doi.org/10.2760/098587, 2023.
Felbermayr, G. and Gröschl, J.: Naturally negative: The growth effects of natural disasters, Journal of Development Economics, 111, 92–106, https://doi.org/10.1016/j.jdeveco.2014.07.004, 2014.
Gall, M., Borden, K. A., and Cutter, S. L.: When Do Losses Count?: Six Fallacies of Natural Hazards Loss Data, B. Am. Meteorol. Soc., 90, 799–810, https://doi.org/10.1175/2008BAMS2721.1, 2009.
Gallo, E., Quijal-Zamorano, M., Méndez Turrubiates, R. F., Tonne, C., Basagaña, X., Achebak, H., and Ballester, J.: Heat-related mortality in Europe during 2023 and the role of adaptation in protecting health, Nat. Med., 30, 3101–3105, https://doi.org/10.1038/s41591-024-03186-1, 2024.
García-León, D., Masselot, P., Mistry, M. N., Gasparrini, A., Motta, C., Feyen, L., and Ciscar, J.-C.: Temperature-related mortality burden and projected change in 1368 European regions: a modelling study, The Lancet Public Health, 9, e644–e653, https://doi.org/10.1016/S2468-2667(24)00179-8, 2024.
Gasparrini, A., Guo, Y., Hashizume, M., Lavigne, E., Zanobetti, A., Schwartz, J., Tobias, A., Tong, S., Rocklöv, J., Forsberg, B., Leone, M., De Sario, M., Bell, M. L., Guo, Y.-L. L., Wu, C., Kan, H., Yi, S.-M., de Sousa Zanotti Stagliorio Coelho, M., Saldiva, P. H. N., Honda, Y., Kim, H., and Armstrong, B.: Mortality risk attributable to high and low ambient temperature: a multicountry observational study, The Lancet, 386, 369–375, https://doi.org/10.1016/S0140-6736(14)62114-0, 2015.
Green, H. K., Lysaght, O., Saulnier, D. D., Blanchard, K., Humphrey, A., Fakhruddin, B., and Murray, V.: Challenges with Disaster Mortality Data and Measuring Progress Towards the Implementation of the Sendai Framework, International Journal of Disaster Risk Science, 10, 449–461, https://doi.org/10.1007/s13753-019-00237-x, 2019.
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020.
Hijmans, R: terra: Spatial Data Analysis, R package version 1.8-70 [code], https://rspatial.org/, last access: 18 November 2025.
IPCC: Climate Change 2023: Synthesis Report. Contribution of Working Groups I, II and III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, IPCC, Geneva, Switzerland, https://doi.org/10.59327/IPCC/AR6-9789291691647, 2023.
Jones, R. L., Guha-Sapir, D., and Tubeuf, S.: Human and economic impacts of natural disasters: can we trust the global data?, Sci. Data, 9, 572, https://doi.org/10.1038/s41597-022-01667-x, 2022.
Jones, R. L., Kharb, A., and Tubeuf, S.: The untold story of missing data in disaster research: a systematic review of the empirical literature utilising the Emergency Events Database (EM-DAT), Environ. Res. Lett., 18, 103006, https://doi.org/10.1088/1748-9326/acfd42, 2023.
Kageyama, Y. and Sawada, Y.: Global assessment of subnational drought impact based on the Geocoded Disasters dataset and land reanalysis, Hydrol. Earth Syst. Sci., 26, 4707–4720, https://doi.org/10.5194/hess-26-4707-2022, 2022.
Kahn, M. E.: The Death Toll from Natural Disasters: The Role of Income, Geography, and Institutions, The Review of Economics and Statistics, 87, 271–284, 2005.
Lindersson, S.: sara-lindersson/shedis-temperature-replication-code: Replication code for 'SHEDIS-Temperature' (v1.0.1), Zenodo [code], https://doi.org/10.5281/zenodo.17571341, 2025.
Lindersson, S. and Messori, G.: SHEDIS-temperature v1, Harvard Dataverse, https://doi.org/10.7910/DVN/WNOTTC, 2025.
Lindersson, S., Raffetti, E., Rusca, M., Brandimarte, L., Mård, J., and Di Baldassarre, G.: The wider the gap between rich and poor the higher the flood mortality, Nat. Sustain., 6, 995–1005, https://doi.org/10.1038/s41893-023-01107-7, 2023.
Liu, R., Zhang, X., Wang, W., Wang, Y., Liu, H., Ma, M., and Tang, G.: Global-scale ERA5 product precipitation and temperature evaluation, Ecol. Indic., 166, 112481, https://doi.org/10.1016/j.ecolind.2024.112481, 2024.
Lüthi, S., Fairless, C., Fischer, E. M., Scovronick, N., Ben Armstrong, Coelho, M. D. S. Z. S., Guo, Y. L., Guo, Y., Honda, Y., Huber, V., Kyselý, J., Lavigne, E., Royé, D., Ryti, N., Silva, S., Urban, A., Gasparrini, A., Bresch, D. N., and Vicedo-Cabrera, A. M.: Rapid increase in the risk of heat-related mortality, Nat. Commun., 14, 4894, https://doi.org/10.1038/s41467-023-40599-x, 2023.
Masselot, P., Mistry, M., Vanoli, J., Schneider, R., Iungman, T., Garcia-Leon, D., Ciscar, J.-C., Feyen, L., Orru, H., Urban, A., Breitner, S., Huber, V., Schneider, A., Samoli, E., Stafoggia, M., de'Donato, F., Rao, S., Armstrong, B., Nieuwenhuijsen, M., Vicedo-Cabrera, A. M., Gasparrini, A., Achilleos, S., Kyselý, J., Indermitte, E., Jaakkola, J. J. K., Ryti, N., Pascal, M., Katsouyanni, K., Analitis, A., Goodman, P., Zeka, A., Michelozzi, P., Houthuijs, D., Ameling, C., Rao, S., das Neves Pereira da Silva, S., Madureira, J., Holobaca, I.-H., Tobias, A., Íñiguez, C., Forsberg, B., Åström, C., Ragettli, M. S., Analitis, A., Katsouyanni, K., Surname, F. name, Zafeiratou, S., Vazquez Fernandez, L., Monteiro, A., Rai, M., Zhang, S., and Aunan, K.: Excess mortality attributed to heat and cold: a health impact assessment study in 854 cities in Europe, The Lancet Planetary Health, 7, e271–e281, https://doi.org/10.1016/S2542-5196(23)00023-2, 2023.
McBean, G. and McCarthy, J.: Narrowing the Uncertainties: A scientific Action Plan for Improved Prediction of Global Climate Change, Chapter 11 in “FAR Climate Change: Scientific Assessment of Climate Change”, Cambridge University Press, Cambridge, New York, USA, ISBN 0521 407206, 1990.
Meehl, G. A. and Tebaldi, C.: More Intense, More Frequent, and Longer Lasting Heat Waves in the 21st Century, Science, 305, 994–997, https://doi.org/10.1126/science.1098704, 2004.
Mester, B., Frieler, K., and Schewe, J.: Human displacements, fatalities, and economic damages linked to remotely observed floods, Sci. Data, 10, 482, https://doi.org/10.1038/s41597-023-02376-9, 2023.
Mochizuki, J., Mechler, R., Hochrainer-Stigler, S., Keating, A., and Williges, K.: Revisiting the “disaster and development” debate – Toward a broader understanding of macroeconomic risk and resilience, Clim. Risk Manag., 3, 39–54, https://doi.org/10.1016/j.crm.2014.05.002, 2014.
Muñoz Sabater, J.: ERA5-Land hourly data from 1950 to present. Copernicus Climate Change Service Climate Data Store [data set], https://doi.org/10.24381/cds.e2161bac, 2019.
Osuteye, E., Johnson, C., and Brown, D.: The data gap: An analysis of data availability on disaster losses in sub-Saharan African cities, International Journal of Disaster Risk Reduction, 26, 24–33, https://doi.org/10.1016/j.ijdrr.2017.09.026, 2017.
Panwar, V. and Sen, S.: Disaster Damage Records of EM-DAT and DesInventar: A Systematic Comparison, Economics of Disasters and Climate Change, 4, 295–317, https://doi.org/10.1007/s41885-019-00052-0, 2020.
Pebesma, E.: Simple Features for R: Standardized Support for Spatial Vector Data, The R Journal 10, 439–446, https://doi.org/10.32614/RJ-2018-009, 2018.
Perkins, S. E. and Alexander, L. V.: On the Measurement of Heat Waves, J. Climate, 26, 4500–4517, https://doi.org/10.1175/JCLI-D-12-00383.1, 2013.
Perkins-Kirkpatrick, S. E. and Lewis, S. C.: Increasing trends in regional heatwaves, Nat. Commun., 11, 3357, https://doi.org/10.1038/s41467-020-16970-7, 2020.
Rosvold, E. L. and Buhaug, H.: GDIS, a global dataset of geocoded disaster locations, Sci. Data, 8, 61, https://doi.org/10.1038/s41597-021-00846-6, 2021.
Russo, S., Dosio, A., Graversen, R. G., Sillmann, J., Carrao, H., Dunbar, M. B., Singleton, A., Montagna, P., Barbola, P., and Vogt, J. V.: Magnitude of extreme heat waves in present climate and their projection in a warming world, J. Geophys. Res.-Atmos., 119, 12500–12512, https://doi.org/10.1002/2014JD022098, 2014.
Russo, S., Sillmann, J., and Fischer, E. M.: Top ten European heatwaves since 1950 and their occurrence in the coming decades, Environ. Res. Lett., 10, 124003, https://doi.org/10.1088/1748-9326/10/12/124003, 2015.
Russo, S., Sillmann, J., and Sterl, A.: Humid heat waves at different warming levels, Sci. Rep., 7, 7477, https://doi.org/10.1038/s41598-017-07536-7, 2017.
Russo, S., Sillmann, J., Sippel, S., Barcikowska, M. J., Ghisetti, C., Smid, M., and O'Neill, B.: Half a degree and rapid socioeconomic development matter for heatwave risk, Nat. Commun., 10, 136, https://doi.org/10.1038/s41467-018-08070-4, 2019.
Samra, J. S., Singh, G., and Ramakrishna, Y. S.: Cold wave of 2002-03 – Impact on Agriculture, Indian Council of Agricultural Research, accessed 2025-03-05 from https://un-spider.org/sites/default/files/6-Cold%20Wave_Impacts_India_Indian%20Council%20of%20Agricultural%20Research.pdf (last access: 18 November 2025), 2003.
Schulzweida, U.: CDO User Guide, Zenodo [data set], https://doi.org/10.5281/zenodo.10020800, 2023.
Steadman, R. G.: A Universal Scale of Apparent Temperature, J. Appl. Meteorol. Clim., 23, 1674–1687, https://doi.org/10.1175/1520-0450(1984)023<1674:AUSOAT>2.0.CO;2, 1984.
Steadman, R. G.: Norms of apparent temperature in Australia, Aust. Meteorol. Mag., 43, 1–16, 1994.
Teucher, A., Russell, K., and Bloch, M.: R package “rmapshaper” version 0.5.0, GitHub [code], https://github.com/ateucher/rmapshaper, last access: 1 October 2024.
The New Humanitarian: At least 60 die of heat stroke in Maiduguri, https://www.thenewhumanitarian.org/news/2002/06/11/least-60-die-heat-stroke-maiduguri (last access: 5 March 2025), 2002.
Tselios, V. and Tompkins, E. L.: What causes nations to recover from disasters? An inquiry into the role of wealth, income inequality, and social welfare provisioning, International Journal of Disaster Risk Reduction, 33, 162–180, https://doi.org/10.1016/j.ijdrr.2018.10.003, 2019.
UNDRR: Data and digital maturity for disaster risk reduction: Informing the next generation of disaster loss and damage databases, United Nations Office for Disaster Risk Reduction (UNDRR) United Nations Development Programme (UNDP), Bonn, Germany, https://www.undrr.org/quick/75391 (last access: 18 November 2025), 2022.
Vestby, J., Schutte, S., Tollefsen, A. F., and Buhaug, H.: Societal determinants of flood-induced displacement, P. Natl. Acad. Sci. USA, 121, e2206188120, https://doi.org/10.1073/pnas.2206188120, 2024.
Vogel, M. M., Zscheischler, J., Wartenburger, R., Dee, D., and Seneviratne, S. I.: Concurrent 2018 Hot Extremes Across Northern Hemisphere Due to Human-Induced Climate Change, Earth's Future, 7, 692–703, https://doi.org/10.1029/2019EF001189, 2019.
Wirtz, A., Kron, W., Löw, P., and Steuer, M.: The need for data: natural disasters and the challenges of database management, Nat. Hazards, 70, 135–157, https://doi.org/10.1007/s11069-012-0312-4, 2014.
WMO Climatological Normals: https://community.wmo.int/en/activity-areas/climate-services/climate-products-and-initiatives/wmo-climatological-normals, last access: 4 December 2024.
WMO Records of Weather and Climate Extremes: https://wmo.int/sites/default/files/2024-07/Table_Records_02Jul2024.pdf, last access: 4 December 2024.
Yin, Y., He, L., Wennberg, P. O., and Frankenberg, C.: Unequal exposure to heatwaves in Los Angeles: Impact of uneven green spaces, Sci. Adv., 9, eade8501, https://doi.org/10.1126/sciadv.ade8501, 2023.
For disno 1999-0068-RUS, the centroid of the Russian subdivision Chukot had been incorrectly located within the neighbouring subdivision Sakha by GDIS. This error may arose because Chukot spans the 180th meridian, which can distort the centroid location depending on the methodology used.
The only disno with a higher reported fatality count is a Russian heatwave disno from 2010, with over 55 000 fatalities recorded.
- Abstract
- Introduction
- Data and methods
- Results
- Code and data availability
- Conclusions
- Appendix A: Supplementary figure of the sample at subnational level
- Appendix B: Supplementary figure from the MSWX evaluation with E-OBS
- Appendix C: Sensitivity analysis of the influence of detrending prior to percentile calculation
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References
- Abstract
- Introduction
- Data and methods
- Results
- Code and data availability
- Conclusions
- Appendix A: Supplementary figure of the sample at subnational level
- Appendix B: Supplementary figure from the MSWX evaluation with E-OBS
- Appendix C: Sensitivity analysis of the influence of detrending prior to percentile calculation
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References