Marine Heat waves &ndash; Multiple Analysis / Definitions (MHW-MAD): A Multi-Definition Global Marine Heatwave Dataset from Satellite Sea Surface Temperature data

Hayward, Alexander; Dasgupta, Nishka; McAdam, Ronan; Payne, Mark R.; Raj, Roshin P.; Bonino, Giulia; Chatterjee, Sourav; Combes, Vincent; Denaxa, Dimitra; De Rovere, Francesco; Englyst, Pia; Haapaniemi, Veera; Hargous, Paul; Høyer, Jacob; Joseph, K. Ajith; Lopes, Beatriz; Oliveira, Ana; Paixão, João; Silva, Fabiola; Surendran, Saradhy; Zegna-Rata, Artemis; Olsen, Steffen

doi:10.5194/essd-2025-590

Preprints

https://doi.org/10.5194/essd-2025-590

Preprints

26 Nov 2025

| 26 Nov 2025

Status: this preprint is currently under review for the journal ESSD.

Marine Heat waves – Multiple Analysis / Definitions (MHW-MAD): A Multi-Definition Global Marine Heatwave Dataset from Satellite Sea Surface Temperature data

Alexander Hayward, Nishka Dasgupta, Ronan McAdam, Mark R. Payne, Roshin P. Raj, Giulia Bonino, Sourav Chatterjee, Vincent Combes, Dimitra Denaxa, Francesco De Rovere, Pia Englyst, Veera Haapaniemi, Paul Hargous, Jacob Høyer, K. Ajith Joseph, Beatriz Lopes, Ana Oliveira, João Paixão, Fabiola Silva, Saradhy Surendran, Artemis Zegna-Rata, and Steffen Olsen

Abstract. Marine heatwaves (MHWs) are prolonged anomalies of warm sea surface temperature (SST) that can disrupt marine ecosystems, physical climate processes, and human coastal activities. MHW definitions vary due to different stakeholders requirements, such as ecological scientists and climate scientists having differing yet specific thresholds and metrics. Here we introduce a new global dataset of daily MHW metrics: climatological baselines, threshold exceedances, SST anomalies, and categorical event classifications of severity, derived from the European Space Agency SST Climate Change Initiative (ESA SST CCI) climate data record (CDR; 1982–2021) version 3.0 and an extension from 2022–2024 provided as an interim climate data record (iCDR). Building on the widely used definition of MHWs, periods in which SST exceeds the local 90th percentile for 5 or more days, our dataset extends this framework by incorporating multiple baseline climatologies (including fixed 30-year periods and rolling 30-year windows, as well as the period for reanalysis 1993–2016), varied percentile thresholds (90th, 95th, 99th), and both raw and linearly detrended SST anomalies. We also implement alternative event duration criteria (minimum 10-day and 30-day persistence) to classify longer-lasting warm events. All data products are provided at daily resolution on a 0.05° (~5 km) grid, with outputs including daily climatological percentiles, SST anomalies and binary MHW flags with severity category indices. This comprehensive dataset provides a consistent foundation for detecting and analysing MHWs across time and space, enabling researchers to assess how methodological choices affect MHW characterisation. By offering multiple definitions in parallel, the dataset facilitates intercomparison studies and supports applications from climate monitoring and model evaluation to marine ecological impact assessment, thereby providing users with pre-made indices for extremes.

How to cite. Hayward, A., Dasgupta, N., McAdam, R., Payne, M. R., Raj, R. P., Bonino, G., Chatterjee, S., Combes, V., Denaxa, D., De Rovere, F., Englyst, P., Haapaniemi, V., Hargous, P., Høyer, J., Joseph, K. A., Lopes, B., Oliveira, A., Paixão, J., Silva, F., Surendran, S., Zegna-Rata, A., and Olsen, S.: Marine Heat waves – Multiple Analysis / Definitions (MHW-MAD): A Multi-Definition Global Marine Heatwave Dataset from Satellite Sea Surface Temperature data, Earth Syst. Sci. Data Discuss. [preprint], https://doi.org/10.5194/essd-2025-590, in review, 2025.

Received: 26 Sep 2025 – Discussion started: 26 Nov 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Status: open (until 21 Jan 2026)

Post a comment Subscribe to comment alert

RC1: 'Comment on essd-2025-590', Anonymous Referee #1, 11 Dec 2025 reply

Summary
The authors have created an extensive database of MHW results based on the ESA SST CCI v3.0 product. They have applied a numeric algorithm, similar to the Hobday et al. 2016, 2018 definition, to detect MHWs based on a range of different factors. Specifically they have detected events based on different climatology periods, raw vs de-trended data, different minimum lengths, and different percentile thresholds. All of these outputs are saved in a series of NetCDF files and available via HTTPS.

While the database appears to be technically sound, I am not convinced that it will see much use in its current form. I provide examples for why this may be in the specific comments below. Even if the authors were to address the comments I’ve made, or point out why I may be mistaken, I don’t think a database of these sorts of results would fundamentally see much use. I say this because the Hobday definition has existed for almost a decade now, and databases of global results using this analysis are not new. Many have been published or are otherwise publicly available. To be clear, this is not a reason that this current work should not be published, rather it would have been something to consider before all of this work was performed. Now that the files exist online and are publicly accessible it would be best to publish this work here in ESSD. Though do consider the points I’ve made below as they may improve the likelihood of extended use and valorisation of the work performed by the authors.

Title
- If the authors want to spell marine heat waves as three words, ‘waves’ should be capitalised in the title.

Abstract
- If there are plans to update the database on an annual basis (or more regularly), it would be good to mention that somewhere in the abstract. The other plans for future development, like adding other satellite products, should also be mentioned.

1. Introduction
- 51 : km → km^2
- 64 : ‘MHW events’ Here and throughout the text. A MHW is an event, so writing ‘MHW event’ is tautology. This is a common grammatical error in the MHW literature.
- 91 : v.30 → v3.0
- 94 : ‘dataset’ Here and throughout. I think the size of this output, and its standardised structure as a series of NetCDF files, could warrant the title of ‘database’ instead of a dataset. As the authors prefer.

2. SST data
- 113 : extra space after ‘x’

3. MHW definitions
- 161-168 : I’m not certain that would result in a trend-free time series. I could see how de-trending based on each DOY could still allow for an overall long-term trend in the full time series. Did the authors test the difference in output for their de-trending method vs a simple linear model fit to the full time series at a given pixel?
- 175 : Some sort of formatting issue with header sizes
- 179-195 : The choice of the moving smooth window is non-negligible, as the authors noted in the text. Did the authors perform any sensitivity studies for how the choice of the moving window for the smooth chosen here may create different basic results (i.e. 90^th percentile 5-day limit) from the standard python script created for Hobday et al. 2016?
Note that the basic Hobday methodology performs two smooths, not one, as noted in this manuscript. The first smooth is an 11 day window (5 day double sided running mean centred on the DOY) that takes into account all DOY values across all years and is used to calculate the DOY percentiles. This is similar to what the authors have done in their methodology with the 21 day window. Meaning that the base Hobday definition uses a smaller window than the authors have.
However, a second smooth of a 31 day total window is applied to the outputs of the first smooth. This is done so that any outsized effects from one or a few particularly extreme years does not overly bias the percentile thresholds applied to all other years.
Therefore the methodology employed by the authors is arguably a large departure from the initial methodology on which the analysis in this manuscript departed from. It would be preferable to provide at least a basic comparison of how this differs.
I suspect that this is just one of the unintended differences caused to the results by diverging from the base Hobday methodology as there are more specific steps in the process than most researchers realise. These sorts of issues come up often when one incorporates CDO into a methodology instead of performing everything in a more malleable language like R or Python because CDO is more limited in its operations.
Ultimately, this is just a technical issue. The authors have clearly explained what they have done and why, so it is up to readers/users to decide if they want to use the outputs. As the authors noted in the intro, there is no agreed upon methodology to be used. Nor should there be one single numerical approach.
- 218 -221: So does the authors methodology insert an NA value on DOY 60 for non-leap years? If so, how does the algorithm handle this missing day? For reference, the Hobday algorithm creates DOY 60 from DOY 59 and 61 via a linear interpolation BEFORE then taking all DOY values when calculating the seasonal and 90^th percentile signals. This may be another source of varying outputs.
- 248 : So the authors have not provided metrics like mean, maximum, or cumulative intensity? These would be very relevant for a number of applications. That a MHW lasted 10 days with an average anomaly of 0.1°C is very different from an event with an average anomaly of 1.0°C.

4. Results
- No comments.

5. Data
- Why add the 10^th and 50^th percentiles to every 90^th, 95^th, and 99^th percentile climatology file? Wouldn’t it be much more space efficient on the server to host these as separate files?
- 363 : ‘difference above threshold’ Do the authors mean to say the difference above the seasonal signal?
- As the files are organised now, if a researcher wanted to access all of the years of data for their particular study site (even if it is very small), they would need to download all of the files on the server for their input values of choice (i.e. 95^th percentile, 10 day limit). Then open each one and extract just the portion they want. Combine these all into another file and save it for continued use. This task is generally outside of the competence of most non-physical oceanography marine researchers. Have the authors considered rectangling their data to also provide the same output as a series of files organised along latitude or longitude lines? For example, providing all of the DOY percentile values for all days for all pixels in the latitude range of 0-5°E. The exact method isn’t too important. The point is that, one can create an impressive database of results (as the authors have) but if it is too complex or inconvenient to use, it will not be used. Have the authors consulted with a broad range of stakeholders to see who amongst them would be able to make use of the database as it stands now?
- Additionally, the lack of standard MHW metrics (e.g. duration, cumulative intensity) provided for discrete events will likely prove problematic for most researchers. If I have understood correctly, the author provide the per day results per pixel, but no summary metrics. It is the summary metrics (e.g. total duration) that tend to be of most interest to a researcher. Therefore, any use of this database will inherently require a lengthy post-processing analysis by the user in order to arrive at the final metrics that they will probably need for their work. Have the authors considered this? If so, why not provide files that give these summary metrics? This would not be too terribly difficult to create, and would require quite a lot less work from a user. For example, create a file that contains all of the summary results for all pixels for a given longitude band. This would contain the default Hobday-ish output that lists each discrete event, its duration, max intensity, etc. etc. Nothing new or innovative, just the standard stuff that people are used to.
- Ultimately, who is the target user for this database? Whose life would be made easier by using these data as they exist now? In what circumstance would a researcher with a limited study area not simply calculate the results for exactly what they need? With an output that matches exactly what they need? Because these data are stocked in a NetCDF file format (a choice that I completely agree with, just to be clear) it will generally require that a user (or someone in their lab) be able to access, open, and extract data from them using a programming language. Considering that there are already MHW toolboxes for Python, R, and MATLAB, why would they not just use one of these to perform the tests that they need? While the database that the authors have created is impressive, and would be very useful, I do not immediately see that it will receive much use as is. While true that this database would inherently be more useful to a research question with a global scope, most researchers tackling investigations at this scale will already have the necessary competence, and will most likely already have a particular dataset or other peculiarities that they will need to pursue that would prevent them from exploiting this database.

6. Usage
- 403 : The point about climate model evaluation is a tenuous one. Unless the model is in some way related to the ESA CCI SST product, the modelling of temperature extremes tend to be so incongruous between different models that comparing them directly is rarely a fruitful endeavour.
- 411 : The original python code, written in 2016, already allows a user to make all of the changes detailed in this manuscript. Including the changing of the DOY moving window. Minus of course the de-trending of the raw data.

7. Summary
- 415-420: A problem with this statement is that the authors themselves have deviated from the base Hobday algorithm, meaning that the results in the database may already differ from an analysis that uses the exact same climatological baseline, duration limit, etc. Let alone comparisons for results with different baselines.
- 420 : HMW → MHW

8. Conclusions
- No comments.

Table 1-2
No comments.

Figure 1-7
No comments.

Reply

Citation: https://doi.org/10.5194/essd-2025-590-RC1

Data sets

MHW-MAD Alex Hayward et al. https://download.dmi.dk/public/MHW/

Viewed

Total article views: 446 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
334	87	25	446	18	20

HTML: 334
PDF: 87
XML: 25
Total: 446
BibTeX: 18
EndNote: 20

Views and downloads (calculated since 26 Nov 2025)

Month	HTML	PDF	XML	Total
Nov 2025	88	14	8	110
Dec 2025	232	71	17	320
Jan 2026	14	2	0	16

Cumulative views and downloads (calculated since 26 Nov 2025)

Month	HTML	PDF	XML	Total
Nov 2025	88	14	8	110
Dec 2025	232	71	17	320
Jan 2026	14	2	0	16

Viewed (geographical distribution)

Total article views: 408 (including HTML, PDF, and XML) Thereof 408 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 06 Jan 2026

Short summary

We present a global marine heatwave dataset (1982–2024) based on satellite sea surface temperature. The dataset applies multiple definitions in parallel, varying baselines, thresholds, detrending, and event durations. It enables consistent comparisons of marine heatwave characterisation across methods and supports climate monitoring, model evaluation, and ecological impact studies.


Total:	0
HTML:	0
PDF:	0
XML:	0