A global database of extreme fire events from satellite data from 2003 to 2022

Solano-Romero, Erika; Segura-Garcia, Carlota; Pettinari, M. Lucrecia; Khairoun, Amin; Torres-Vázquez, Miguel Ángel; Chuvieco, Emilio

doi:10.5194/essd-2026-236

Preprints

https://doi.org/10.5194/essd-2026-236

Preprints

11 May 2026

| 11 May 2026

Status: this preprint is currently under review for the journal ESSD.

A global database of extreme fire events from satellite data from 2003 to 2022

Erika Solano-Romero, Carlota Segura-Garcia, M. Lucrecia Pettinari, Amin Khairoun, Miguel Ángel Torres-Vázquez, and Emilio Chuvieco

Abstract. Extreme fires represent a significant threat due to their impacts on climate, ecosystems, and society. Despite their increasing prevalence, their definition remains controversial, as their characteristics vary depending on the region considered. In this article, we present the first version of the Extreme Fire Events (EFEs) database, a global dataset of extreme fires in NetCDF format containing monthly rasters on a regular grid with a spatial resolution of 0.25 degrees. The database includes the period 2003–2022, when a consistent satellite record was available. The basic unit of analysis is a cell-month event (CME), which represents aggregated fire activity within a grid cell during a given month. The identification of extreme events was based on two main satellite-derived variables: Burned Area (BA) from the European Space Agency’s FireCCI51 dataset and Fire Radiative Power (FRP) obtained from the NASA MCD14ML active fire product. Both variables were derived from the MODIS sensor. They were aggregated to the spatial and temporal scale defined for the CMEs and were used to compute standardised anomalies within each of the 55 defined regions, in order to account for spatial and seasonal differences in fire activity in the main global biomes. A CME was classified as an EFE when it presented anomalous values in both variables according to the established regional thresholds. Further, for each EFE, the database also indicates if any fire perimeter from the FRY v2.0 dataset identified as extreme by a certain attribute (fire size, duration, mean FRP, rate of spread and severity) overlapped with the CME. The database includes 19,951 EFEs between 2003 and 2022, with the highest frequency in 2010 and 2007, and the lowest in 2013. The dataset is intended for climate and Earth System modellers aiming to understand the causes and impacts of EFEs, as well as to forecast their occurrence under future scenarios or include them in broader Earth System models.

Received: 28 Mar 2026 – Discussion started: 11 May 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 1932 KB)

Supplement (1518 KB)

Download & links

Erika Solano-Romero, Carlota Segura-Garcia, M. Lucrecia Pettinari, Amin Khairoun, Miguel Ángel Torres-Vázquez, and Emilio Chuvieco

Status: final response (author comments only)

RC1:
'Comment on essd-2026-236', Anonymous Referee #1, 13 Jun 2026
The manuscript presents a global extreme fire dataset from 2003 to 2022, derived from FireCCI51 and MCD14ML. The effort to aggregate global extreme fire data is valuable, while the current manuscript and the dataset have problems in method and structure. My primary concerns are the validity of the dataset and the lack of validation. Thus I suggest major revision for further consideration.
Major concern:
The manuscript defines extreme fire events based on a 0.25 degree and monthly grid. But extreme fires can cross boundaries and span multiple months. Applying a grid-based approach inevitably divides contiguous fires into multiple parts, thus make it difficult to define as “events”.

The manuscript needs to clarify how multiple satellite overpasses (e.g., Terra and Aqua, day and night) for the MCD14ML product were processed. If these observations are not appropriately deduplicated, it may lead to double-counting and bias the fire radiative power metrics.

The authors explain that the database ends in 2022 to maintain consistency, given the transition from FireCCI51 to Sentinel-3 products. However, since alternative MODIS products (MCD64A1) are updated in near real-time, ending in 2022 misses recent extreme fire seasons (e.g., 2023 Canadian fire season and 2025 South California).

As a data descriptor paper, validation of the product is important and necessary. The manuscript currently primarily focuses on analyses. A cross validation with other data sources (Global Fire Atlas, FIRED), such as disaster databases (e.g., EM-DAT), media news, or regional records, is needed to verify that the identified events correspond to existing records.

Line-by-line comments:
L13: The phrase “as their characteristics vary depending on the region considered” may not be necessary, as it is somewhat ambiguous and does not introduce new information.
L14: The description “a global dataset of extreme fires in NetCDF format containing monthly rasters on a regular grid with a spatial resolution of 0.25 degrees” could be more concisely written as “a global monthly and 0.25-degree extreme fires dataset in NetCDF format.”
L22: Please clarify why “main global biomes” is used here, given that the database claims global coverage.
L24: Please avoid using ambiguous words such as “certain”. It is better to list them explicitly.
L27: I have reservations about the dataset's value for forecasts and projections, given that it is not updated in (near) real-time.
L45: The term “unique” should be clarified: does it mean unified, comprehensive, or broadly accepted?
L47: It would be valuable to mention the fire’s impacts here.
L52-54: A region-specific threshold may also not fully resolve this issue, as it primarily highlights anomalies relative to the region's historical average rather than absolute physical extremes.
L54-56: Please clarify this sentence. Small fires in fuel-limited regions are typically not considered extreme, so the current phrasing is slightly confusing.
L62: This appears to be an incomplete sentence as it only includes landscapes.
L63: The text says “several” but provides only one reference. Please include additional relevant datasets, such as the Global Fire Atlas (ESSD) and FIRED (https://www.mdpi.com/2072-4292/12/21/3498).
L66-68: The necessity of mentioning the specific ESA project here is unclear.
L70: This expression is physically imprecise. Multiple distinct fire events can occur within a single month and a 0.25° cell, so aggregating them as a single 'event' introduces artificial artifacts.
L81: By using a 0.25° and monthly resolution, a single, contiguous large-scale fire event is inevitably fractured into multiple CMEs. The authors should explicitly discuss this limitation and how it impacts the definition of an "event".
L82: Why use the word “roughly”? Geographic divisions should be precise and accurate.
L87-89: By providing only binary values rather than standardized anomalies, the dataset's usage for diverse modeling purposes is restricted.
L89: This implies the added value is merely calculating anomalies for the existing FRY v2.0 dataset.
L95: It is uncommon to have a one-sentence paragraph. Please merge or expand.
L107-110: While the authors justify stopping at 2022 due to the discontinuity between FireCCI51 and Sentinel-3 products, relying on a discontinued product restricts the database's reuse value. Given that alternative MODIS BA products are updated in near real-time, the 2003-2022 cutoff misses critical recent extremes.
L116-120: The issue of multiple overpasses (Terra and Aqua for day and night) seems completely unaddressed. Without proper deduplication, fire metrics are systematically biased and double-counted. See: https://www.nature.com/articles/s41559-024-02452-2
L122-138: The added value of including FRY v2.0 is questionable, as end-users could calculate anomalies more straightforwardly from the original FRY v2.0 data.
L176-182: The merge and divide procedure introduces arbitrary thresholds (e.g., why exactly 125,000 km²?), and its added value over a standard continental-biome approach is unclear.
Results (General): For a data descriptor paper, data validation is important. While comprehensive validation is challenging due to the scarcity of similar global products, cross-validating against other sources such as EM-DAT, media reports, and regional disaster databases is necessary to prove the dataset's reliability.
Citation: https://doi.org/10.5194/essd-2026-236-RC1
RC2:
'Comment on essd-2026-236', Anonymous Referee #2, 02 Jul 2026
The paper by Solano-Romero et al. presents a global database on extreme fires events between 2003 and 2022 derived from satellite data. No such database exists and would be valuable for global comparisons, same as a broadly used definition of “extreme fire events”. Still the validation of the data set is week, and needs to be improved. I recommend some minor changes in the meta data. The manuscript itself is written in a clear and accessible way, describing detailed the data source and workflow. The results and discussion section should be structured in a more data set supporting way, and potential and limitations should be better highlited.
Overall I see the potential of the database, and recommend the publication after major revisions. Major concern is the validation of the data.

Comments per line:
L 84: Which standardized method was used?
Fig 1: Seems to have many colors, reduce to two or three.
L 178: The meaning and use of “social heterogeneity” is not clear to me as the focus in the workflow lies on climatic and ecological factors.
L 196: “based on standard deviations from the mean” → what is the mean based on? Whole time period or other?

Comments on single sections:
Section 3.1: The results section primarily describes the shown figures, which seems redundant to me having them in there. It would be more interesting to verbally compare the results of the figures with the intention to show the potential use of the data. For example Fig 3, 4, and 5a show global occurrences and frequencies but don’t give any hint why this would be interesting. Fig 5b suggests a seasonal pattern, while it’s questionable to look at this pattern globally. Same goes for Fig 6 and 7: why is this interesting, what is the potential question this analysis could answer?
Section 3.2: Due to the lack of a comparable data set and coherent definition it is valid to compare the EFEs data to literature documented events. Still, the validation of a global data set covering 20 years on only 20 events is too week and needs to be corroborated by more evidence. For example simple statistical thresholds from other data sets could be used or part of the data compared with locally restricted data sets like US FPA-FOD for the USA or the EFFIS Large Fire for Europe (ideally based on different satellite products). Discussing differences to the validation data sets the strengths and weaknesses of the new EFEs database could be pointed out better.

Notes on the data set itself:
I couldn’t access the database via the provided link but did find it with description given in the paper. A published data set should be understandable by itself. That’s why I recommend to improve mainly the provided meta information in the Readme.txt by the following points:
improve structure, “GENENRAL INFORMATION” is quite long and “4. Description of the data set” better fits to “files”; “author” I would put together with “Contact”. Also I recommend to use a common description format, ie. .json format

“4. Description of the data set”:
visually improve folder structure, for example by adding more indents for sub-points

please state the meaning of FRY, consider the same for gBA and gsumFRP at first mention

add Version date of data set

“GEOGRAPHIC INFORMATION” → add resolution

“RELATED PUBLICATIONS” → add ESSD-publication

I don’t see the need for “Keywords” in the readme, as long as you can’t search for them in a database. Maybe they could be used in the databases platform mask

What is meant by “OTHERS” 1. Data dictionary?

The python scripts seem to be well documented, though I didn’t test them.
Citation: https://doi.org/10.5194/essd-2026-236-RC2

Erika Solano-Romero, Carlota Segura-Garcia, M. Lucrecia Pettinari, Amin Khairoun, Miguel Ángel Torres-Vázquez, and Emilio Chuvieco

Supplement

https://doi.org/10.5194/essd-2026-236-supplement

Data sets

Extreme_Fire_Events_EFEs_database Erika Solano-Romero and Carlota Segura-Garcia https://edatos.consorciomadrono.es/previewurl.xhtml?token=94b577ad-c940-4042-9a4e-239102294a4e

Erika Solano-Romero, Carlota Segura-Garcia, M. Lucrecia Pettinari, Amin Khairoun, Miguel Ángel Torres-Vázquez, and Emilio Chuvieco

Viewed

Total article views: 490 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
357	117	16	490	44	16	18

HTML: 357
PDF: 117
XML: 16
Total: 490
Supplement: 44
BibTeX: 16
EndNote: 18

Views and downloads (calculated since 11 May 2026)

Month	HTML	PDF	XML	Total
May 2026	206	69	12	287
Jun 2026	39	11	3	53
Jul 2026	112	37	1	150

Cumulative views and downloads (calculated since 11 May 2026)

Month	HTML	PDF	XML	Total
May 2026	206	69	12	287
Jun 2026	39	11	3	53
Jul 2026	112	37	1	150

Viewed (geographical distribution)

Total article views: 465 (including HTML, PDF, and XML) Thereof 465 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 01 Aug 2026

Download

Preprint (1932 KB)
Metadata XML

Short summary

To better understand where and when the world’s most unusual fires occur, we created the first global database of extreme fire events for 2003 to 2022. Using satellite observations of burned land and fire energy, we identified 19,951 extreme events and found clear differences across regions and seasons. This new resource can help researchers and decision makers improve studies of climate, ecosystems, and future fire risk.


Total:	0
HTML:	0
PDF:	0
XML:	0