This work is distributed under the Creative Commons Attribution 4.0 License.
BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response
Abstract. Disaster events occur around the world and cause significant damage to human life and property. Earth observation (EO) data enables rapid and comprehensive building damage assessment, an essential capability in the aftermath of a disaster for reducing human casualties and informing disaster relief efforts. Recent research focuses on developing artificial intelligence (AI) models to accurately map unseen disaster events, mostly using optical EO data. However, solutions based on optical data are limited to clear skies and daylight hours, preventing a prompt response to disasters. Integrating multimodal EO data, particularly combining optical and synthetic aperture radar (SAR) imagery, makes it possible to provide all-weather, day-and-night disaster responses. Despite this potential, the lack of suitable benchmark datasets has constrained the development of robust multimodal AI models. In this paper, we present a Building damage assessment dataset using veRy-hIGH-resoluTion optical and SAR imagery (BRIGHT) to support AI-based all-weather disaster response. To the best of our knowledge, BRIGHT is the first open-access, globally distributed, event-diverse multimodal dataset specifically curated to support AI-based disaster response. It covers five types of natural disasters and two types of human-made disasters across 14 regions worldwide, with a focus on developing countries where external assistance is most needed. The dataset's optical and SAR images, with spatial resolutions between 0.3 and 1 m, provide detailed representations of individual buildings, making it ideal for precise damage assessment. We train seven advanced AI models on BRIGHT to validate transferability and robustness. Beyond that, BRIGHT also serves as a challenging benchmark for a variety of tasks in real-world disaster scenarios, including unsupervised domain adaptation, semi-supervised learning, unsupervised multimodal change detection, and unsupervised multimodal image matching. The experimental results serve as baselines to inspire future research and model development. The dataset (Chen et al., 2025), along with the code and pretrained models, is available at https://github.com/ChenHongruixuan/BRIGHT and will be updated as new disaster data become available. BRIGHT also serves as the official dataset for the 2025 IEEE GRSS Data Fusion Contest Track II. We hope that this effort will promote the development of AI-driven methods in support of people in disaster-affected areas.
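For readers who want to explore the released tiles, a minimal sketch of reading one paired sample with rasterio is shown below. The file names are hypothetical placeholders; the actual tile naming and folder layout are documented in the repository linked above.

```python
import numpy as np
import rasterio

def load_pair(optical_path, sar_path, label_path):
    """Read a pre-event optical tile, post-event SAR tile, and label mask."""
    with rasterio.open(optical_path) as src:
        optical = src.read()   # (bands, H, W) pre-event optical image
    with rasterio.open(sar_path) as src:
        sar = src.read()       # (bands, H, W) post-event SAR image
    with rasterio.open(label_path) as src:
        labels = src.read(1)   # (H, W) per-pixel damage classes
    return optical, sar, labels

optical, sar, labels = load_pair(
    "pre_event_optical.tif", "post_event_sar.tif", "target_label.tif"
)
print(optical.shape, sar.shape, np.unique(labels))
```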
Status: final response (author comments only)
RC1: 'Comment on essd-2025-269', Anonymous Referee #1, 11 Jul 2025
This paper introduces BRIGHT, a novel and timely benchmark dataset for building damage assessment using multimodal high-resolution optical and SAR imagery. Covering 14 globally distributed disaster events, BRIGHT provides pixel-level damage annotations for over 384,000 buildings. The dataset is designed to facilitate AI-based disaster response research, particularly in challenging all-weather conditions. The authors also benchmark a suite of machine learning and deep learning models on multiple tasks. The authors provided detailed documentation and descriptions, making the data, the related source code, and the pretrained model weights easy to understand and use.
In summary, this is quite interesting and solid work. I'd like to recommend the acceptance of this work, since it represents an important contribution to the Earth observation and disaster response communities. Yet before acceptance, several clarifications and refinements are suggested.
- The manuscript would benefit from a deeper exploration of what the models learn from multimodal fusion. Specifically, what role do optical images play in multimodal building damage assessment? Is it beyond just building footprint localization? In other words, are the features extracted from optical imagery actively compared with SAR representations? Some discussion (e.g., based on CAMs in Fig. 7) is provided but could be more explicitly elaborated.
- The manuscript provides an extensive evaluation of supervised and unsupervised change detection models, but the conceptual and methodological relationship between building damage assessment and generic change detection remains unclear; it is largely implied rather than discussed. An explicit, clearer explanation would help readers who lack the related background.
- Since UMCD methods underperform, consider including a random-guessing baseline for reference. This would contextualize the difficulty of BRIGHT and help readers understand the performance floor under the UMCD setup (an illustrative sketch of such a floor follows this review).
- While Table 1 offers a comprehensive comparison of datasets, several datasets seem relevant and should be included to enhance its completeness, like CRASAR-U-DROIDs [arXiv:2407.17673] and Noto-Earthquake building damage dataset [10.5194/essd-2024-363].
- The paper describes careful multimodal alignment but omits the software used, e.g., ENVI, ArcGIS, or QGIS. Please provide related details.
- Appendix G includes important new experimental setups and evaluation methods for UMCD. However, too much content is currently presented together, making it difficult for readers to extract the key information. Adding section subtitles could improve readability.
- Figure 8: Please specify in the figure or caption that the values represent the average ± standard deviation across models.
- Figure 10: Add a note in the caption to clarify that each dot corresponds to performance on a single test event under cross-event transfer.
- Typo in Table 7: “Object-based major voting” should be corrected to “Object-based majority voting”.
- Clarify the meaning of “–” symbols in Table 11. Do they indicate missing data or inapplicability? This should be stated explicitly.
- “ML” should be defined on its first use and used consistently thereafter, instead of alternating with “machine learning”.
- Standardize currency formatting (e.g., USD vs. US$).
- Define abbreviations such as IGN and GSI when first mentioned as data providers.
- The format of references should be standardized. Some of these entries use abbreviations for journals, while others have full titles.
Citation: https://doi.org/10.5194/essd-2025-269-RC1
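As an illustration of the random-guessing floor suggested above, the sketch below draws uniform random per-pixel class labels and scores them with mIoU. This is a minimal sketch assuming a four-class damage scheme (background / intact / damaged / destroyed) and a synthetic label map; it is not the paper's official evaluation protocol.

```python
import numpy as np

def miou(pred, target, num_classes):
    """Mean IoU over classes that occur in the prediction or the target."""
    ious = []
    for c in range(num_classes):
        inter = np.logical_and(pred == c, target == c).sum()
        union = np.logical_or(pred == c, target == c).sum()
        if union > 0:
            ious.append(inter / union)
    return float(np.mean(ious))

rng = np.random.default_rng(0)
target = rng.integers(0, 4, size=(512, 512))       # synthetic stand-in labels
random_pred = rng.integers(0, 4, size=(512, 512))  # uniform random guess
print(f"random-guess mIoU: {miou(random_pred, target, 4):.3f}")
```

For four balanced classes this floor sits near 0.14 mIoU; under BRIGHT's heavily imbalanced labels it would differ, which is precisely why reporting it alongside the UMCD results would be informative.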
AC1: 'Reply on RC1', Hongruixuan Chen, 24 Jul 2025
Thank you so much for the instructive and constructive comments on our paper! We have uploaded a point-by-point response, attached as a supplement, for your reference.
RC2: 'Reply on AC1', Anonymous Referee #1, 24 Jul 2025
Thanks for the authors' responses. This revision has significantly improved the quality and soundness of the work. My major concerns have been fully addressed. I recommend accepting this paper.
Citation: https://doi.org/10.5194/essd-2025-269-RC2
AC2: 'Reply on RC2', Hongruixuan Chen, 26 Jul 2025
Thank you once again for your insightful comments, which have greatly helped us improve our manuscript. We will upload the revised version, based on your suggestions and as reflected in our responses, as the Final Response after the open discussion.
Citation: https://doi.org/10.5194/essd-2025-269-AC2
RC3: 'Comment on essd-2025-269', Anonymous Referee #2, 26 Aug 2025
This manuscript introduces the BRIGHT dataset, the first open building damage assessment dataset with global coverage, multi-hazard scenarios, multimodal imagery (optical and SAR), and sub-meter resolution. The paper systematically describes the data collection, annotation, and quality control methods, and validates the dataset with multiple deep learning models, including cross-disaster transfer (zero-shot and one-shot), semi-supervised, and unsupervised approaches. The dataset demonstrates clear novelty and practical value, and it is of significant importance for advancing research and applications in disaster emergency response, remote sensing, and artificial intelligence. Overall, the paper is well structured and logically clear, with detailed results and strong value in terms of data sharing. Although the manuscript is rich in content, there are still details that require improvement, and I recommend appropriate revisions.
Major Comments
- The explanation of annotation consistency and reliability remains insufficient. Although the authors state that the annotations were obtained from multiple institutions such as Copernicus EMS, UNOSAT, and FEMA and then refined manually, there may be inconsistencies in how different institutions define “damaged” and “destroyed.” This could affect the consistency of annotations across disaster scenarios. It is therefore necessary to further elaborate on the process of unifying annotations and to provide more detail on the manual refinement procedures.
- The treatment of class imbalance is insufficient. Figure 5(d) shows that intact buildings account for over 80%, while destroyed buildings account for less than 7%. This severe imbalance directly affects the accuracy of recognizing the destroyed class. Although the authors employed the Lovász loss function to partially alleviate the issue, this alone does not solve the problem (see the minimal weighting sketch after these major comments). Is this imbalance one of the reasons for the relatively low performance in the subsequent experimental results?
- The discussion on cross-disaster generalization needs to be strengthened. Table 6 shows that in different disaster types, certain events perform particularly poorly. In particular, the mIoU values are the lowest for explosion and chemical accident events such as Bata-EP-2021 and Kyaukpyu-CC-2023, while earthquake events such as Morocco-EQ-2023 and Noto-EQ-2024 also remain highly challenging. This indicates that the models face significant difficulties in handling highly destructive, structurally complex, and spatially heterogeneous disaster scenarios. The authors should analyze these challenges in more depth, such as the heterogeneity and extreme local variations in explosion damage, the diversity of collapse patterns in earthquake events, and the limitations of SAR data in capturing fine-grained details. It is also recommended to provide typical error cases and compare model errors across disaster types to better illustrate the shortcomings in generalization.
- The manuscript currently lacks a comparison with optical-only baselines, which is crucial to highlight the value of multimodal methods. Readers may question whether the inclusion of SAR brings significant benefits and whether the additional cost of multimodality is justified. To avoid such doubts, I suggest adding experiments with optical-only inputs and comparing them with Optical+SAR results. This would further emphasize the unique value of the BRIGHT dataset and provide stronger evidence for the necessity of multimodal fusion.
- The discussion of limitations and future directions is insufficient. At present, the conclusion mainly emphasizes the dataset’s contributions, but it does not address its shortcomings in detail. It is suggested to include a separate subsection summarizing the limitations, such as the use of single-polarization SAR, the lack of time-series data, and the fact that most disaster events are concentrated after 2020.
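As a minimal illustration of the imbalance point raised above, one common complement to the Lovász loss is inverse-frequency class weighting in a cross-entropy term. A short PyTorch sketch follows; the class fractions are placeholders loosely echoing Fig. 5(d), not the actual BRIGHT statistics.

```python
import torch
import torch.nn as nn

# Placeholder pixel fractions: background, intact, damaged, destroyed.
freqs = torch.tensor([0.10, 0.75, 0.08, 0.07])
weights = 1.0 / freqs
weights = weights * (len(freqs) / weights.sum())  # normalize around 1

criterion = nn.CrossEntropyLoss(weight=weights)

logits = torch.randn(2, 4, 64, 64)         # (batch, classes, H, W) model output
target = torch.randint(0, 4, (2, 64, 64))  # per-pixel class labels
print(criterion(logits, target).item())
```

Rare classes such as “destroyed” then contribute proportionally more to the gradient, and the weighted term can be combined with the Lovász loss the authors already use.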
Minor Comments
- The description of study areas and disaster events is somewhat redundant.
- The terminology “one-shot” may not be accurate. The authors describe it as using “a small number of labeled samples,” which may be better described as “few-shot.” Since these concepts are borrowed from previous work, it is suggested to cite the corresponding references.
- At line 420, the authors state that SAR is not sensitive to fine structural changes. Would this limitation reduce the value of multimodality in certain scenarios?
Citation: https://doi.org/10.5194/essd-2025-269-RC3
AC3: 'Reply on RC3', Hongruixuan Chen, 01 Sep 2025
Thank you so much for the instructive and constructive comments on our paper! We have uploaded a point-by-point response, attached as a supplement, for your reference.
RC4: 'Reply on AC3', Anonymous Referee #2, 02 Sep 2025
I appreciate the authors’ careful revisions. The manuscript has been substantially improved in quality and clarity. All of my previous concerns have been satisfactorily addressed. I therefore recommend acceptance of this paper.
Citation: https://doi.org/10.5194/essd-2025-269-RC4
AC4: 'Reply on RC4', Hongruixuan Chen, 02 Sep 2025
Thank you once again for your insightful comments, which have greatly helped us improve our manuscript. We will upload the revised version, based on your suggestions and as reflected in our responses, as the Final Response after the open discussion.
Citation: https://doi.org/10.5194/essd-2025-269-AC4
AC5: 'Final response to referees' comments on essd-2025-269', Hongruixuan Chen, 09 Sep 2025
Again, we would like to express our sincere gratitude to both reviewers for their professional, insightful comments and constructive suggestions.
This document summarizes our final responses to the two reviewers' comments. The content is consistent with our replies to both reviewers during the open discussion phase. We have ensured that everything in this final response is fully synchronized with the latest revised version of the manuscript.
Data sets
BRIGHT Hongruixuan Chen, Jian Song, Olivier Dietrich, Clifford Broni-Bediako, Weihao Xuan, Junjue Wang, Xinlei Shao, Yimin Wei, Junshi Xia, Cuiling Lan, Konrad Schindler, Naoto Yokoya https://zenodo.org/records/14619797
Model code and software
BRIGHT Hongruixuan Chen, Jian Song, Olivier Dietrich, Clifford Broni-Bediako, Weihao Xuan, Junjue Wang, Xinlei Shao, Yimin Wei, Junshi Xia, Cuiling Lan, Konrad Schindler, Naoto Yokoya https://github.com/ChenHongruixuan/BRIGHT
Viewed
| HTML | PDF | XML | Total | BibTeX | EndNote |
|---|---|---|---|---|---|
| 947 | 375 | 44 | 1,366 | 30 | 43 |
Cited
2 citations as recorded by Crossref.
- Deep Learning-Based Detection and Assessment of Road Damage Caused by Disaster with Satellite Imagery J. Cha et al. 10.3390/app15147669
- DamageScope: An Integrated Pipeline for Building Damage Segmentation, Geospatial Mapping, and Interactive Web-Based Visualization S. Al Shafian et al. 10.3390/rs17132267