Preprints
https://doi.org/10.5194/essd-2021-36
https://doi.org/10.5194/essd-2021-36

  24 Feb 2021

24 Feb 2021

Review status: a revised version of this preprint was accepted for the journal ESSD and is expected to appear here in due course.

A multi-source 120-year U.S. flood database with a unified common format and public access

Zhi Li1, Mengye Chen1, Shang Gao1, Jonathan J. Gourley2, Tiantian Yang1, Xinyi Shen3, Randall Kolar1, and Yang Hong1 Zhi Li et al.
  • 1Hydrology and Water Security Program, Civil Engineering and Environmental Sciences, University of Oklahoma, Norman, 73072, USA
  • 2NOAA National Severe Storms Laboratory, Norman, 73072, USA
  • 3Department of Civil and Environmental Engineering, University of Connecticut, Storrs, CT, 06269, USA

Abstract. Despite several flood databases available in the U.S., there is a benefit to combine and reconcile these diverse data sources into a comprehensive flood database with a unified common format and easy public access in order to facilitate flood-related research and applications. Typically, floods are reported by specialists or media according to their socioeconomic impacts. Recently, data-driven analysis can reconstruct flood events based on in-situ and/or remote-sensing data. Lately, with the increasing engagement of citizen scientists, there is the potential to enhance flood reporting in near-real-time. The central objective of this study is to integrate information from seven popular multi-sourced flood databases into a comprehensive flood database in the U.S., made readily available to the public in a common data format. Natural Language Processing, geocoding, and harmonizing processing steps are undertaken to facilitate such development. In total, there are 695,808 flood records in the U.S. from 1900 to the present. The database features event locations, durations, date/times, socioeconomic impacts (e.g., fatalities and economic damages), and geographic information (e.g., elevation, slope, contributing area, and land cover types retrieved from ancillary data for given flood locations). Finally, this study utilizes the flood database to analyse flood seasonality within major basins, and socioeconomic impacts over time. It is anticipated that thus far the most comprehensive yet unified database can support a variety of flood-related research, such as a validation resource for hydrologic or hydraulic simulations, hydroclimatic studies concerning spatiotemporal patterns of floods, and flood susceptibility analysis for vulnerable geophysical locations. The dataset is publicly available with the following DOI: https://doi.org/10.5281/zenodo.4547036.

Zhi Li et al.

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on essd-2021-36', Anonymous Referee #1, 20 Apr 2021
    • AC1: 'Reply on RC1', Yang Hong, 11 Jun 2021
  • RC2: 'Comment on essd-2021-36', Anonymous Referee #2, 17 May 2021
    • AC2: 'Reply on RC2', Yang Hong, 11 Jun 2021

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on essd-2021-36', Anonymous Referee #1, 20 Apr 2021
    • AC1: 'Reply on RC1', Yang Hong, 11 Jun 2021
  • RC2: 'Comment on essd-2021-36', Anonymous Referee #2, 17 May 2021
    • AC2: 'Reply on RC2', Yang Hong, 11 Jun 2021

Zhi Li et al.

Data sets

United States Flood Database Zhi Li https://doi.org/10.5281/zenodo.4547036

Zhi Li et al.

Viewed

Total article views: 472 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
345 108 19 472 7 13
  • HTML: 345
  • PDF: 108
  • XML: 19
  • Total: 472
  • BibTeX: 7
  • EndNote: 13
Views and downloads (calculated since 24 Feb 2021)
Cumulative views and downloads (calculated since 24 Feb 2021)

Viewed (geographical distribution)

Total article views: 395 (including HTML, PDF, and XML) Thereof 395 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 29 Jul 2021
Download
Short summary
This dataset is a compilation of multi-sourced flood records, retrieved from official reports, instruments, and crowdsourcing data since 1900. This study utilizes the flood database to analyze flood seasonality within major basins, and socioeconomic impacts over time. It is anticipated that this dataset can support a variety of flood-related research, such as validation resources for hydrologic models, hydroclimatic studies, and flood vulnerability analysis across the US.