Preprints
https://doi.org/10.5194/essd-2024-460
https://doi.org/10.5194/essd-2024-460
27 Nov 2024
 | 27 Nov 2024
Status: this preprint is currently under review for the journal ESSD.

A benchmark dataset for global evapotranspiration estimation based on FLUXNET2015 from 2000 to 2022

Wangyipu Li, Zhaoyuan Yao, Yifan Qu, Hanbo Yang, Yang Song, Lisheng Song, Lifeng Wu, and Yaokui Cui

Abstract. Evapotranspiration (ET) is a crucial component of the terrestrial hydrological cycle. Latent heat flux (LE, equivalent to ET in W/m2) observed by the eddy covariance (EC) technique, as known as LEEC, has been publicly recognized as highly accurate benchmark for global ET estimation. Currently, there is an increasing need for long time-series benchmark data to support climate change analysis, construction of new models, and validation of new products. However, existing LEEC datasets, like FLUXNET2015, face significant challenges due to limited observation periods and extensive data gaps. This hinders their application. To address these issues, we developed a gap-filling and prolongation framework for LEEC data and established a benchmark dataset for global ET estimation from 2000 to 2022 across 64 sites at various time scales. The framework mainly contained 3 parts: site selection and data pre-processing, gap-filled half-hourly / hourly LE data generation, and prolonged daily LE data generation. We selected 64 sites from FLUXNET2015 based on a rigorous filtering criterion. A novel bias-corrected random forest (RF) algorithm was used as the gap-filling and prolongation algorithm of the framework to produce seamless half-hourly and daily LE data. After analysis, the framework using novel bias-corrected RF algorithm achieves excellent performance both in hourly gap-filling and daily prolongation, with a median RMSE of 32.84 W/m2 and 16.58 W/m2, respectively. The algorithm significantly improved the gap-filling performance for long gaps and extreme values compared with the original RF and marginal distribution sampling (MDS) algorithm. The results demonstrate robust prolongation performance of our framework both on prolonging directions and temporal stability. There is a high consistency in data distribution between our gap-filled dataset and FLUXNET2015 dataset. In conclusion, a benchmark dataset for global ET estimation based on FLUXNET2015 from 2000 to 2022 was firstly published. This dataset can strongly provide data support for ET modelling, water-carbon cycle monitoring and climate change analysis. It is made freely available via the following repository: https://doi.org/10.5281/zenodo.13853409 (Li et al., 2024b).

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Wangyipu Li, Zhaoyuan Yao, Yifan Qu, Hanbo Yang, Yang Song, Lisheng Song, Lifeng Wu, and Yaokui Cui

Status: open (until 19 Jan 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Wangyipu Li, Zhaoyuan Yao, Yifan Qu, Hanbo Yang, Yang Song, Lisheng Song, Lifeng Wu, and Yaokui Cui

Data sets

A benchmark dataset for global evapotranspiration estimation based on FLUXNET2015 from 2000 to 2022 (V1.0) Wangyipu Li et al. https://doi.org/10.5281/zenodo.13853409

Wangyipu Li, Zhaoyuan Yao, Yifan Qu, Hanbo Yang, Yang Song, Lisheng Song, Lifeng Wu, and Yaokui Cui

Viewed

Total article views: 182 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
154 23 5 182 4 4
  • HTML: 154
  • PDF: 23
  • XML: 5
  • Total: 182
  • BibTeX: 4
  • EndNote: 4
Views and downloads (calculated since 27 Nov 2024)
Cumulative views and downloads (calculated since 27 Nov 2024)

Viewed (geographical distribution)

Total article views: 180 (including HTML, PDF, and XML) Thereof 180 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 13 Dec 2024
Download
Short summary
Due to shortcomings such as extensive data gaps and limited observation durations in current ground-based latent heat flux (LE) datasets, we developed a novel gap-filling and prolongation framework for ground-based LE observations, establishing a benchmark dataset for global evapotranspiration (ET) estimation from 2000 to 2022 across 64 sites at various time scales. This comprehensive dataset can strongly support ET modelling, water-carbon cycle monitoring, and long-term climate change analysis.
Altmetrics