Articles | Volume 17, issue 8
https://doi.org/10.5194/essd-17-3835-2025
© Author(s) 2025. This work is distributed under the Creative Commons Attribution 4.0 License.
A benchmark dataset for global evapotranspiration estimation based on FLUXNET2015 from 2000 to 2022
Download
- Final revised paper (published on 08 Aug 2025)
- Preprint (discussion started on 27 Nov 2024)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on essd-2024-460', Anonymous Referee #1, 24 Jan 2025
- AC1: 'Reply on RC1', Yaokui Cui, 28 Apr 2025
-
RC2: 'Comment on essd-2024-460', Anonymous Referee #2, 17 Apr 2025
- AC2: 'Reply on RC2', Yaokui Cui, 28 Apr 2025
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Yaokui Cui on behalf of the Authors (28 Apr 2025)
Author's response
Author's tracked changes
Manuscript
ED: Publish subject to minor revisions (review by editor) (29 Apr 2025) by Peng Zhu
ED: Publish as is (04 May 2025) by Peng Zhu
AR by Yaokui Cui on behalf of the Authors (10 May 2025)
Post-review adjustments
AA – Author's adjustment | EA – Editor approval
AA by Yaokui Cui on behalf of the Authors (28 Jul 2025)
Author's adjustment
Manuscript
EA: Adjustments approved (05 Aug 2025) by Peng Zhu
The research article discusses the development of a benchmark dataset for global evapotranspiration (ET) estimation, addressing limitations in existing latent heat flux (LE) data from the FLUXNET2015 dataset. Current datasets suffer from short observation periods and significant data gaps, hindering climate change analysis and model validation. To overcome these challenges, the authors created a gap-filling and prolongation framework that generates seamless half-hourly and daily LE data from 2000 to 2022 across 64 sites. They employed a novel bias-corrected random forest algorithm for improved data accuracy, achieving a median RMSE of 32.84 W/m² for hourly and 16.58 W/m² for daily data. The resulting dataset enhances ET modeling, water-carbon cycle monitoring, and climate change research.
The study is one of the pioneering efforts to utilize a bias-corrected random forest approach to enhance data gap-filling performance. I suggest minor revisions to address some specific questions before proceeding with publication.
Some minor issues:
Figure 3 - From the diagram there are two RF models being trained and evaluated. Please indicate that LE and Bias without single quote serve as observational ground-truth labels in Model training box.
In Model validation box, there is only predicted values instead of true values being indicated. Please add that true LE and Bias are used to evaluate the performance of RF1 and RF2 and indicate performance metrics used for each model validation.
Figure 4 – It is hard to conclude Bias-corrected RF has better performance than the other two approaches as the mean values of RMSE of those three are tightly close to each other shown in the figure. Consider adding data labels to the mean RMSE values in the figure to highlight the findings. Same for Figure 5.
Line 185 – Please elaborate more on how you choose the best hyperparameters from 64 models. 64 models with 64 sets of parameters are obtained. For the sites with similar land type, are those models combined into one unified model by taking averages of parameters or still using different sets of parameters? Please explain it in more details.
In discussion section, please add potential limitations from this study in terms of variable importance, sensitivity and stability.