the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
GHRSAT: the first global hourly dataset of all-sky remotely sensed estimates of surface air temperature
Abstract. Spatially continuous surface air temperature (SAT) is critically important for a wide range of fields such as eco-environmental assessments and hydrology. Remotely sensed estimation models based on satellite-derived thermal infrared data provides a structurally different approach for reconstructing SAT compared to spatial interpolation of ground observations of SAT and numerical modelling, which are mainly limited by the coverage of stations and coarse spatial resolutions, respectively. However, the data products of remotely sensed estimates of SAT developed in previous studies are only available at daily or monthly resolutions, and are primarily restricted for local regions. In this study, we generated the first hourly dataset (GHRSAT) of all-sky remotely sensed SAT estimates for the global land areas except Antarctica between 2011 and 2023. The hourly estimates in GHRSAT were reconstructed from land surface temperature using the hybrid estimation models that integrate random forest (RK) models and kriging techniques. The hybrid models were developed for different regions on a monthly basis. We adopted ordinary kriging (OK) and fixed rank kriging (FRK) in the modelling of the site residuals from the RF models for regions with low-density and high-density stations, respectively. Our results show that the hybrid models for generating GHRSAT have the predictive performance between 1.48 °C to 2.28 °C in overall cross-validation RMSE. The mean RMSE for estimating hourly SAT can be significantly reduced by 0.18–0.41 °C by the hybrid models compared to the RF models. We analyzed the variability in the predictive errors of estimating hourly SAT across regions, months and sites. The variability is apparently decreased when using the hybrid models. We found that the RF models are less sensitive to the parameter tuning of the RF models, which greatly impacts the hybrid models. Improving the RF models by parameter tuning can drastically improve the hybrid models based on the RF models. Additionally, we found the performance difference between OK and FRK in developing the hybrid models for regions with large amounts of stations is slight with the mean RMSE of 0.05 °C. In summary, the scheme of the hybrid models can result in satisfactorily higher performance for estimating SAT, and has the general practicability of applying to regions at various scales. The GHRSAT dataset is publicly available at http://doi.org/10.11888/RemoteSen.tpdc.301540 (Zhang, 2024).
- Preprint
(4451 KB) - Metadata XML
-
Supplement
(1908 KB) - BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on essd-2024-548', Anonymous Referee #1, 07 Jan 2025
- AC1: 'Reply on RC1', Zhenwei Zhang, 16 Mar 2025
-
RC2: 'Comment on essd-2024-548', Anonymous Referee #2, 03 Feb 2025
The manuscript essd-2024-548 has been reviewed. It presents the first global hourly surface air temperature dataset (GHRSAT) from 2011 to 2023. The manuscript demonstrates clear organization, rigorous logical flow, and natural transitions between sections. While the overall quality of the work is commendable, several critical issues require attention, as outlined in the following comments:
1. The core method of this manuscript is the RF-KR method, that is, the RF model is used to build the site-scale SAT estimation model, and then the residual is interpolated by the Kriging interpolation method to obtain pixel-by-pixel residual data. Logically, it is possible, but the validation is not sufficient, that is, whether the cross-validation used in the test process can be validated with independent data to explain its accuracy better.
2. In the process of model construction, the GHA-LST dataset is used as the main input, but it is recommended to discuss how its uncertainty will affect SAT estimation.
3. How to consider the spatial representativeness of the air temperature observed at the station on the 5km scale?
4. Figure 2, which Kriging method was used for TR-4?
5. Figure 4, why are RMSE and MAE so large in TR-1 and TR-6?
6. Figure 5, why does the RMSE of the model have such strong periodicity?
7. Line 211, what is RF-KR?
8. Line 226, Should the formula number be Eq. (3)?
9. Is the time label of the dataset local time or UTC? This is critical for users.
10. For the air temperature estimate in the case of sparse sites, please refer to these two articles: https://doi.org/10.1016/j.isprsjprs.2025.01.021;https://doi.org/10.1109/JSTARS.2022.3161800Citation: https://doi.org/10.5194/essd-2024-548-RC2 - AC2: 'Reply on RC2', Zhenwei Zhang, 16 Mar 2025
-
RC3: 'Comment on essd-2024-548', Anonymous Referee #3, 06 Feb 2025
This paper presents a hybrid estimation model, RF-KR, for estimating global land surface air temperature (SAT) on an hourly basis. The model combines Random Forest and Kriging methods to address issues in remote sensing SAT estimation, especially in areas with sparse ground station data. Below are my review comments on this paper:
1. Imbalance in Model Description: The description of the model in the methods section is unbalanced, with much more emphasis on Kriging compared to Random Forest (RF).
2. Handling Missing Data: The paper mentions removing records with poor quality in the ground station data but does not explain in detail how missing data is handled, especially missing data with different temporal and spatial resolutions. If there is a significant amount of missing data, the model’s accuracy and generalizability may be impacted.
3. Selection of Covariates: The paper uses multiple spatial covariates such as NDVI, elevation, latitude/longitude, and hour of the day, but it does not provide a detailed discussion of the rationale behind selecting these covariates or their applicability in different regions.
4. Resampling of NDVI and Elevation Data: The resampling method for NDVI and elevation data is not clearly stated, which could affect data quality.
5. Limited Model Performance Evaluation: The paper only uses RMSE and MAE as performance metrics, lacking analysis of systemic bias (Bias) or coefficient of determination (R²), which makes it difficult to fully assess the model's performance.
6. Discussion on Practical Application: The discussion section could benefit from further elaboration on how the research results could be applied to real-world problems. Additionally, the limitations of the current study should be clearly stated, along with potential directions for future improvements.
Citation: https://doi.org/10.5194/essd-2024-548-RC3 - AC3: 'Reply on RC3', Zhenwei Zhang, 16 Mar 2025
Data sets
GHRSAT: the global hourly dataset of all-sky remotely sensed estimates of surface air temperature Zhenwei Zhang http://doi.org/10.11888/RemoteSen.tpdc.301540
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
347 | 104 | 17 | 468 | 26 | 17 | 15 |
- HTML: 347
- PDF: 104
- XML: 17
- Total: 468
- Supplement: 26
- BibTeX: 17
- EndNote: 15
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1