GHRSAT: the first global hourly dataset of all-sky remotely sensed estimates of surface air temperature

Zhang, Zhenwei; Wang, Ke; Liang, Chen; You, Zihan; Yi, Zishang

doi:10.5194/essd-2024-548

Preprints

https://doi.org/10.5194/essd-2024-548

Preprints

06 Dec 2024

| 06 Dec 2024

Status: this preprint was under review for the journal ESSD but the revision was not accepted.

GHRSAT: the first global hourly dataset of all-sky remotely sensed estimates of surface air temperature

Zhenwei Zhang, Ke Wang, Chen Liang, Zihan You, and Zishang Yi

Abstract. Spatially continuous surface air temperature (SAT) is critically important for a wide range of fields such as eco-environmental assessments and hydrology. Remotely sensed estimation models based on satellite-derived thermal infrared data provides a structurally different approach for reconstructing SAT compared to spatial interpolation of ground observations of SAT and numerical modelling, which are mainly limited by the coverage of stations and coarse spatial resolutions, respectively. However, the data products of remotely sensed estimates of SAT developed in previous studies are only available at daily or monthly resolutions, and are primarily restricted for local regions. In this study, we generated the first hourly dataset (GHRSAT) of all-sky remotely sensed SAT estimates for the global land areas except Antarctica between 2011 and 2023. The hourly estimates in GHRSAT were reconstructed from land surface temperature using the hybrid estimation models that integrate random forest (RK) models and kriging techniques. The hybrid models were developed for different regions on a monthly basis. We adopted ordinary kriging (OK) and fixed rank kriging (FRK) in the modelling of the site residuals from the RF models for regions with low-density and high-density stations, respectively. Our results show that the hybrid models for generating GHRSAT have the predictive performance between 1.48 °C to 2.28 °C in overall cross-validation RMSE. The mean RMSE for estimating hourly SAT can be significantly reduced by 0.18–0.41 °C by the hybrid models compared to the RF models. We analyzed the variability in the predictive errors of estimating hourly SAT across regions, months and sites. The variability is apparently decreased when using the hybrid models. We found that the RF models are less sensitive to the parameter tuning of the RF models, which greatly impacts the hybrid models. Improving the RF models by parameter tuning can drastically improve the hybrid models based on the RF models. Additionally, we found the performance difference between OK and FRK in developing the hybrid models for regions with large amounts of stations is slight with the mean RMSE of 0.05 °C. In summary, the scheme of the hybrid models can result in satisfactorily higher performance for estimating SAT, and has the general practicability of applying to regions at various scales. The GHRSAT dataset is publicly available at http://doi.org/10.11888/RemoteSen.tpdc.301540 (Zhang, 2024).

Received: 20 Nov 2024 – Discussion started: 06 Dec 2024

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 4451 KB)

Supplement (1908 KB)

Download & links

Zhenwei Zhang, Ke Wang, Chen Liang, Zihan You, and Zishang Yi

Status: closed

RC1:
'Comment on essd-2024-548', Anonymous Referee #1, 07 Jan 2025

Please see the comments in the attached file.

Citation: https://doi.org/10.5194/essd-2024-548-RC1
- AC1: 'Reply on RC1', Zhenwei Zhang, 16 Mar 2025
  
  Thanks very much for your valuable comments on our manuscript. Our responses to your comments are provided in the supplement.
  
  Citation: https://doi.org/10.5194/essd-2024-548-AC1
RC2:
'Comment on essd-2024-548', Anonymous Referee #2, 03 Feb 2025

The manuscript essd-2024-548 has been reviewed. It presents the first global hourly surface air temperature dataset (GHRSAT) from 2011 to 2023. The manuscript demonstrates clear organization, rigorous logical flow, and natural transitions between sections. While the overall quality of the work is commendable, several critical issues require attention, as outlined in the following comments:
1. The core method of this manuscript is the RF-KR method, that is, the RF model is used to build the site-scale SAT estimation model, and then the residual is interpolated by the Kriging interpolation method to obtain pixel-by-pixel residual data. Logically, it is possible, but the validation is not sufficient, that is, whether the cross-validation used in the test process can be validated with independent data to explain its accuracy better.

2. In the process of model construction, the GHA-LST dataset is used as the main input, but it is recommended to discuss how its uncertainty will affect SAT estimation.

3. How to consider the spatial representativeness of the air temperature observed at the station on the 5km scale?

4. Figure 2, which Kriging method was used for TR-4?

5. Figure 4, why are RMSE and MAE so large in TR-1 and TR-6?

6. Figure 5, why does the RMSE of the model have such strong periodicity?

7. Line 211, what is RF-KR?

8. Line 226, Should the formula number be Eq. (3)?

9. Is the time label of the dataset local time or UTC? This is critical for users.

10. For the air temperature estimate in the case of sparse sites, please refer to these two articles: https://doi.org/10.1016/j.isprsjprs.2025.01.021；https://doi.org/10.1109/JSTARS.2022.3161800

Citation: https://doi.org/10.5194/essd-2024-548-RC2
- AC2: 'Reply on RC2', Zhenwei Zhang, 16 Mar 2025
  
  Thanks very much for your valuable comments on our manuscript. Our responses to your comments are provided in the supplement.
  
  Citation: https://doi.org/10.5194/essd-2024-548-AC2
RC3:
'Comment on essd-2024-548', Anonymous Referee #3, 06 Feb 2025

This paper presents a hybrid estimation model, RF-KR, for estimating global land surface air temperature (SAT) on an hourly basis. The model combines Random Forest and Kriging methods to address issues in remote sensing SAT estimation, especially in areas with sparse ground station data. Below are my review comments on this paper:
1. Imbalance in Model Description: The description of the model in the methods section is unbalanced, with much more emphasis on Kriging compared to Random Forest (RF).
2. Handling Missing Data: The paper mentions removing records with poor quality in the ground station data but does not explain in detail how missing data is handled, especially missing data with different temporal and spatial resolutions. If there is a significant amount of missing data, the model’s accuracy and generalizability may be impacted.
3. Selection of Covariates: The paper uses multiple spatial covariates such as NDVI, elevation, latitude/longitude, and hour of the day, but it does not provide a detailed discussion of the rationale behind selecting these covariates or their applicability in different regions.
4. Resampling of NDVI and Elevation Data: The resampling method for NDVI and elevation data is not clearly stated, which could affect data quality.
5. Limited Model Performance Evaluation: The paper only uses RMSE and MAE as performance metrics, lacking analysis of systemic bias (Bias) or coefficient of determination (R²), which makes it difficult to fully assess the model's performance.
6. Discussion on Practical Application: The discussion section could benefit from further elaboration on how the research results could be applied to real-world problems. Additionally, the limitations of the current study should be clearly stated, along with potential directions for future improvements.

Citation: https://doi.org/10.5194/essd-2024-548-RC3
- AC3: 'Reply on RC3', Zhenwei Zhang, 16 Mar 2025
  
  Thanks very much for your valuable comments on our manuscript. Our responses to your comments are provided in the supplement.
  
  Citation: https://doi.org/10.5194/essd-2024-548-AC3

Status: closed

RC1:
'Comment on essd-2024-548', Anonymous Referee #1, 07 Jan 2025

Please see the comments in the attached file.

Citation: https://doi.org/10.5194/essd-2024-548-RC1
- AC1: 'Reply on RC1', Zhenwei Zhang, 16 Mar 2025
  
  Thanks very much for your valuable comments on our manuscript. Our responses to your comments are provided in the supplement.
  
  Citation: https://doi.org/10.5194/essd-2024-548-AC1
RC2:
'Comment on essd-2024-548', Anonymous Referee #2, 03 Feb 2025

The manuscript essd-2024-548 has been reviewed. It presents the first global hourly surface air temperature dataset (GHRSAT) from 2011 to 2023. The manuscript demonstrates clear organization, rigorous logical flow, and natural transitions between sections. While the overall quality of the work is commendable, several critical issues require attention, as outlined in the following comments:
1. The core method of this manuscript is the RF-KR method, that is, the RF model is used to build the site-scale SAT estimation model, and then the residual is interpolated by the Kriging interpolation method to obtain pixel-by-pixel residual data. Logically, it is possible, but the validation is not sufficient, that is, whether the cross-validation used in the test process can be validated with independent data to explain its accuracy better.

2. In the process of model construction, the GHA-LST dataset is used as the main input, but it is recommended to discuss how its uncertainty will affect SAT estimation.

3. How to consider the spatial representativeness of the air temperature observed at the station on the 5km scale?

4. Figure 2, which Kriging method was used for TR-4?

5. Figure 4, why are RMSE and MAE so large in TR-1 and TR-6?

6. Figure 5, why does the RMSE of the model have such strong periodicity?

7. Line 211, what is RF-KR?

8. Line 226, Should the formula number be Eq. (3)?

9. Is the time label of the dataset local time or UTC? This is critical for users.

10. For the air temperature estimate in the case of sparse sites, please refer to these two articles: https://doi.org/10.1016/j.isprsjprs.2025.01.021；https://doi.org/10.1109/JSTARS.2022.3161800

Citation: https://doi.org/10.5194/essd-2024-548-RC2
- AC2: 'Reply on RC2', Zhenwei Zhang, 16 Mar 2025
  
  Thanks very much for your valuable comments on our manuscript. Our responses to your comments are provided in the supplement.
  
  Citation: https://doi.org/10.5194/essd-2024-548-AC2
RC3:
'Comment on essd-2024-548', Anonymous Referee #3, 06 Feb 2025

This paper presents a hybrid estimation model, RF-KR, for estimating global land surface air temperature (SAT) on an hourly basis. The model combines Random Forest and Kriging methods to address issues in remote sensing SAT estimation, especially in areas with sparse ground station data. Below are my review comments on this paper:
1. Imbalance in Model Description: The description of the model in the methods section is unbalanced, with much more emphasis on Kriging compared to Random Forest (RF).
2. Handling Missing Data: The paper mentions removing records with poor quality in the ground station data but does not explain in detail how missing data is handled, especially missing data with different temporal and spatial resolutions. If there is a significant amount of missing data, the model’s accuracy and generalizability may be impacted.
3. Selection of Covariates: The paper uses multiple spatial covariates such as NDVI, elevation, latitude/longitude, and hour of the day, but it does not provide a detailed discussion of the rationale behind selecting these covariates or their applicability in different regions.
4. Resampling of NDVI and Elevation Data: The resampling method for NDVI and elevation data is not clearly stated, which could affect data quality.
5. Limited Model Performance Evaluation: The paper only uses RMSE and MAE as performance metrics, lacking analysis of systemic bias (Bias) or coefficient of determination (R²), which makes it difficult to fully assess the model's performance.
6. Discussion on Practical Application: The discussion section could benefit from further elaboration on how the research results could be applied to real-world problems. Additionally, the limitations of the current study should be clearly stated, along with potential directions for future improvements.

Citation: https://doi.org/10.5194/essd-2024-548-RC3
- AC3: 'Reply on RC3', Zhenwei Zhang, 16 Mar 2025
  
  Thanks very much for your valuable comments on our manuscript. Our responses to your comments are provided in the supplement.
  
  Citation: https://doi.org/10.5194/essd-2024-548-AC3

Zhenwei Zhang, Ke Wang, Chen Liang, Zihan You, and Zishang Yi

Supplement

https://doi.org/10.5194/essd-2024-548-supplement

Data sets

GHRSAT: the global hourly dataset of all-sky remotely sensed estimates of surface air temperature Zhenwei Zhang http://doi.org/10.11888/RemoteSen.tpdc.301540

Zhenwei Zhang, Ke Wang, Chen Liang, Zihan You, and Zishang Yi

Viewed

Total article views: 1,206 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
991	179	36	1,206	63	38	90

HTML: 991
PDF: 179
XML: 36
Total: 1,206
Supplement: 63
BibTeX: 38
EndNote: 90

Views and downloads (calculated since 06 Dec 2024)

Month	HTML	PDF	XML	Total
Dec 2024	146	65	5	216
Jan 2025	77	12	5	94
Feb 2025	79	10	3	92
Mar 2025	49	20	4	73
Apr 2025	40	3	1	44
May 2025	27	6	3	36
Jun 2025	27	3	8	38
Jul 2025	40	2	2	44
Aug 2025	78	3	0	81
Sep 2025	286	4	1	291
Oct 2025	39	20	1	60
Nov 2025	39	21	2	62
Dec 2025	53	8	1	62
Jan 2026	11	2	0	13

Cumulative views and downloads (calculated since 06 Dec 2024)

Month	HTML	PDF	XML	Total
Dec 2024	146	65	5	216
Jan 2025	77	12	5	94
Feb 2025	79	10	3	92
Mar 2025	49	20	4	73
Apr 2025	40	3	1	44
May 2025	27	6	3	36
Jun 2025	27	3	8	38
Jul 2025	40	2	2	44
Aug 2025	78	3	0	81
Sep 2025	286	4	1	291
Oct 2025	39	20	1	60
Nov 2025	39	21	2	62
Dec 2025	53	8	1	62
Jan 2026	11	2	0	13

Viewed (geographical distribution)

Total article views: 1,182 (including HTML, PDF, and XML) Thereof 1,182 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 05 Jan 2026

Download

Preprint (4451 KB)
Metadata XML

Short summary

Reconstructing spatial surface air temperature based on remotely sensed data is critically important for various fields. Our work developed the models by the hybrid scheme that integrates machine learning and spatial kriging for reconstructing the product of all-sky hourly estimates of surface air temperature for global land areas. The models for the product were fully validated and analyzed. The scheme for generating the product has the general practicability of applications.


Total:	0
HTML:	0
PDF:	0
XML:	0