the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
TPHH: A long-term (1901–2023) high-resolution (1/30°) near-surface humidity dataset for the Tibetan Plateau generated via spatial downscaling based on hybrid-structure deep learning
Abstract. The Tibetan Plateau acts as the "Asian Water Tower" and faces regional amplified warming compared to the global climate change baseline. Given the Tibet Plateau’s pronounced alpine terrain, i.e., significant elevation gradients within short horizontal distances, studies on climate changes/dynamics over this mountainous region fundamentally depend on spatially high-resolution datasets. However, most of currently available spatially high-resolution datasets only extend back to the 1980s, with prolonged temporal coverage data of pre-satellite era remaining scarce, especially for near surface atmospheric humidity. Thus, our study implements a hybrid-structure-based deep learning framework to generate monthly 2 m specific humidity, 2 m temperature and surface pressure at 1/30° × 1/30° horizontal resolution during 1901–2023. Briefly, employing a hybrid-structure model (FourCastNet by NVIDIA®), historical high-resolution fields (1/30° × 1/30° covering 1901–2023) are generated based on long-range low-resolution (0.5° × 0.5° covering 1901–2023 from CRU) and short-range high-resolution fields (1/30° × 1/30° covering 1978–2023 from TPMFD) via spatial downscaling. The produced data were validated against multiple related datasets, with independent in-situ site observations serving as the reference, and showed superior performance compared to most of them. Our study demonstrates that in topographically complex regions like the Tibetan Plateau, where meteorological fields exhibit strong physical dependencies on terrain, the synergistic mapping between total-field signals and subregional terrain constraints can be effectively achieved through hybrid-structure deep learning, thereby enabling this physically-consistent downscaling approach. Open access to this dataset is at https://doi.org/10.57760/sciencedb.36169.
- Preprint
(26915 KB) - Metadata XML
-
Supplement
(5554 KB) - BibTeX
- EndNote
Status: open (until 12 Jun 2026)
- RC1: 'Comment on essd-2026-180', Anonymous Referee #1, 26 May 2026 reply
-
RC2: 'Comment on essd-2026-180', Anonymous Referee #2, 04 Jun 2026
reply
General comment
This study addresses a critical challenge in high-altitude climatology by constructing TPHH, a long-term (1901–2023), high-resolution (1/30°) near-surface humidity, temperature, and surface pressure dataset for the Tibetan Plateau. The work is highly significant as it bridges the pre-satellite era data gap, providing much-needed spatial detail for a region characterized by extreme topographic complexity and elevation-dependent warming. The methodological innovation lies in the application of the FourCastNet architecture, which synergistically integrates Vision Transformers and Adaptive Fourier Neural Operators to map coarse-resolution signals from the CRU dataset onto a high-resolution grid governed by terrain constraints. The validation effort is commendable, employing not only a network of 135 independent meteorological stations but also an innovative biological cross-verification using 41 tree-ring proxy groups to ensure temporal fidelity in the early 20th century. This dataset could provide a foundational material for studying long-term hydrological cycles, glacier dynamics, and ecosystem responses to climate change on the Tibetan Plateau. To meet the stringent standards of ESSD, it is essential that the authors thoroughly address the concerns raised below, specifically regarding the physical grounding of the FourCastNet architecture and the reliability of reconstructions in the data-sparse western Tibetan Plateau.
Major comments
1. Quantification of uncertainty in the pre-1950s observational void
The authors utilize CRU TS 4.08 as the primary predictor. It is well-documented that before the 1950s, ground-based observations within the Tibetan Plateau were virtually non-existent, and the CRU signal in this period relies heavily on spatial interpolation from distant stations outside the plateau. While the deep learning model successfully learns fine-scale "terrain textures" from the TPMFD training period (1979–2023), there is a risk that the reconstructed high-resolution features in the early 20th century are merely reflections of static topographic constraints rather than actual climatic fluctuations. The authors must provide a deeper discussion or a sensitivity analysis comparing the model's error characteristics before and after the densification of the observation network (e.g., around 1950).
2. Theoretical justification of physical consistency
The manuscript repeatedly emphasizes that the hybrid-structure deep learning approach is physically-consistent. However, FourCastNet is essentially a data-driven super-resolution model. While it effectively captures "terrain governance," it is unclear whether the model architecture or loss functions explicitly enforce thermodynamic laws (e.g., the Clausius-Clapeyron equation or the relationship between air pressure and hypsometry). The authors should clarify if this consistency is merely a spatial correlation with topography or if the model satisfies fundamental meteorological equations during the downscaling process.
3. Spatial Bias in Validation and Representative Uncertainty
As shown in Figure 1 and Figure 7, the 135 validation stations are predominantly clustered in the eastern and southern edges or river valleys of the plateau. The western interior and the high-altitude “No Man's Land” (Qiangtang) are almost entirely unrepresented in the ground-truth network. Given that the model's RMSE is higher in complex terrain, the authors should provide a spatial uncertainty map or an analysis of residuals across elevation gradients to quantify the reliability of the data in these unmonitored high-altitude regions.
4. Phenological logic of tree-ring variable selection (equation 5)
In the ternary regression model for tree-ring validation, the authors selected April-May-June (AMJ) humidity as a key factor. The choice of this specific seasonal window requires stronger phenological justification. In many parts of the Tibetan Plateau, tree growth may be more sensitive to JJA (Summer) humidity or have lagged responses to winter moisture. The authors should explain why the AMJ window was selected as the primary moisture constraint across all 41 proxy groups.
Minor Comments
Line 22 & Supplement (Fig S1, S2 titles): There is an inconsistency in the dataset’s acronym. The supplement uses “THPP” while the main text uses "TPHH". Please standardize to TPHH throughout.
Line 34: The text mentions “87 independent ground-based meteorological stations”, but Line 56 and Figure 1 refer to “135 stations”. Please verify the exact number used for historical validation (1901–1978).
Line 225 (Equation 1): The formatting for the R2 formula is slightly garbled in the PDF, with overlapping characters in the denominator. Please re-typeset using standard LaTeX.
Line 290: The text states, “As demonstrated in Table 1...”, but the performance metrics are actually presented in Table 3. This is a citation error.
Lines 505, 520, 530 of References: Several citations are listed for 2025 or 2026 (e.g., Jiang et al., 2025; O’Neill et al., 2025). Please indicate if these are preprints or currently in press.
Figure 6 (RMSE Heatmap): The color scale for specific humidity is quite narrow (max 0.40 g/kg). It is difficult to discern spatial variance in the plateau interior. Consider using a more contrast-enhancing color ramp for the humidity panel.
Figure 12: The Pearson correlation coefficient (0.82) is unexpected high. Please include the significance level (p-value) to strengthen the statistical argument.Citation: https://doi.org/10.5194/essd-2026-180-RC2
Data sets
TPHH: A long-term (1901–1978) high-resolution (1/30°) reconstruction of meteorological variables over the Tibetan Plateau Zezhou Chen https://doi.org/10.57760/sciencedb.36169
Viewed
| HTML | XML | Total | Supplement | BibTeX | EndNote | |
|---|---|---|---|---|---|---|
| 337 | 150 | 22 | 509 | 46 | 20 | 23 |
- HTML: 337
- PDF: 150
- XML: 22
- Total: 509
- Supplement: 46
- BibTeX: 20
- EndNote: 23
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Given that near-surface humidity and temperature are critical drivers of glacier ablation (surface energy balance), permafrost degradation, and alpine ecosystem evolution, this manuscript addresses the severe scarcity of centennial-scale, high-resolution datasets over the Tibetan Plateau by proposing an innovative hybrid-structure deep learning downscaling approach based on FourCastNet. Nevertheless, the current manuscript still exhibits several critical limitations that must be addressed through a major revision before it can be considered for publication.
Major
and 4) Whether there is any duplication of data information between the target dataset and the validation procedure. In addition, Figure 7 appears to use the 125/135 CMA stations rather than the 87 independent stations mentioned earlier in the manuscript. The rationale for this choice is unclear. If independent stations are available, it would seem more appropriate to use them for the primary validation analyses in Figure 7 to avoid potential dependence between TPMFD and the validation dataset.
4) The reconstructed 1901–1978 historical climate fields rely entirely on a deep learning model trained on the 1979–2023 modern period. This implicitly assumes statistical stationarity in the relationship between the coarse CRU predictors and the fine-scale TPMFD targets across vastly different climatic regimes. However, this critical assumption is not rigorously evaluated. Prior to 1979, station coverage over the Tibetan Plateau was extremely sparse, and early CRU grids heavily rely on large-scale interpolations that lack real mesoscale gradients. Applying a model overly optimized for modern terrain features to these early interpolated fields risks generating artificial spatial features or "homogenized" artifacts. The brief mention in Section 6.3 is insufficient. The authors must: (1) refrain from treating the reliability of the early-century reconstruction on par with the post-1979 period, and (2) substantially expand the discussion on how sparse early observations may affect the robustness, spatial variability, and uncertainty of the reconstructed 1901–1978 climate fields.
Minor:
1.The major acronyms should be fully expanded upon their first appearance in both the Abstract and the main text. The acronyms like "CRU" and "CMA" are not spelled out at their first mention. Please check the entire text for similar errors.
2.Line 70, CRU or CRU_TS?