the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A 1-km daily high-accuracy meteorological dataset of air temperature, atmospheric pressure, relative humidity, and sunshine duration across China (1961–2021)
Abstract. Fine-resolution and high-accuracy meteorological datasets are essential for understanding climate change processes and their cascading impacts on hydrology, water resources management, and ecological systems. In this study, we present a nationwide, high-resolution dataset of six daily meteorological variables across China from 1961 to 2021, including average temperature, maximum temperature, minimum temperature, atmospheric pressure, relative humidity, and sunshine duration. The dataset was generated through a hierarchical reconstruction framework that utilizes daily observations from 2345 meteorological stations across China, combined with station-level topographic attributes (latitude, longitude, and elevation). By decoding the nonlinear relationships among six meteorological variables and their spatial covariates, the framework enables the generation of gridded daily fields at 1 km resolution with spatial continuity and internal consistency. Validation against 118 in-situ stations confirms that the dataset achieves high accuracy across all variables, with average, maximum, and minimum temperatures exhibiting minimal errors (median RMSEs: 1.03 °C, 1.19 °C, 1.34 °C; median MEs: -0.09 °C, -0.10 °C, -0.08 °C) and high consistency with in-situ data (median CCs: 1.00, 0.99, 0.99). Atmospheric pressure shows minimal error (median RMSE: 2.48 hPa; median ME: -0.02 hPa) and high consistency (median CC: 0.98). Although relative humidity has slightly weaker accuracy (median RMSE: 6.02 %; median ME: -0.5 %; median CC: 0.90), it still surpasses standard benchmarks. Sunshine duration maintains high precision (median RMSE: 1.48 h; median ME: 0.05 h; median CC: 0.93), demonstrating overall excellent product quality. Further comparison reveals that in high-altitude and topographically complex regions, the reconstructed product demonstrates higher actual accuracy than suggested by station-to-grid validation, as spatial mismatches between stations and grid cells lead to systematic underestimation. Free access to the dataset available at https://doi.org/10.11888/Atmos.tpdc.301341 or https://cstr.cn/18406.11.Atmos.tpdc.301341.
- Preprint
(2471 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 27 Aug 2025)
-
RC1: 'Comment on essd-2025-291', Anonymous Referee #1, 04 Jul 2025
reply
- This paper develops a long sequence dataset, and the research purpose and user target population of this dataset need to be further clarified;
- “The objective of this study is to develop a high-resolution and accuracy-assessed dataset of daily near-surface meteorological variables across mainland China, suitable for applications in hydrological modeling, environmental monitoring, and climate analysis.” How did the author consider the issue of temporal "homogeneity" in a long series dataset used for climate analysis?
- What is the quality of the raw observation data used to establish 1km grid data? Has the author conducted data quality evaluation, analysis, quality control, homogenization processing, etc. on the original observation data during its use?
- Compared to a grid spatial resolution of 1km * 1km, using over 2000 observation data from China is relatively insufficient, especially in the sparse observation areas of western China. How does the author consider this issue?
- How is the " day boundary issues" handled? The ground meteorological observation in China adopts "20:00 Beijing time" as the boundary point of the day, which means that the observation day is from 20:00 to 20:00 the next day. This standard is applicable to daily value statistics of factors such as precipitation and temperature. Prior to the 1980s, some stations had a phenomenon of inconsistent day boundaries (such as a few stations using 08 or local time), which led to a decrease in comparability between early data and other stations.
- Overall evaluation: This long sequence dataset did not take into account the quality of the observation data used, day boundary issues, uniformity issues, etc. during the development process. Therefore, the dataset reconstructed in this article also has day boundary and uniformity issues, which will have a serious impact on downstream user research.
Citation: https://doi.org/10.5194/essd-2025-291-RC1 -
AC1: 'Reply on RC1', keke zhao, 07 Jul 2025
reply
We thank the reviewer for their valuable comments. The attached file contains our point-by-point responses.
-
RC2: 'Comment on essd-2025-291', Anonymous Referee #2, 12 Aug 2025
reply
This paper constructs a long-term meteorological variable dataset by decoding the nonlinear relationships between six meteorological variables and their spatial covariates. The method is innovative and the dataset is usable, but the paper needs to be revised based on the following points.
- Abbreviations such as “CC” in the abstract should be spelled out in full.
- The abstract does not adequately reflect the research objectives and significance of the study and needs to be improved.
- The first paragraph of the introduction should provide some supporting citations.
- I noticed that the “Materials” section includes observation sites from different sources, but the verification sites were only selected from CMA. Have you considered selecting verification sites based on the weight of the number of sites from different sources?
- How to consider the cumulative error caused by progressively inputted meteorological variables during the modeling process, especially in the sunshine duration model.
- How can authors reduce errors caused by the boundaries of the study area during modeling, given that these areas have fewer observation stations?
- I noticed that Figure 4 contains a large amount of information, but the poor resolution and color quality make it difficult to see clearly. Please improve this.
- Please indicate whether adjustments were made when the author encountered situations where the sunshine duration was less than 0 during the verification.
- The author mentioned the limitations of satellite remote sensing in estimating the meteorological variables in the “introduction”. However, sunshine duration is greatly affected by cloud and aerosol parameters observed by remote sensing. When comparing sunshine duration products, please consider comparing sunshine duration datasets estimated based on remote sensing data (https://doi.org/10.5194/essd-17-1427-2025) and explain the advantages of the research method used in this study.
- Is the model independent on a daily scale? Did the authors consider modeling based on different days of year (DOYs) to enhance the model's generalization ability in the future?
Citation: https://doi.org/10.5194/essd-2025-291-RC2
Data sets
China's 1km daily reconstructed product of six meteorological elements (1961–2021) Keke Zhao, Denghua Yan, Tianling Qin, Chenhao Li, Dingzhi Peng, Yifan Song https://doi.org/10.11888/Atmos.tpdc.301341
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
371 | 45 | 22 | 438 | 11 | 17 |
- HTML: 371
- PDF: 45
- XML: 22
- Total: 438
- BibTeX: 11
- EndNote: 17
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1