the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
1 km Monthly Precipitation and Temperatures Dataset for China from 1952 to 2019 based on a Brand-New and High-Quality Baseline Climatology Surface
Abstract. Long-term climate data and high-quality baseline climatology surface with high resolution are highly essential to multiple fields in climatological, ecological, hydrological, and environmental sciences. Here, we created a brand-new baseline climatology surface (ChinaClim_baseline) and developed a 1 km monthly precipitation and temperatures dataset in China during 1952–2019 (ChinaClim_timeseries). Thin plate spline (TPS) algorithm in each month with different model formulations by accounting for satellite-driven products, was used to generate ChinaClim_baseline and monthly climate anomaly surface. Meanwhile, climatologically aided interpolation (CAI) was used to superimpose monthly anomaly surface with ChinaClim_baseline to generate ChinaClim_timeseries. Our results showed that ChinaClim_baseline exhibited very high performance. For precipitation estimation, the value of all R2 was over 0.860, and the values of RMSEs and MAEs were 8.149 mm~21.959 mm and 2.787~14.125 mm, respectively. Temperature elements had an average R2 of 0.967~0.992, an average MAEs of 0.321~0.785 °C, and an average RMSEs between 0.485 and 1.233 °C for all months. ChinaClim_baseline performed much better than WorldClim2 and CHELSA and there were many spatial discrepancies captured among those surfaces, especially in summer months and the regions with low-density weather stations in temperate continental and high cold Tibetan Plateau. For ChinaClim_timeseries, precipitation had an average R2 of 0.699~0.923, an average RMSE between 7.449 mm and 56.756 mm, and an average of MAE of 4.263~40.271 mm for all months. Temperature elements had an average R2 of 0.936~0.985, an average RMSE between 0.807 °C and 1.766 °C, and an average MAE of 0.548~1.236 °C for all months. Compared with Peng's climate surface and CHELSAcruts, R2 increased by approximately 6 %, RMSE and MAE decreased by approximately 15 % for precipitation; R2 of temperatures had no obviously changes, but RMSE and MAE decreased by 8.37~34.02 %. The results showed that the interannual variations of ChinaClim_timeseries performed much better than other datasets, thanks to the help of ChinaClim_baseline and satellite-driven products. However, ChinaClim_baseline did not significantly improve the accuracy of precipitation estimation, but it greatly improved the accuracy of temperature estimation; the satellite-driven TRMM3B43 anomaly greatly improve the accuracy of precipitation estimation after 1998, while the LST anomaly did not effectively improve the accuracy of temperature estimation. ChinaClim_baseline can be used as an excellent baseline climatology surface for obtaining high-quality and long-term climate datasets from past to future. In the meantime, ChinaClim_timeseries of 1 km spatial resolution based on ChinaClim_baseline, is very suitable for investigating the spatial-temporal climate changes and their impacts on eco-environmental systems in China. Here, ChinaClim_baseline is available at https://doi.org/10.5281/zenodo.4287824 (Gong, 2020a), ChinaClim_timeseries of precipitation is available at https://doi.org/10.5281/zenodo.4288388 (Gong, 2020b), ChinaClim_timeseries of maximum temperature is available at https://doi.org/10.5281/zenodo.4288390 (Gong, 2020c) and ChinaClim_timeseries of minimum temperature is available at https://doi.org/10.5281/zenodo.4288392 (Gong, 2020d).
This preprint has been withdrawn.
-
Withdrawal notice
This preprint has been withdrawn.
-
Preprint
(2422 KB)
-
Supplement
(71 KB)
-
This preprint has been withdrawn.
- Preprint
(2422 KB) - Metadata XML
-
Supplement
(71 KB) - BibTeX
- EndNote
Interactive discussion
Status: closed
- RC1: 'Comment on essd-2020-361', Anonymous Referee #1, 07 Feb 2021
-
RC2: 'Comment on essd-2020-361', Anonymous Referee #2, 31 Jul 2021
Some major concerns on the manuscript titled “1 km Monthly Precipitation and Temperatures Dataset for China from 1952 to 2019 based on a Brand-New and High-Quality Baseline Climatology Surface” are listed below:
First, except for combining satellite-based precipitation and tempeature data, it is hard to tell the novelty of this study in terms of methodology for creating the 1-km monthly datasets, especially given there are already some datasets available for ecological, hydrological studies etc.
Second, the method for generating “ChinaClim_timeseries” is questionable or, to some degree, wrong. It is unthinkable that the authors used “ChinaClim_baseline” to obtain the anomaly time series. Why not use the 30-years mean normal from weather stations to derive the anomaly time series? The method described in Section 3.2 and Fig. 3 is incomprehensible.
Third, the evaluation on the accuracy of ChinaClim_baseline and ChinaClim_timeseries is also questionable. Because the weather stations used to evaluate ChinaClim_baseline and ChinaClim_timeseries are different from those used to evaluate the WorldClim2 and CHELSA, it is hard to infer that the quality of ChinaClim_baseline and ChinaClim_timeseries is better than those of compared datasets, respectively.
Fourth, the determination coefficient R2, MAE and RMSE are used to compare the accuracy of ChinaClim_baseline and ChinaClim_timeseries with those of other datasets. Even thought the values of R2 is slightly higher while the values of MAE and RMSE are slightly lower for the newly-created datasets, are the differences statistically significant? If not, it just suggests there are no significant differences between the newly-created datasets and those existing ones.
Fifth, the colors used for creating Figure 5, 7, 9, and 11 are bad. The divergent or sequential colors had better be used correctly to map the data. For example, red color is good for high values and blue color is good for low values.
Sixth, time-series of daily temperature and precipitation data are highly valuable for hydrological and ecological studies. From the current version of the newly-created monthly datasets, it is difficult to see the significance of the datasets, at least for hydrological studies.
Seventh, the method for creating ChinaClim_baseline is not very clear. The step 5 (on page 11), i.e.,”(5) Repeat steps 2 to 4 for 10 times, and final baseline climatology surface (ChinaClim_baseline) was created by averaging ten surfaces” means that the nine-folds weather station data used as training data will vary as the process repeats. For each repeat, did you evaluate the accuracy of model formulations for each month?
Eighth, is there any overfitting problem when creating the so-called brand-new datasets?
Citation: https://doi.org/10.5194/essd-2020-361-RC2
Interactive discussion
Status: closed
-
RC1: 'Comment on essd-2020-361', Anonymous Referee #1, 07 Feb 2021
As authors said high quality and resolution baseline climatology along with long time series climate data are very important for multiple fields in climatological, ecological, hydrological, and environmental sciences. This study has generated a high quality ChinaClim_baseline based on lots of weather stations and remote sensing data, and then 1km ChinaClim_timeseries based on ChinaClim_baseline and remote sensing data. Compared with the previous climate datasets, this study really improved the estimation accuracy, especially in the areas with low-density weather stations and during April to October, where and when are usually hard to improve the estimation. More interesting, this study found that high quality baseline climatology can greatly improve the estimation of temperature, but less improve that of precipitation. In contrast, remote sensing can greatly improve the estimation of precipitation, but less improve that of temperature. However, I think further improvements are needed on the section of discussion and English grammars. So, I suggest a further revision before acceptance for publication.
Specific comments:
- Lines 234-237, I think these sentences should be placed in the head of the section 3.1.
- Lines 241-247, why there is 1° overlap area?? Because China is out of the range of 50°S to 50°N?? If so, I think you should pointed out the ranges of China.
- Lines 256-257: Why you choose the model with the highest average R2 value instead of the other metrics such as AIC??
- Lines 356-357, I am confused with what you mean! I am not sure how you test the performance of ChinaClim_baseline.
- Line 379, In the first two paragraph of section discussion, you just emphasize that ChinaClim_baseline performs better than the others, but I think you should emphasize more on the implication, especially for the temperature of ChinaClim_baseline. For example, what the effects for overestimation or underestimation the precipition or temperature in the areas with low-density weather stations during growing season??
- Line560ï¼Do you mean ChinaClim_baseline??
- Lines 581-584, How your results proved Peng’s climate surface and CHELSAcruts datasets, relying on coarse CRU anomaly and high-quality baseline climatology surfaces with CAI method, had relatively high accuracy (high R2)
- Lines 579-587, It seems that authors want to prove CAI is very suitable for estimating precipitation and temperature, however, they have not estimate these data with the other methods and then compared them with the other method.
- Lines 592-596, what about the improvements in the performance of temperature related variables.
- In this paper, by using a brand new baseline climatology surfaces and remote sensing products, authors have generated 1 km Monthly Precipitation and Temperatures Dataset for China from 1952 to 2019. Thus, except one paragraph to discuss the impacts of ChinaClim_baseline, one more paragraph is suggested to emphasize the importance of remote sensing data.
- All the figures are not very clear, especially that the fonts are too small to recognize.
- Further improvement in English is needed. For example, temperate continental is not a noun, but an adjective, please correct it throughout the manuscript.
Citation: https://doi.org/10.5194/essd-2020-361-RC1 -
RC2: 'Comment on essd-2020-361', Anonymous Referee #2, 31 Jul 2021
Some major concerns on the manuscript titled “1 km Monthly Precipitation and Temperatures Dataset for China from 1952 to 2019 based on a Brand-New and High-Quality Baseline Climatology Surface” are listed below:
First, except for combining satellite-based precipitation and tempeature data, it is hard to tell the novelty of this study in terms of methodology for creating the 1-km monthly datasets, especially given there are already some datasets available for ecological, hydrological studies etc.
Second, the method for generating “ChinaClim_timeseries” is questionable or, to some degree, wrong. It is unthinkable that the authors used “ChinaClim_baseline” to obtain the anomaly time series. Why not use the 30-years mean normal from weather stations to derive the anomaly time series? The method described in Section 3.2 and Fig. 3 is incomprehensible.
Third, the evaluation on the accuracy of ChinaClim_baseline and ChinaClim_timeseries is also questionable. Because the weather stations used to evaluate ChinaClim_baseline and ChinaClim_timeseries are different from those used to evaluate the WorldClim2 and CHELSA, it is hard to infer that the quality of ChinaClim_baseline and ChinaClim_timeseries is better than those of compared datasets, respectively.
Fourth, the determination coefficient R2, MAE and RMSE are used to compare the accuracy of ChinaClim_baseline and ChinaClim_timeseries with those of other datasets. Even thought the values of R2 is slightly higher while the values of MAE and RMSE are slightly lower for the newly-created datasets, are the differences statistically significant? If not, it just suggests there are no significant differences between the newly-created datasets and those existing ones.
Fifth, the colors used for creating Figure 5, 7, 9, and 11 are bad. The divergent or sequential colors had better be used correctly to map the data. For example, red color is good for high values and blue color is good for low values.
Sixth, time-series of daily temperature and precipitation data are highly valuable for hydrological and ecological studies. From the current version of the newly-created monthly datasets, it is difficult to see the significance of the datasets, at least for hydrological studies.
Seventh, the method for creating ChinaClim_baseline is not very clear. The step 5 (on page 11), i.e.,”(5) Repeat steps 2 to 4 for 10 times, and final baseline climatology surface (ChinaClim_baseline) was created by averaging ten surfaces” means that the nine-folds weather station data used as training data will vary as the process repeats. For each repeat, did you evaluate the accuracy of model formulations for each month?
Eighth, is there any overfitting problem when creating the so-called brand-new datasets?
Citation: https://doi.org/10.5194/essd-2020-361-RC2
Data sets
A Brand-New and High-Quality Baseline Climatology Surface for China (ChinaClim_baseline) H. Gong https://doi.org/10.5281/zenodo.4287824
1 km Monthly Precipitation Dataset for China from 1952 to 2019 (ChinaClim_timeseries) H. Gong https://doi.org/10.5281/zenodo.4288388
1 km Monthly Maximum Temperature Dataset for China from 1952 to 2019 (ChinaClim_timeseries) H. Gong https://doi.org/10.5281/zenodo.4288390
1 km Monthly Minimum Temperature Dataset for China from 1952 to 2019 (ChinaClim_timeseries) H. Gong https://doi.org/10.5281/zenodo.4288392
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
1,895 | 471 | 82 | 2,448 | 100 | 88 | 129 |
- HTML: 1,895
- PDF: 471
- XML: 82
- Total: 2,448
- Supplement: 100
- BibTeX: 88
- EndNote: 129
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Haibo Gong
Xueqiao Xiang
Huiyu Liu
Xiaojuan Xu
Fusheng Jiao
Zhenshan Lin
This preprint has been withdrawn.
- Preprint
(2422 KB) - Metadata XML
-
Supplement
(71 KB) - BibTeX
- EndNote
As authors said high quality and resolution baseline climatology along with long time series climate data are very important for multiple fields in climatological, ecological, hydrological, and environmental sciences. This study has generated a high quality ChinaClim_baseline based on lots of weather stations and remote sensing data, and then 1km ChinaClim_timeseries based on ChinaClim_baseline and remote sensing data. Compared with the previous climate datasets, this study really improved the estimation accuracy, especially in the areas with low-density weather stations and during April to October, where and when are usually hard to improve the estimation. More interesting, this study found that high quality baseline climatology can greatly improve the estimation of temperature, but less improve that of precipitation. In contrast, remote sensing can greatly improve the estimation of precipitation, but less improve that of temperature. However, I think further improvements are needed on the section of discussion and English grammars. So, I suggest a further revision before acceptance for publication.
Specific comments: