1 km Monthly Precipitation and Temperatures Dataset for China from 1952 to 2019 based on a Brand-New and High-Quality Baseline Climatology Surface

Gong, Haibo; Xiang, Xueqiao; Liu, Huiyu; Xu, Xiaojuan; Jiao, Fusheng; Lin, Zhenshan

doi:10.5194/essd-2020-361

Preprints

https://doi.org/10.5194/essd-2020-361

08 Jan 2021

Status: this preprint has been withdrawn by the authors.

Abstract. Long-term climate data and high-resolution, high-quality baseline climatology surfaces are essential to many fields in the climatological, ecological, hydrological, and environmental sciences. Here, we created a brand-new baseline climatology surface (ChinaClim_baseline) and developed a 1 km monthly precipitation and temperatures dataset for China covering 1952–2019 (ChinaClim_timeseries). A thin plate spline (TPS) algorithm, fitted for each month with different model formulations that account for satellite-driven products, was used to generate ChinaClim_baseline and the monthly climate anomaly surfaces. Climatologically aided interpolation (CAI) was then used to superimpose the monthly anomaly surfaces on ChinaClim_baseline to generate ChinaClim_timeseries. ChinaClim_baseline performed very well. For precipitation, all R² values exceeded 0.860, and the RMSEs and MAEs were 8.149~21.959 mm and 2.787~14.125 mm, respectively. The temperature elements had an average R² of 0.967~0.992, an average MAE of 0.321~0.785 °C, and an average RMSE of 0.485~1.233 °C across all months. ChinaClim_baseline performed much better than WorldClim2 and CHELSA, and there were many spatial discrepancies among those surfaces, especially in summer months and in regions with low-density weather stations in the temperate continental regions and the high, cold Tibetan Plateau. For ChinaClim_timeseries, precipitation had an average R² of 0.699~0.923, an average RMSE of 7.449~56.756 mm, and an average MAE of 4.263~40.271 mm across all months. The temperature elements had an average R² of 0.936~0.985, an average RMSE of 0.807~1.766 °C, and an average MAE of 0.548~1.236 °C across all months. Compared with Peng's climate surface and CHELSAcruts, R² increased by approximately 6 % and RMSE and MAE decreased by approximately 15 % for precipitation; R² for the temperatures did not change appreciably, but RMSE and MAE decreased by 8.37~34.02 %. ChinaClim_timeseries captured interannual variations much better than the other datasets, thanks to ChinaClim_baseline and the satellite-driven products. However, ChinaClim_baseline did not significantly improve the accuracy of precipitation estimation, although it greatly improved the accuracy of temperature estimation; the satellite-driven TRMM3B43 anomaly greatly improved the accuracy of precipitation estimation after 1998, whereas the LST anomaly did not effectively improve the accuracy of temperature estimation. ChinaClim_baseline can be used as an excellent baseline climatology surface for obtaining high-quality, long-term climate datasets from past to future. ChinaClim_timeseries, with its 1 km spatial resolution and its basis in ChinaClim_baseline, is well suited to investigating spatial-temporal climate changes and their impacts on eco-environmental systems in China. ChinaClim_baseline is available at https://doi.org/10.5281/zenodo.4287824 (Gong, 2020a), ChinaClim_timeseries of precipitation is available at https://doi.org/10.5281/zenodo.4288388 (Gong, 2020b), ChinaClim_timeseries of maximum temperature is available at https://doi.org/10.5281/zenodo.4288390 (Gong, 2020c), and ChinaClim_timeseries of minimum temperature is available at https://doi.org/10.5281/zenodo.4288392 (Gong, 2020d).
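
The CAI step described in the abstract can be illustrated with a minimal sketch. It assumes additive anomalies (the abstract does not say how precipitation anomalies are formed), stations that sit exactly on grid cells, and inverse-distance weighting as a plain stand-in for the thin plate spline the authors report using. The names `idw` and `cai_surface` and the dictionary grid layout are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
import math

def idw(x, y, points, power=2.0):
    """Inverse-distance-weighted value at (x, y) from (px, py, value) triples.
    Stand-in interpolator (assumption); the preprint reports using TPS."""
    num = den = 0.0
    for px, py, value in points:
        d = math.hypot(x - px, y - py)
        if d == 0.0:
            return value  # target cell coincides with a station
        w = 1.0 / d ** power
        num += w * value
        den += w
    return num / den

def cai_surface(baseline, station_obs):
    """baseline: {(x, y): climatological value} for every grid cell.
    station_obs: list of (x, y, observed value) for one month.
    Returns {(x, y): baseline + interpolated anomaly}."""
    # Station anomaly = observation minus baseline at the station cell.
    anomalies = [(x, y, obs - baseline[(x, y)]) for x, y, obs in station_obs]
    # Superimpose the interpolated anomaly surface on the baseline.
    return {cell: clim + idw(cell[0], cell[1], anomalies)
            for cell, clim in baseline.items()}

# Illustrative values only (not from the dataset).
baseline = {(0, 0): 10.0, (1, 0): 12.0, (2, 0): 14.0}
month_obs = [(0, 0, 11.0), (2, 0, 13.0)]
surface = cai_surface(baseline, month_obs)
# The middle cell keeps its baseline value because the +1 and -1
# station anomalies are equidistant and cancel.
```

The point of the design is that the interpolator only has to model the smooth anomaly field, while the fine spatial detail comes from the baseline surface.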

Received: 02 Dec 2020 – Discussion started: 08 Jan 2021

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Interactive discussion

Status: closed

RC1: 'Comment on essd-2020-361', Anonymous Referee #1, 07 Feb 2021

As the authors state, a high-quality, high-resolution baseline climatology and long time series of climate data are very important for multiple fields in the climatological, ecological, hydrological, and environmental sciences. This study generated a high-quality ChinaClim_baseline from many weather stations and remote sensing data, and then a 1 km ChinaClim_timeseries based on ChinaClim_baseline and remote sensing data. Compared with previous climate datasets, this study really improved the estimation accuracy, especially in areas with low-density weather stations and during April to October, where and when estimates are usually hard to improve. More interestingly, this study found that a high-quality baseline climatology can greatly improve the estimation of temperature but improves that of precipitation less. In contrast, remote sensing can greatly improve the estimation of precipitation but improves that of temperature less. However, I think the discussion section and the English grammar need further improvement. I therefore suggest a further revision before acceptance for publication.

Specific comments:

Lines 234-237: I think these sentences should be placed at the head of Section 3.1.

Lines 241-247: Why is there a 1° overlap area? Is it because China extends beyond the range of 50°S to 50°N? If so, I think you should point out the range of China.

Lines 256-257: Why did you choose the model with the highest average R² value rather than using other metrics such as AIC?

Lines 356-357: I am confused about what you mean. I am not sure how you tested the performance of ChinaClim_baseline.

Line 379: In the first two paragraphs of the Discussion section, you only emphasize that ChinaClim_baseline performs better than the others, but I think you should place more emphasis on the implications, especially for the temperature of ChinaClim_baseline. For example, what are the effects of overestimating or underestimating precipitation or temperature in areas with low-density weather stations during the growing season?

Line 560: Do you mean ChinaClim_baseline?

Lines 581-584: How do your results prove that Peng’s climate surface and the CHELSAcruts dataset, which rely on the coarse CRU anomaly and high-quality baseline climatology surfaces with the CAI method, had relatively high accuracy (high R²)?

Lines 579-587: It seems that the authors want to prove that CAI is very suitable for estimating precipitation and temperature; however, they have not estimated these data with other methods and compared the results.

Lines 592-596: What about the improvements in the performance of the temperature-related variables?

In this paper, by using a brand-new baseline climatology surface and remote sensing products, the authors have generated a 1 km monthly precipitation and temperatures dataset for China from 1952 to 2019. Thus, in addition to the one paragraph discussing the impacts of ChinaClim_baseline, one more paragraph is suggested to emphasize the importance of remote sensing data.

None of the figures is very clear; in particular, the fonts are too small to read.

Further improvement in English is needed. For example, “temperate continental” is not a noun but an adjective; please correct it throughout the manuscript.
Citation: https://doi.org/10.5194/essd-2020-361-RC1
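
The referee's question about ranking model formulations by R² rather than AIC (Lines 256-257) can be made concrete with a minimal sketch of the two criteria. It assumes R² is the coefficient of determination, 1 − SS_res/SS_tot, and that AIC takes the Gaussian least-squares form n·ln(RSS/n) + 2k up to an additive constant; the manuscript's exact definitions are not given on this page. The function names and the `n_params` argument are illustrative, and the residual sum of squares must be non-zero for the logarithm to be defined.

```python
import math

def r_squared(obs, pred):
    """Coefficient of determination: 1 - SS_res / SS_tot (assumed definition)."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot

def aic_least_squares(obs, pred, n_params):
    """AIC for a least-squares fit with Gaussian errors, up to a constant:
    n * ln(RSS / n) + 2 * k. Requires RSS > 0."""
    n = len(obs)
    rss = sum((o - p) ** 2 for o, p in zip(obs, pred))
    return n * math.log(rss / n) + 2 * n_params

# Illustrative values only: two candidate formulations with identical
# residuals but different numbers of fitted parameters.
obs = [1.0, 2.0, 3.0, 4.0]
pred = [1.0, 2.0, 3.0, 5.0]
simple = aic_least_squares(obs, pred, n_params=1)
complex_ = aic_least_squares(obs, pred, n_params=3)
# R² is the same for both; AIC is lower (better) for the simpler one.
```

R² alone cannot separate two formulations that fit equally well, whereas AIC adds a penalty of 2 per parameter, which is the distinction the comment raises.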
RC2: 'Comment on essd-2020-361', Anonymous Referee #2, 31 Jul 2021

Some major concerns on the manuscript titled “1 km Monthly Precipitation and Temperatures Dataset for China from 1952 to 2019 based on a Brand-New and High-Quality Baseline Climatology Surface” are listed below:

First, except for combining satellite-based precipitation and temperature data, it is hard to tell the novelty of this study in terms of methodology for creating the 1-km monthly datasets, especially given that there are already some datasets available for ecological and hydrological studies, etc.

Second, the method for generating “ChinaClim_timeseries” is questionable or, to some degree, wrong. It is unthinkable that the authors used “ChinaClim_baseline” to obtain the anomaly time series. Why not use the 30-year mean normal from weather stations to derive the anomaly time series? The method described in Section 3.2 and Fig. 3 is incomprehensible.

Third, the evaluation of the accuracy of ChinaClim_baseline and ChinaClim_timeseries is also questionable. Because the weather stations used to evaluate ChinaClim_baseline and ChinaClim_timeseries are different from those used to evaluate WorldClim2 and CHELSA, it is hard to infer that the quality of ChinaClim_baseline and ChinaClim_timeseries is better than that of the respective comparison datasets.

Fourth, the determination coefficient R², MAE, and RMSE are used to compare the accuracy of ChinaClim_baseline and ChinaClim_timeseries with those of other datasets. Even though the values of R² are slightly higher and the values of MAE and RMSE are slightly lower for the newly created datasets, are the differences statistically significant? If not, it just suggests that there are no significant differences between the newly created datasets and the existing ones.

Fifth, the colors used for creating Figures 5, 7, 9, and 11 are bad. Divergent or sequential colors should be used correctly to map the data. For example, red is good for high values and blue is good for low values.

Sixth, time series of daily temperature and precipitation data are highly valuable for hydrological and ecological studies. From the current version of the newly created monthly datasets, it is difficult to see the significance of the datasets, at least for hydrological studies.

Seventh, the method for creating ChinaClim_baseline is not very clear. Step 5 (on page 11), i.e., “(5) Repeat steps 2 to 4 for 10 times, and final baseline climatology surface (ChinaClim_baseline) was created by averaging ten surfaces”, means that the nine folds of weather station data used as training data will vary as the process repeats. For each repeat, did you evaluate the accuracy of the model formulations for each month?

Eighth, is there any overfitting problem when creating the so-called brand-new datasets?

Citation: https://doi.org/10.5194/essd-2020-361-RC2
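
Referee #2's third and fourth concerns suggest evaluating the competing datasets at the same stations and then testing whether the error difference is significant. One possible way to do that is sketched below: a paired sign-flip permutation test on per-station absolute errors. This is an assumption for illustration, not a procedure taken from the manuscript; `paired_sign_flip_test`, its parameters, and the example values are hypothetical.

```python
import random

def paired_sign_flip_test(err_a, err_b, n_perm=5000, seed=0):
    """Two-sided sign-flip permutation test on the mean paired difference.
    err_a, err_b: absolute errors of two datasets at the SAME stations.
    Returns an approximate p-value for the null of no mean difference."""
    rng = random.Random(seed)
    diffs = [a - b for a, b in zip(err_a, err_b)]
    n = len(diffs)
    observed = abs(sum(diffs) / n)
    hits = 0
    for _ in range(n_perm):
        # Under the null, the sign of each paired difference is arbitrary.
        flipped = sum(d if rng.random() < 0.5 else -d for d in diffs)
        if abs(flipped / n) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)

# Illustrative values only: dataset B has a consistently smaller
# absolute error than dataset A at each of 12 shared stations.
errors_a = [1.0] * 12
errors_b = [0.5] * 12
p_value = paired_sign_flip_test(errors_a, errors_b)
```

Pairing by station removes the between-station variability that would otherwise dominate the comparison, which is why the two datasets need to be scored on an identical station set.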

Supplement

https://doi.org/10.5194/essd-2020-361-supplement

Data sets

A Brand-New and High-Quality Baseline Climatology Surface for China (ChinaClim_baseline) H. Gong https://doi.org/10.5281/zenodo.4287824

1 km Monthly Precipitation Dataset for China from 1952 to 2019 (ChinaClim_timeseries) H. Gong https://doi.org/10.5281/zenodo.4288388

1 km Monthly Maximum Temperature Dataset for China from 1952 to 2019 (ChinaClim_timeseries) H. Gong https://doi.org/10.5281/zenodo.4288390

1 km Monthly Minimum Temperature Dataset for China from 1952 to 2019 (ChinaClim_timeseries) H. Gong https://doi.org/10.5281/zenodo.4288392

Viewed

Total article views: 2,687 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
2,059	535	93	2,687	109	109	161

Views and downloads (calculated since 08 Jan 2021)

Month	HTML	PDF	XML	Total
Jan 2021	238	59	2	299
Feb 2021	78	17	4	99
Mar 2021	45	11	0	56
Apr 2021	37	7	1	45
May 2021	48	14	0	62
Jun 2021	45	7	0	52
Jul 2021	48	17	1	66
Aug 2021	91	12	2	105
Sep 2021	19	5	1	25
Oct 2021	21	33	0	54
Nov 2021	36	27	1	64
Dec 2021	31	10	3	44
Jan 2022	28	10	3	41
Feb 2022	27	9	0	36
Mar 2022	29	10	1	40
Apr 2022	22	5	1	28
May 2022	14	10	1	25
Jun 2022	14	5	1	20
Jul 2022	18	2	0	20
Aug 2022	21	7	1	29
Sep 2022	28	6	1	35
Oct 2022	28	3	0	31
Nov 2022	16	7	0	23
Dec 2022	23	9	1	33
Jan 2023	9	9	1	19
Feb 2023	17	2	0	19
Mar 2023	29	4	1	34
Apr 2023	23	4	1	28
May 2023	9	2	1	12
Jun 2023	12	10	2	24
Jul 2023	16	4	0	20
Aug 2023	8	4	1	13
Sep 2023	19	1	20
Oct 2023	11	6	2	19
Nov 2023	10	0	0	10
Dec 2023	15	1	16
Jan 2024	22	3	1	26
Feb 2024	13	12	4	29
Mar 2024	14	8	6	28
Apr 2024	18	3	7	28
May 2024	13	4	5	22
Jun 2024	11	3	1	15
Jul 2024	7	2	6	15
Aug 2024	9	3	3	15
Sep 2024	10	3	0	13
Oct 2024	14	3	0	17
Nov 2024	15	1	0	16
Dec 2024	20	4	0	24
Jan 2025	25	1	0	26
Feb 2025	29	3	1	33
Mar 2025	42	4	2	48
Apr 2025	26	8	3	37
May 2025	33	3	2	38
Jun 2025	29	10	1	40
Jul 2025	25	5	1	31
Aug 2025	57	5	2	64
Sep 2025	266	16	1	283
Oct 2025	21	29	0	50
Nov 2025	31	21	1	53
Dec 2025	29	15	1	45
Jan 2026	68	9	7	84
Feb 2026	19	6	2	27
Mar 2026	10	4	0	14

Viewed (geographical distribution)

Total article views: 2,497 (including HTML, PDF, and XML), of which 2,495 have a defined geography and 2 are of unknown origin.

Latest update: 09 Mar 2026

Short summary

We created a brand-new baseline climatology surface (ChinaClim_baseline) and developed a 1 km monthly precipitation and temperatures dataset for 1952–2019 (ChinaClim_timeseries). ChinaClim_baseline can be used as an excellent baseline climatology surface for obtaining long-term climate datasets. ChinaClim_timeseries, based on ChinaClim_baseline, is suitable for investigating spatial-temporal climate changes and their impacts on eco-environmental systems in China.

