A new daily gridded precipitation dataset for the Chinese mainland based on gauge observations

Han, Jingya; Miao, Chiyuan; Gou, Jiaojiao; Zheng, Haiyan; Zhang, Qi; Guo, Xiaoying

doi:https://doi.org/10.5194/essd-15-3147-2023

Articles | Volume 15, issue 7

https://doi.org/10.5194/essd-15-3147-2023

Articles | Volume 15, issue 7

Data description paper

25 Jul 2023

Data description paper |

| 25 Jul 2023

A new daily gridded precipitation dataset for the Chinese mainland based on gauge observations

Jingya Han, Chiyuan Miao, Jiaojiao Gou, Haiyan Zheng, Qi Zhang, and Xiaoying Guo

Abstract

High-quality, freely accessible, long-term precipitation estimates with fine spatiotemporal resolution play essential roles in hydrologic, climatic, and numerical modeling applications. However, the existing daily gridded precipitation datasets over China are either constructed with insufficient gauge observations or neglect topographic effects and boundary effects on interpolation. Using daily observations from 2839 gauges located across China and nearby regions from 1961 to the present, this study compared eight different interpolation schemes that adjusted the climatology based on a monthly precipitation constraint and topographic characteristic correction, using an algorithm that combined the daily climatology field with a precipitation ratio field. Results from these eight interpolation schemes were validated using 45 992 high-density daily gauge observations from 2015 to 2019 across China. Of these eight schemes, the one with the best performance merges the Parameter-elevation Regression on Independent Slopes Model (PRISM) in the daily climatology field and interpolates station observations into the ratio field using an inverse-distance weighting method. This scheme had median values of 0.78 for the correlation coefficient, 8.8 mm d⁻¹ for the root-mean-square deviation, and 0.69 for the Kling–Gupta efficiency for comparisons between the 45 992 high-density gauge observations and the best interpolation scheme for the 0.1^∘ latitude × longitude grid cells from 2015 to 2019. This scheme had the best overall performance, as it fully considers topographic effects in the daily climatology field and it balances local data fidelity and global fitting smoothness in the interpolation of the precipitation ratio field. Therefore, this scheme was used to construct a new long-term, gauge-based gridded precipitation dataset for the Chinese mainland (called CHM_PRE, as a member of the China Hydro-Meteorology dataset) with spatial resolutions of 0.5, 0.25, and 0.1^∘ from 1961 to the present. This precipitation dataset is expected to facilitate the advancement of drought monitoring, flood forecasting, and hydrological modeling. Free access to the dataset can be found at https://doi.org/10.6084/m9.figshare.21432123.v4 (Han and Miao, 2022).

Download & links

How to cite.

Received: 06 Nov 2022 – Discussion started: 12 Dec 2022 – Revised: 10 Apr 2023 – Accepted: 05 Jun 2023 – Published: 25 Jul 2023

1 Introduction

As one of the key components of the hydrological cycle, precipitation can influence the distribution of water resources (Rodell et al., 2018), sustain agriculture (Beck et al., 2020; Zou et al., 2022), replenish aquifers (Fischer and Knutti, 2016; Kucera et al., 2013), and enable economic prosperity (Trenberth et al., 2003; Kirschbaum et al., 2017). Each of the last 3 decades has been successively warmer at the Earth's surface than any preceding decade since 1850 (IPCC, 2021). With an ever-warming climate, the Earth's water cycle has been amplified, resulting in frequent severe extreme precipitation events (Fischer and Knutti, 2016; Myhre et al., 2018). The intensification of water transport and exchanges between the atmosphere and land surface is having profound impacts on the redistribution of water resources by moisture flux, which exaggerates the contrast between wet and dry meteorological regimes, seasons, and events (Allen and Ingram, 2002; Allan et al., 2020; Han et al., 2021). Constructing a high-quality, long-term daily precipitation dataset is essential for hydrometeorological research (Sun et al., 2018; Beck et al., 2019). However, due to the spatial heterogeneity and temporal variability in daily precipitation, it is challenging to derive accurate spatiotemporal patterns of daily precipitation.

Collection of precipitation data relies mainly on measurements using ground-based rain gauges and estimates using remote-sensing technologies such as weather radar and satellite (Shen et al., 2014; Beck et al., 2019; Sun et al., 2018). Among these approaches, rain-gauge observations are the most reliable and widely used tool for directly measuring precipitation. However, precipitation data measured with gauges are point observations only, and the uneven distribution of gauges increases the limitations of gauge applications over a region. It is vital to interpolate these spatially irregular gauge observations to areal averages, since multiple scientific and operational applications (e.g., estimating local climate variables in data-sparse regions, monitoring climate change at the regional or global scale, and validating climate models with observations) require good-quality, high spatiotemporal-resolution precipitation datasets (Haylock et al., 2008; Xie et al., 2007; Harris et al., 2020). Spatial interpolation methods are usually applied to irregular point observations to produce an evenly distributed precipitation grid for the application in hydrological and meteorological studies (Ahrens, 2006; Schamm et al., 2014; Golian et al., 2019).

In China, the original gauge observations used as the benchmark of various precipitation datasets mainly come from two suites of gauge observations provided by the China Meteorological Administration (CMA): ∼ 700 benchmark stations and ∼2400 gauges comprising other ordinary national automatic weather stations (Shen and Xiong, 2016). Using observations from the former (the ∼700 stations), a monthly precipitation dataset has been established over China that covers the period of 1901–2017 (Peng et al., 2019). To achieve a higher temporal resolution, a daily gridded precipitation dataset has been produced for China using the same raw precipitation data for the same period, from 1961 to 2019 (Qin et al., 2022). Through a fusion of remote-sensing products, reanalysis datasets, and in situ station data, the China Meteorological Forcing Dataset (CMFD) has been produced to serve as a high-resolution (3 h, 0.1^∘ × 0.1^∘) input forcing dataset for hydrological and ecosystem models beginning in 1979 (He et al., 2020). Generally, the quantitative accuracy of a gauge-based dataset can be improved by enhancing the density of gauge observations (Merino et al., 2021; Hofstra and New, 2009). Xie et al. (2007) developed a widely used gauge-based analysis of daily precipitation over East Asia (EA05) with a collection of daily precipitation observations from over 700 stations from CMA and about 1000 hydrological station observations from the Chinese Yellow River Conservation Commission. Using observations from approximately 2400 gauges from 1961 to the present, Wu and Gao (2013) created a daily gridded dataset with a resolution of 0.25^∘ × 0.25^∘ over China (CN05.1), and Zhao et al. (2014) constructed the second version of a 0.5^∘ × 0.5^∘ gridded daily precipitation dataset over China (CMA V2.0). Further accounting for topographic effects, including elevation, slope, proximity to coastlines, and the locations of temperature inversions, Shen et al. (2010) developed the China Gauge-based Daily Precipitation Analysis (CGDPA) with spatial resolutions of 0.5^∘ × 0.5^∘ and 0.25^∘ × 0.25^∘ using a topographic correction algorithm. The aforementioned datasets only involve gauges inside China's boundaries, except for EA05, which covers the East Asia domain. This limitation can lead to boundary effects such that grid cells near the boundaries suffer positioning inaccuracy in relation to interior grid cells (Ahrens, 2006). In addition, different interpolation algorithms can produce different results even with the same inputs. Comparing the performance of different interpolation techniques is crucial to determining the best interpolation method. A summary of these gridded precipitation datasets is shown in Table 1.

Table 1Gauge-based gridded precipitation datasets for China.

Download Print Version | Download XLSX

Given these limitations and the important role these datasets play in many applications, it is of great urgency to establish long-term, continuously updated daily precipitation series with multiple spatial resolutions that are free to use. Therefore, this study aimed to construct a long-term (from 1961 to the present) daily precipitation dataset with different spatial resolutions (0.5^∘ × 0.5^∘, 0.25^∘ × 0.25^∘, and 0.1^∘ × 0.1^∘) based on 2839 gauge observations in and around China (2419 gauges located across the Chinese mainland and 420 gauges from nearby regions). Eight interpolation schemes were considered and validated using 45 992 gauge observations for the period of 2015–2019 over China. Finally, we produced a new gridded precipitation dataset for the Chinese mainland (a member of the China Hydro-Meteorology datasets, hereinafter called CHM_PRE) covering the period 1961–2022, with spatial resolutions of 0.5, 0.25, and 0.1^∘, which is available in the public domain and will be updated yearly.

2 Data

2.1 Raw gauge data used for interpolation

Daily rain-gauge datasets (from 1961 to the present) from 2419 stations across the Chinese mainland and 420 stations just outside China's boundaries were collected from the China Meteorological Administration (CMA, http://data.cma.cn, last access: 24 July 2023) and Global Historical Climatology Network-Daily Version 3 (GHCND, https://www.ncei.noaa.gov, last access: 24 July 2023), respectively (Fig. 1a). The CMA gauge dataset is available for the Chinese mainland (data in Hong Kong, Macao, and Taiwan are currently not accessible for use). Stations are sparsely distributed in northwestern China and the Tibetan Plateau compared with eastern and southern China. The daily precipitation is the accumulated precipitation amount between 20:00 and 20:00 (local time in Beijing). This dataset has been subjected to strict quality controls, including (1) an extreme-values check, (2) an internal consistency check of daily values, (3) a spatial and temporal consistency check, and (4) manual verification (Zhang et al., 2020). The gauge observations from neighboring countries come from the GHCND dataset, which contains records from over 80 000 stations in 180 countries and territories. Quality controls are routinely applied to assure the basic consistency of the dataset (Menne et al., 2012). Stations with less than 5 % of calendar days missing in an individual year were used for interpolation. Changes in the number of stations over time are shown in Fig. 1c.

2.2 High-density gauge observations used for validation

High-density daily observations from nearly 68 000 automatic weather stations for the period 2015–2019 in China are provided by the National Meteorological Information Center of CMA (Li et al., 2018). Once stations with more than 20 % missing data were removed, there were 45 992 good-quality stations available for validation (Fig. 1b).

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f01

Figure 1(a) Distribution of 2839 stations used in interpolation and (b) 45 992 stations used for validation. (c) Quality-controlled number of stations for interpolation over time.

2.3 SRTM-DEM

The 3 arcsec (90 m resolution) digital elevation model (DEM) applied in this study was acquired from the Shuttle Radar Topography Mission (SRTM) data (https://cmr.earthdata.nasa.gov/search/concepts/C1214622194-SCIOPS, last access: 24 July 2023). SRTM uses dual radar antennas to acquire interferometric radar data and process digital topographic data (Farr et al., 2007). We resampled the SRTM-DEM into 0.05^∘ × 0.05^∘ grid cells using the bilinear interpolation method.

2.4 PRISM

The monthly climatology generated by the Parameter-elevation Regression on Independent Slopes Model (PRISM) was used for the monthly precipitation constraint and topographic characteristic correction of the daily climatology field (https://prism.oregonstate.edu/, last access: 24 July 2023). PRISM incorporates local climate–elevation relationships, topographic features, proximity to coastlines, and several measures of terrain complexity, and it is the most widely used climatology dataset in the world (Daly et al., 1994, 2002). The original spatial resolution is 0.04^∘ × 0.04^∘ for the monthly climatology of PRISM between 1961 and 1990; we used bilinear interpolation to regrid the spatial resolution into 0.05^∘ × 0.05^∘ grid cells for adjustment based on climatology.

3 Methodology

3.1 Interpolation scheme

Due to the high spatial variability in precipitation relative to other climate variables, directly interpolating the daily rain-gauge observations into grid cells could produce a dataset with misleading daily precipitation characteristics (Xie et al., 2007; Chen et al., 2002; Shen et al., 2010). To avoid this and reduce introduced errors, the overall strategy for establishing a daily gridded precipitation dataset is to construct a relatively continuous daily climatology field (Shen et al., 2010). Then, we would build an intermediate field of the interpolated variable based on this daily climatology field, such as a daily precipitation anomalies field or a field of the ratio between daily precipitation and daily climatology. Previous studies have demonstrated that interpolating the ratio (between daily precipitation and daily climatology) yields better performance than interpolating anomalies for constructing daily gridded precipitation (Xie et al., 2007; Yatagai et al., 2012; Di Luzio et al., 2008). Therefore, the “daily climatology field (C_d) × field of the ratio between daily precipitation and daily climatology ( $P / C_{d}$ )” was employed as the interpolation scheme for constructing the new gridded precipitation dataset in this study (Fig. 2), as developed by Xie et al. (2007). Figure 3 shows a flowchart of the gridding analysis system.

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f02

Figure 2Interpolation strategy for generating the daily gridded precipitation dataset (CHM_PRE) in this study.

Download

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f03

Figure 3Flowchart of the gridding analysis system. Gray shading represents gauge data; blue shading represents gridded data. Approaches used for generating daily climatology fields (using the ANUSPLIN software or PRISM data) are marked in red, and interpolation methods used for producing ratio fields are marked in green.

Download

3.2 Building the daily climatology field (C_d)

First, the gauge-based climatology of daily precipitation was calculated using gauge observations. The definition of gauge-based climatology of daily precipitation is the Fourier-truncated 30-year mean daily precipitation series produced from gauge observations for the period of 1971–2000 for each of the 365 calendar days (Fig. 4). We used Fourier truncation to remove the high-frequency noise of the 30-year mean daily precipitation series for each station and retained the accumulation of the first six harmonic components as the gauge-based climatology of daily precipitation (Xie et al., 2007). After Fourier truncation, approximately 75 % of all stations preserve a variation of 40 % to 75 % in the truncated mean daily precipitation series relative to the total variation in the mean daily precipitation. The unadjusted 0.05^∘ × 0.05^∘ gridded daily climatology field ( $C_{d_{0}}$ ) was then interpolated from the gauge-based climatology of daily precipitation with SRTM-DEM as a covariate using the ANUSPLIN software (Hutchinson and Xu, 2004). To minimize systematic bias from the unadjusted 0.05^∘ × 0.05^∘ gridded daily climatology field in the monthly climatology field (C_m), the monthly accumulation of the unadjusted 0.05^∘ × 0.05^∘ gridded daily climatology field was then constrained by the monthly climatology field. This produced an adjusted gridded daily climatology field that uses a monthly precipitation constraint and topographic characteristic correction. We compared two types of gridded monthly climatology fields to determine which adjusts better for the systematic bias: (1) an ANUSPLIN-type gridded monthly climatology field or (2) a PRISM-type gridded monthly climatology field. The ANUSPLIN-type gridded monthly climatology field was produced by interpolating monthly precipitation climatology (1971–2000) from stations to the 0.05^∘ latitude × longitude grids with a covariate of SRTM-DEM, using the ANUSPLIN software. The 0.05^∘ × 0.05^∘ regridded monthly climatology of PRISM was used as the PRISM-type monthly climatology field.

The climatology adjustment steps for one grid cell were as follows:

Calculate $C_{d_{0_(m, j)}}$ (m=1, 2, 3, …, 12; j=1, 2, 3, …, 365; m is the corresponding month for day j), which is the monthly total of the unadjusted 0.05^∘ × 0.05^∘ gridded daily climatology field, derived by taking the sum of the unadjusted 0.05^∘ × 0.05^∘ gridded daily climatology field for the month.
Match the monthly total series derived using the unadjusted 0.05^∘ × 0.05^∘ gridded daily climatology field to the gridded monthly climatology field month by month.
Compute the scaling factor SF_(m,j) for the individual calendar day of the unadjusted 0.05^∘ × 0.05^∘ daily climatology field to the gridded monthly climatology field:
$\begin{matrix} (1) & \begin{aligned} {SF}_{(m, j)} \\ = \frac{C_{(m, j)}}{w_{(m - 1, j)} C_{d_{0_(m - 1, j)}} + w_{(m, j)} C_{d_{0_(m, j)}} + w_{(m + 1, j)} C_{d_{0_(m + 1, j)}}} \\ (m = 1, 2, 3, \dots, 11, 12; j = 1, 2, 3, \dots, 365; \\ m is the corresponding month for day j), \end{aligned} \end{matrix}$
where C_(m,j) is the gridded monthly climatology field for the corresponding month m of day j; $C_{d_{0_(m - 1, j)}}$ , $C_{d_{0_(m, j)}}$ , and $C_{d_{0_(m + 1, j)}}$ are the monthly total of months m−1, m, and m+1, respectively, which are calculated from the unadjusted 0.05^∘ × 0.05^∘ gridded daily climatology field; $w_{(m - 1, j)}$ , w_(m,j), and $w_{(m + 1, j)}$ are the corresponding weights for months m−1, m, and m+1, respectively, which are inversely proportional to the interval between the calendar day j and the center of the month (Xie et al., 2007). Note that the weight $w_{(m - 1, j)}$ is 0 when m=1 and so is the weight $w_{(m + 1, j)}$ when m=12.
The adjusted gridded daily climatology field ( $C_{d_{(m, j)}}$ ) is defined as
$\begin{matrix} (2) & \begin{aligned} C_{d_{(m, j)}} = C_{d_{0_(m, j)}} {SF}_{(m, j)} \\ (m = 1, 2, 3, \dots, 11, 12; j = 1, 2, 3, \dots, 365; \\ m is the corresponding month for day j) . \end{aligned} \end{matrix}$

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f04

Figure 4Time series of 30-year (1971–2000) mean daily precipitation (blue lines) and the gauge-based climatology of daily precipitation derived using Fourier truncation (black lines) for three randomly selected stations in southern China (a), northeastern China (b), and northwestern China (c).

Download

3.3 Constructing a field of the ratio between daily precipitation and daily climatology ( $P / C_{d}$ )

The ratio of daily precipitation to the spatiotemporally corresponding daily climatology field was calculated for each station. To compare the performances of different interpolation methods, four widely used interpolation approaches for precipitation were adopted. The four interpolation methods used in this study were angular-distance weighting (ADW) (Shepard, 1968; Caesar et al., 2006), inverse-distance weighting (IDW) (Shepard, 1984; Eischeid et al., 2000), thin plate spline (TPS) (Hutchinson, 1995; Camera et al., 2014), and triangulation-based nearest neighbor interpolation (TNNI) (Thiessen, 1911; Sibson, 1978). A brief overview of the main characteristics of the four methods is given below.

3.3.1 ADW

The ADW interpolation method used for this study was the modified Shepard's algorithm, which introduces the concept of correlation decay distance (CDD), also called correlation length scale or decorrelation length (Shepard, 1984; Dunn et al., 2020). The CDD is defined as the distance at which the correlation between one station and all other stations decays below $1 / e$ , approximately corresponding to the significance level of 0.05 for the correlation within large samples (Jones et al., 1997; Harris et al., 2020). The number of stations for interpolating the target grid cell is well constrained by the CDD, thus improving the interpolation precision (New et al., 2000; Mitchell and Jones, 2005; Hofstra and New, 2009).

For every station, correlations (r) and distances (x) with the other 2838 stations are shown in Fig. 5, and the ordinary least-squares method was used to fit an exponential decay function:

\begin{matrix} (3) & r = e^{- x / CDD} . \end{matrix}

The estimated CDD is 244.7 km (95 % confidence interval: 244.5–244.8 km) at the 0.05 significance level.

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f05

Figure 5Estimation of correlation decay distance (CDD) for daily precipitation series for all stations in the interpolated domain. Black points show the distance–correlation pair for each station. The blue line is the exponential curve fitted to the data by ordinary least squares. The red dashed line marks where correlation equals $1 / e$ .

Download

The ADW method accounts for the importance of both distance and the isolation of stations in interpolation (New et al., 2000). Only stations within the range of the CDD for the center of the target grid cell (L) were involved in the interpolation. The weight for each involved station (i) is a function of the distance weight (D_i) and angular weight (A_i):

\begin{matrix} (4) & D_{i} = {(e^{- x_{i} / CDD})}^{n}, \end{matrix}

where x_i is the distance between station i and the center of target grid cell L; n is a constant and usually set to 4, in accordance with previous studies (Harris et al., 2020; Dunn et al., 2020; Efthymiadis et al., 2006).

\begin{matrix} (5) & A_{i} = 1 + \frac{\sum_{k} D_{k} [1 - \cos (θ_{k} - θ_{i})]}{\sum_{k} D_{k}} (i \neq k), \end{matrix}

where k represents the surrounding stations relative to station i; D_k is the distance weight for the surrounding stations k; and θ_i and θ_k are the angles relative to the north of the center of target grid cell L for station i and the surrounding stations k. Here, “surrounding stations” refers to all other stations except for station i. Finally, the weights for all contributing stations were standardized to sum to 1.0.

The angular-distance weight (W_i) equals

\begin{matrix} (6) & W_{i} = D_{i} A_{i} . \end{matrix}

3.3.2 IDW

We introduced the concept of CDD to the IDW method for interpolation. We set CDD1 = 244.7 km, which represents the boundary at which the search radius has a good correlation between stations and the target grid cell. The minimum distance satisfying the condition that at least three stations are included in the search radius is around 1336 km. Therefore, CDD2 = 1336 km was used as a second choice of search radius if there were not at least three stations located within the range of CDD1. We employed S₀ representing the target grid cell, i representing the surrounding stations that fall in the search radius of the target grid cell, y(s_i) denoting the station observations, and d_0i denoting the distance between S₀ and i. The estimation of daily precipitation in grid cell S₀ is $\hat{y} (s_{0})$ :

\begin{matrix} (7) & \hat{y} (s_{0}) = \sum_{i = 1}^{n} λ_{i} y (s_{i}), \\ (8) & λ_{i} = d_{0_{i}}^{- α} / \sum_{i}^{n} d_{0 i}^{- α}, \\ (9) & \sum_{i}^{n} λ_{i} = 1, \end{matrix}

where λ_i is the distance weight for the interpolated stations; n is the number of stations involved in the interpolation; and the parameter α is the geometric form of weight. A high magnitude of α represents a strong correlation decay per unit of distance; thus the stations that are close to the target grid cell would be assigned a greater weight (Lu and Wong, 2008). In this study, we used α=2, which is widely used for the IDW method to show the Euclidean distance between the centers of grid cells and the interpolated stations (Ly et al., 2013; Ahrens, 2006).

Table 2An overview of eight combination schemes for daily climatology fields and ratio fields.

Download Print Version | Download XLSX

3.3.3 TPS

Splines are developed with the use of spatial covariate functions (Wahba and Wendelberger, 1980; Camera et al., 2014). TPS regards the spatial distribution as simply a function of observations, and there is no need to first estimate a covariate function (Hutchinson, 1995). Thus, the interpolation precision is improved. The TPS function offers a trade-off between data fidelity and smoothness of fit (Tait et al., 2006; Haylock et al., 2008). The degree of smoothing is determined by minimizing generalized cross validation (Hutchinson, 1998).

3.3.4 TNNI

The Delaunay triangulation net is built by location of vertices of triangulations with rainfall amount as the third dimension (Delaunay, 1934). Within finite point sets, the Delaunay triangulation is demonstrated to be the only optimal method (Sibson, 1978). This uniqueness guarantees the stability of interpolation. TNNI estimates the value of the target grid cell as a value of the nearest sample, and this can reflect the characteristic of regional precipitation (Vivoni Enrique et al., 2004).

3.4 Eight combination schemes for the daily climatology field and ratio field

The two types of daily climatology fields and four types of ratio fields constitute eight combination schemes of interpolation strategies (Table 2). We compared the performances of the eight combination schemes to choose the best scheme to construct the dataset.

3.5 Validation

To improve the validation efficiency, 45 992 high-density gauge observations from China were used to evaluate the eight interpolation schemes with a spatial resolution of 0.1^∘ × 0.1^∘. The validation steps are as follows:

Remove the 2839 gauge stations used for interpolation from the 45 992 stations; the observations of the remaining stations are employed as the “true values”.
Distribute the remaining stations into the corresponding 0.1^∘ × 0.1^∘ grid cells according to their longitudes and latitudes.
Apply the validation only for the grid cells where stations are located. The average of station observations is calculated as the validation value for the 0.1^∘ × 0.1^∘ grid cells where multiple stations are located. The correlation coefficient (CC), root-mean-square error (RMSE), and Kling–Gupta efficiency (KGE) between validation value (V_n) and estimation value (Y_n) are used as evaluation indicators for validation:
$\begin{matrix} (10) & CC = \frac{\frac{1}{N} \sum_{n = 1}^{N} (V_{n} - \bar{V}) (Y_{n} - \bar{Y})}{σ_{V} σ_{Y}}, \\ (11) & RMSE = \sqrt{\frac{1}{N} \sum_{n = 1}^{N} {(V_{n} - Y_{n})}^{2}}, \\ (12) & KGE = 1 - \sqrt{{(CC - 1)}^{2} + {(α - 1)}^{2} + (β - 1)^{2}}, \\ (13) & α = \frac{σ_{Y}}{σ_{V}}, β = \frac{\bar{Y}}{\bar{V}}, \end{matrix}$
where N is the length of the daily precipitation series for 2015–2019; σ_V and $\bar{V}$ are the standard deviation and mean value for the validated daily precipitation series, respectively; σ_Y and $\bar{Y}$ are the standard deviation and mean value for the estimated daily precipitation series, respectively; α is a variability bias term; and β is a measure of mean bias (Gupta et al., 2009).

4 Results and discussion

4.1 Best interpolation scheme derived by validation

Generally, the performances of CC, RMSE, and KGE in southeast China were all better than those in northwest China and the Tibetan Plateau, where the density of stations is relatively sparse (Figs. 6, 7, 8). The interpolated grid values were closer to gauge observations in simple terrain (e.g., the North China Plain) compared with complex terrain (e.g., the Yungui Plateau, the Loess Plateau, and the Tibetan Plateau). In terms of CC, the performances of scheme 3 (ANUSPLIN + IDW) and scheme 4 (PRISM + IDW) were best among the eight schemes, both with median CC values of 0.78. The median CC values for scheme 7 (ANUSPLIN + TNNI; 0.76) and scheme 8 (PRISM + TNNI; 0.76) were a little smaller than those for schemes 3 and 4. Scheme 1 (ANUSPLIN + ADW) and scheme 2 (PRISM + ADW) shared the same median CC value of 0.71. The lowest median CC value (0.63) was calculated for scheme 5 (ANUSPLIN + TPS) and scheme 6 (PRISM + TPS). For the median RMSE values, the order of the eight interpolation schemes was as follows: scheme 4 (8.8 mm d⁻¹) < scheme 3 (8.83 mm d⁻¹) < scheme 1 (10.09 mm d⁻¹) = scheme 2 (10.09 mm d⁻¹) < scheme 7 (10.14 mm d⁻¹) < scheme 8 (10.23 mm d⁻¹) < scheme 5 (11.16 mm d⁻¹) < scheme 6 (11.17 mm d⁻¹). The results for the comprehensive index, KGE, combining the characteristics of correlation, variability bias, and mean bias show that using scheme 4 would obtain the best interpolation results, at a median KGE value of 0.69 for China. Schemes 3, 7, and 8 performed slightly worse than scheme 4, all with the same value of 0.68. The median KGE values for scheme 1 (0.56) and scheme 2 (0.57) were a little worse than those listed above. Using scheme 5 and scheme 6 would result in the worst performance based on KGE values; their median KGE values were about 0.49 and 0.5, respectively. Overall, using scheme 4, which applies PRISM monthly climatology to adjust the daily climatology field, combined with an IDW-interpolated ratio field, achieved the best performance among these validation indices. Therefore, the best scheme (PRISM + IDW) was used to construct the new 62-year CHM_PRE dataset for the Chinese mainland. The gridded data are the areal average precipitation over the grid cell.

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f06

Figure 6(a–h) Spatial pattern of the correlation coefficient (CC) for eight combination schemes. Numbers in each subplot represent the median of CC values across all grid cells involved in validation.

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f07

Figure 7The same as Fig. 6 but for root-mean-square error (RMSE). The unit is millimeters per day.

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f08

Figure 8The same as Fig. 6 but for Kling–Gupta efficiency (KGE).

Scheme 4 had better performance than the other schemes because it considers the impact of topography more deeply and holds an appropriate balance between local data fidelity and global fitting smoothness. The overall interpolation strategy was to combine the daily climatology field (C_d) with the field of the ratio between daily precipitation and daily climatology ( $P / C_{d}$ ). With respect to C_d, the PRISM-type daily climatology field incorporates topographic features, proximity to coastlines, and several measures of terrain complexity, which goes beyond the climate–elevation relationships that the ANUSPLIN-type daily climatology field considers. As for $P / C_{d}$ , we selected the four alternative interpolation methods (ADW, IDW, TPS, and TNNI) to consider the balance between local data fidelity and global fitting smoothness in addition to the popularity, authority, and simplicity of the interpolation methods. Specifically, the ADW and IDW methods were chosen due to their high local data fidelity. Both are local interpolation methods (Liszka, 1984). Unlike the IDW method, the ADW method assigns a tiny weight to far distant gauge observations to promote global fitting smoothness. This impacts the local accuracy of interpolation. The TPS and TNNI methods, on the other hand, were chosen for their high global fitting smoothness. Both TPS and TNNI are global interpolation methods (Liszka, 1984). The TPS method is based on a mathematical model for surface estimation that fits a minimum-curvature surface through all input points, while TNNI constructs a Delaunay triangulation of three stations locations. So TNNI tends to assign more weights to maintain local data fidelity but has weaker fitting smoothness. To sum up, the combination of PRISM-type C_d and IDW-type $P / C_{d}$ yielded the best performance among the selected schemes. This was not simply due to chance. This best-performing interpolation scheme could be applied in other regions, but further validation would be needed to confirm whether it is the best-performing interpolation scheme there.

Due to a high rate of missing daily observations from 1961 to 1980, up to 15 % of stations could not reach the threshold for quality control (i.e., a rate of missing daily precipitation not greater than 5 %) and were removed. With the vigorous development of hydrometeorological observation in China since the 1980s, precipitation data quality has been improving, which caused a jump in the number of stations that met quality control requirements beginning in 1981 (Shen et al., 2014). If we had kept the number of gauges used for interpolation steady over the full 62-year span, about 300 gauges available for the period 1981–2022 would have been excluded, which would have been a great loss of real observed precipitation information. Therefore, the strategy adopted in this study was that all observational data that met the quality control conditions were used for data interpolation, which led to some differences in the number of sites used every year. These slight differences could be partly compensated for by using correlation decay distance (CDD1 and CDD2), which confirmed there were at least three stations involved in the interpolation for each grid cell so that there would not be a sharp change in the number of stations used for interpolation for each grid cell.

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f09

Figure 9Monthly precipitation series from 1 January 2008 to 31 December 2015 for the CGDPA, CN05.1, CMA V2.0, and CHM_PRE datasets.

Download

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f10

Figure 10Spatial pattern of average annual wet-day (>1 mm d⁻¹) precipitation amount during the period of 2008 to 2015 for the (a) CGDPA, (b) CN05.1, (c) CMA V2.0, and (d) CHM_PRE datasets.

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f11

Figure 11Spatial pattern of mean annual wet-day (>1 mm d⁻¹) frequency from 2008 to 2015 for the (a) CGDPA, (b) CN05.1, (c) CMA V2.0, and (d) CHM_PRE datasets.

https://essd.copernicus.org/articles/15/3147/2023/essd-15-3147-2023-f12

Figure 12Spatial pattern of mean Rx1day from 2008 to 2015 for the (a) CGDPA, (b) CN05.1, (c) CMA V2.0, and (d) CHM_PRE datasets.

4.2 Comparison with other gauge-based datasets

We compared the performance of the CHM_PRE dataset with three other datasets – CGDPA (Shen et al., 2010), CN05.1 (Wu and Gao, 2013), and CMA V2.0 (Zhao et al., 2014) (Figs. 9–12). To derive a uniform time span among the different datasets, monthly precipitation series for the period of 1 January 2008 to 31 December 2015, were calculated in the CHM_PRE, CGDPA, CN05.1, and CMA V2.0 datasets. Results showed that the temporal pattern of the monthly precipitation series was generally consistent among different datasets, with a maximum bias of 5 mm per month for the dry (December–January–February) and wet (June–July–August) seasons (Fig. 9). The average annual wet-day (>1 mm d⁻¹) precipitation amount and frequency between 2008 and 2015 for different datasets shared similar spatial patterns, with a general decrease from southeastern China to northwestern China (Figs. 10, 11). The median differences in the multi-year annual wet-day precipitation amount across all grid cells for CHM_PRE − CGDPA, CHM_PRE − CN05.1, and CHM_PRE − CMA V2.0 were −24.79, −7.43, and 13.87 mm yr⁻¹, respectively. The multi-year annual wet-day frequency was higher in CGDPA and CMA V2.0 than in the CHM_PRE dataset, with median differences of 6.38 and 3.25 d yr⁻¹ across grid cells, respectively. The mean annual wet-day frequency in CN05.1 was 9.63 d yr⁻¹ less than in the CHM_PRE dataset. The broad features of mean annual maximum 1 d precipitation amount (Rx1day) were comparable among the four datasets from 2008 to 2015 (Fig. 12). The 8-year average Rx1day values for China were 48.19, 35.29, 39.72, and 40.19 mm d⁻¹ for the CGDPA, CN05.1, CMA V2.0, and CHM_PRE datasets, respectively. Generally, the differences in spatial pattern among different datasets were a combined effect of the gauge density involved and whether or not orographic effects and boundary effects were considered in the interpolation algorithm. The spatial patterns of mean precipitation (Figs. 10, 11) and extreme precipitation (Fig. 12) agreed well among the different datasets in eastern and southern China, where gauge density is relatively high. This indicates that the density of input gauges could be a dominant factor affecting interpolation output (Morrissey et al., 1995). Agreement in spatial patterns of mean and extreme precipitation was poorer in northwestern China and the Tibetan Plateau, which was driven by interpolation algorithms. In particular, the heavy precipitation in the southern Tibetan Plateau was well captured by the CGDPA and CHM_PRE datasets, but the CN05.1 and CMA V2.0 datasets failed to capture it. This suggests the importance of orographic effects and boundary effects in interpolation processes because heavier rainfall appears over mountainous regions than nearby plains.

5 Data availability

This high-resolution long-term gauge-based daily precipitation dataset covers the period of 1961–2022, and it will continue to be updated annually. It contains data for three spatial resolutions: 0.1^∘ × 0.1^∘, 0.25^∘ × 0.25^∘, and 0.5^∘ × 0.5^∘ covering the domain of 18–54^∘ N, 72–136^∘ E. The NetCDF-formatted output files of the CHM_PRE dataset are freely accessible at https://doi.org/10.6084/m9.figshare.21432123.v4 (Han and Miao, 2022).

6 Conclusions

Based on a recent 62-year time series of daily observations from 2839 gauges across the Chinese mainland and the areas just outside China's boundaries, this study compared eight different interpolation schemes that used an algorithm combining the daily climatology field with a precipitation ratio field. A validation method was used to evaluate the eight interpolation schemes using 45 992 high-density gauge observations from China. The results indicate that the best-performing scheme was scheme 4, which combined a monthly precipitation constraint and correction for topographic characteristics with the daily climatology field and interpolated station observations of precipitation ratio into grid cells using an inverse-distance weighting method. The median CC, RMSE, and KGE values for the interpolation scheme that performed the best among the selected metrics (in comparison with the high-density gauge observations used for validation) were 0.78, 8.8 mm d⁻¹, and 0.69, respectively. Using the best-performing interpolation scheme, we constructed a new gridded precipitation dataset (CHM_PRE) for the Chinese mainland with a daily temporal resolution and at multiple spatial resolutions (0.1^∘ × 0.1^∘, 0.25^∘ × 0.25^∘, and 0.5^∘ × 0.5^∘) for the period of 1961–2022. The CHM_PRE dataset showed reliable quality compared with other available precipitation products.

Author contributions

JH and CM contributed to designing the research; JH implemented the research and wrote the original draft; CM and JG supervised the research; all co-authors revised the paper and contributed to the writing.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Acknowledgements

We would like to express our thanks for the high-performance computing support from the Center for Geodata and Analysis, Faculty of Geographical Science, Beijing Normal University (https://gda.bnu.edu.cn/, last access: 24 July 2023). We are also grateful to the National Meteorological Information Center of the China Meteorological Administration (NMIC, http://data.cma.cn, last access: 24 July 2023) for providing the observed climate data.

Financial support

This research was supported by the National Natural Science Foundation of China (grant no. 42041006), the Second Tibetan Plateau Scientific Expedition and Research Program (STEP) (grant no. 2019QZKK0405), and the State Key Laboratory of Earth Surface Processes and Resource Ecology (grant no. 2022-ZD-03).

Review statement

This paper was edited by Chunlüe Zhou and reviewed by two anonymous referees.

References

Ahrens, B.: Distance in spatial interpolation of daily rain gauge data, Hydrol. Earth Syst. Sci., 10, 197–208, https://doi.org/10.5194/hess-10-197-2006, 2006.

Allan, R. P., Barlow, M., Byrne, M. P., Cherchi, A., Douville, H., Fowler, H. J., Gan, T. Y., Pendergrass, A. G., Rosenfeld, D., Swann, A. L. S., Wilcox, L. J., and Zolina, O.: Advances in understanding large-scale responses of the water cycle to climate change, Ann. NY Acad. Sci., 1472, 49–75, https://doi.org/10.1111/nyas.14337, 2020.

Allen, M. R. and Ingram, W. J.: Constraints on future changes in climate and the hydrologic cycle, Nature, 419, 228–232, https://doi.org/10.1038/nature01092, 2002.

Beck, H. E., Wood, E. F., Pan, M., Fisher, C. K., Miralles, D. G., van Dijk, A. I. J. M., McVicar, T. R., and Adler, R. F.: MSWEP V2 global 3-hourly 0.1^∘ precipitation: methodology and quantitative assessment, B. Am. Meteorol. Soc., 100, 473–500, https://doi.org/10.1175/BAMS-D-17-0138.1, 2019.

Beck, H. E., Westra, S., Tan, J., Pappenberger, F., Huffman, G. J., McVicar, T. R., Gründemann, G. J., Vergopolan, N., Fowler, H. J., Lewis, E., Verbist, K., and Wood, E. F.: PPDIST, global 0.1^∘ daily and 3-hourly precipitation probability distribution climatologies for 1979–2018, Sci. Data, 7, 1–12, https://doi.org/10.1038/s41597-020-00631-x, 2020.

Caesar, J., Alexander, L., and Vose, R.: Large-scale changes in observed daily maximum and minimum temperatures: Creation and analysis of a new gridded data set, J. Geophys. Res.-Atmos., 111, D05101, https://doi.org/10.1029/2005JD006280, 2006.

Camera, C., Bruggeman, A., Hadjinicolaou, P., Pashiardis, S., and Lange, M. A.: Evaluation of interpolation techniques for the creation of gridded daily precipitation (1×1 km²); Cyprus, 1980–2010, J. Geophys. Res.-Atmos., 119, 693–712, https://doi.org/10.1002/2013JD020611, 2014.

Chen, M., Xie, P., Janowiak, J. E., and Arkin, P. A.: Global Land Precipitation: A 50-yr Monthly Analysis Based on Gauge Observations, J. Hydrometeorol., 3, 249–266, https://doi.org/10.1175/1525-7541(2002)003<0249:GLPAYM>2.0.CO;2, 2002.

Daly, C., Neilson, R. P., and Phillips, D. L.: A statistical-topographic model for mapping climatological precipitation over mountainous terrain, J. Appl. Meteorol., 33, 140–158, https://doi.org/10.1175/1520-0450(1994)033<0140:ASTMFM>2.0.CO;2, 1994.

Daly, C., Gibson, W., Taylor, G. H., Johnson, G. L., and Pasteris, P. P.: A knowledge-based approach to the statistical mapping of climate, Clim. Res., 22, 99–113, https://doi.org/10.3354/cr022099, 2002.

Delaunay, B.: Sur la sphere vide, Izv. Akad. Nauk SSSR, Otdelenie Matematicheskii i Estestvennyka Nauk, 7, 1–2, 1934 (in French).

Di Luzio, M., Johnson, G. L., Daly, C., Eischeid, J. K., and Arnold, J. G.: Constructing retrospective gridded daily precipitation and temperature datasets for the conterminous United States, J. Appl. Meteorol. Climatol., 47, 475–497, https://doi.org/10.1175/2007JAMC1356.1, 2008.

Dunn, R. J. H., Alexander, L. V., Donat, M. G., Zhang, X., Bador, M., Herold, N., Lippmann, T., Allan, R., Aguilar, E., Barry, A. A., Brunet, M., Caesar, J., Chagnaud, G., Cheng, V., Cinco, T., Durre, I., de Guzman, R., Htay, T. M., Wan Ibadullah, W. M., Bin Ibrahim, M. K. I., Khoshkam, M., Kruger, A., Kubota, H., Leng, T. W., Lim, G., Li-Sha, L., Marengo, J., Mbatha, S., McGree, S., Menne, M., de los Milagros Skansi, M., Ngwenya, S., Nkrumah, F., Oonariya, C., Pabon-Caicedo, J. D., Panthou, G., Pham, C., Rahimzadeh, F., Ramos, A., Salgado, E., Salinger, J., Sané, Y., Sopaheluwakan, A., Srivastava, A., Sun, Y., Timbal, B., Trachow, N., Trewin, B., van der Schrier, G., Vazquez-Aguirre, J., Vasquez, R., Villarroel, C., Vincent, L., Vischel, T., Vose, R., and Bin Hj Yussof, M. N. A.: Development of an updated global land in situ-based data set of temperature and precipitation extremes: HadEX3, J. Geophys. Res.-Atmos., 125, e2019JD032263, https://doi.org/10.1029/2019JD032263, 2020.

Efthymiadis, D., Jones, P. D., Briffa, K. R., Auer, I., Böhm, R., Schöner, W., Frei, C., and Schmidli, J.: Construction of a 10-min-gridded precipitation data set for the Greater Alpine Region for 1800–2003, J. Geophys. Res.-Atmos., 111, D01105, https://doi.org/10.1029/2005JD006120, 2006.

Eischeid, J. K., Pasteris, P. A., Diaz, H. F., Plantico, M. S., and Lott, N. J.: Creating a serially complete, national daily time series of temperature and precipitation for the western United States, J. Appl. Meteorol., 39, 1580–1591, https://doi.org/10.1175/1520-0450(2000)039<1580:CASCND>2.0.CO;2, 2000.

Farr, T. G., Rosen, P. A., Caro, E., Crippen, R., Duren, R., Hensley, S., Kobrick, M., Paller, M., Rodriguez, E., Roth, L., Seal, D., Shaffer, S., Shimada, J., Umland, J., Werner, M., Oskin, M., Burbank, D., and Alsdorf, D.: The shuttle radar topography mission, Rev. Geophys., 45, RG2004, https://doi.org/10.1029/2005RG000183, 2007.

Fischer, E. M. and Knutti, R.: Observed heavy precipitation increase confirms theory and early models, Nat. Clim. Change, 6, 986–991, https://doi.org/10.1038/nclimate3110, 2016.

Golian, S., Javadian, M., and Behrangi, A.: On the use of satellite, gauge, and reanalysis precipitation products for drought studies, Environ. Res. Lett., 14, 075005, https://doi.org/10.1088/1748-9326/ab2203, 2019.

Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling, J. Hydrol., 377, 80–91, https://doi.org/10.1016/j.jhydrol.2009.08.003, 2009.

Han, J., Miao, C., Duan, Q., Wu, J., Gou, J., and Zheng, H.: Changes in Unevenness of Wet-Day Precipitation Over China During 1961–2020, J. Geophys. Res.-Atmos., 126, e2020JD034483, https://doi.org/10.1029/2020JD034483, 2021.

Han, J. Y. and Miao, C. Y.: A new daily gridded precipitation dataset for the Chinese mainland based on gauge observations, figshare [data set], https://doi.org/10.6084/m9.figshare.21432123.v4, 2022.

Harris, I., Osborn, T. J., Jones, P., and Lister, D.: Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset, Sci. Data, 7, 109, https://doi.org/10.1038/s41597-020-0453-3, 2020.

Haylock, M. R., Hofstra, N., Klein Tank, A. M. G., Klok, E. J., Jones, P. D., and New, M.: A European daily high-resolution gridded data set of surface temperature and precipitation for 1950–2006, J. Geophys. Res.-Atmos., 113, D20119, https://doi.org/10.1029/2008JD010201, 2008.

He, J., Yang, K., Tang, W., Lu, H., Qin, J., Chen, Y., and Li, X.: The first high-resolution meteorological forcing dataset for land process studies over China, Sci. Data, 7, 25, https://doi.org/10.1038/s41597-020-0369-y, 2020.

Hofstra, N. and New, M.: Spatial variability in correlation decay distance and influence on angular-distance weighting interpolation of daily precipitation over Europe, Int. J. Climatol., 29, 1872–1880, https://doi.org/10.1002/joc.1819, 2009.

Hutchinson, M. F.: Interpolating mean rainfall using thin plate smoothing splines, Int. J. Geogr. Inf. Syst., 9, 385–403, https://doi.org/10.1080/02693799508902045, 1995.

Hutchinson, M. F.: Interpolation of rainfall data with thin plate smoothing splines – part I: two dimensional smoothing of data with short range correlation, Journal of Geographic Information and Decision Analysis, 2, 153–167, 1998.

Hutchinson, M. F. and Xu, T.: Anusplin version 4.2 user guide, Centre for Resource and Environmental Studies, The Australian National University, Canberra, 5, 2004.

IPCC: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Masson-Delmotte, V., Zhai, P., Pirani, A., Connors, S. L., Péan, C., Berger, S., Caud, N., Chen, Y., Goldfarb, L., Gomis, M. I., Huang, M., Leitzell, K., Lonnoy, E., Matthews, J. B. R., Maycock, T. K., Waterfield, T., Yelekçi, O., Yu, R., and Zhou, B., Cambridge University Press, Cambridge, UK, 1–195, https://doi.org/10.1017/9781009157896, 2021.

Jones, P. D., Osborn, T. J., and Briffa, K. R.: Estimating sampling errors in large-scale temperature averages, J. Climate, 10, 2548–2568, https://doi.org/10.1175/1520-0442(1997)010<2548:ESEILS>2.0.CO;2, 1997.

Kirschbaum, D. B., Huffman, G. J., Adler, R. F., Braun, S., Garrett, K., Jones, E., McNally, A., Skofronick-Jackson, G., Stocker, E., Wu, H., and Zaitchik, B. F.: NASA's remotely sensed precipitation: a reservoir for applications users, B. Am. Meteorol. Soc., 98, 1169–1184, https://doi.org/10.1175/BAMS-D-15-00296.1, 2017.

Kucera, P. A., Ebert, E. E., Turk, F. J., Levizzani, V., Kirschbaum, D., Tapiador, F. J., Loew, A., and Borsche, M.: Precipitation from space: advancing earth system science, B. Am. Meteorol. Soc., 94, 365–375, https://doi.org/10.1175/BAMS-D-11-00171.1, 2013.

Li, R., Wang, K., and Qi, D.: Validating the integrated multisatellite retrievals for global precipitation measurement in terms of diurnal variability with hourly gauge observations collected at 50 000 stations in China, J. Geophys. Res.-Atmos., 123, 10423–10442, https://doi.org/10.1029/2018JD028991, 2018.

Liszka, T.: An interpolation method for an irregular net of nodes, Int. J. Numer. Meth. Eng., 20, 1599–1612, https://doi.org/10.1002/nme.1620200905, 1984.

Lu, G. Y. and Wong, D. W.: An adaptive inverse-distance weighting spatial interpolation technique, Comput. Geosci., 34, 1044–1055, https://doi.org/10.1016/j.cageo.2007.07.010, 2008.

Ly, S., Charles, C., and Degré, A.: Different methods for spatial interpolation of rainfall data for operational hydrology and hydrological modeling at watershed scale: a review, Biotechnol. Agron. Soc., 17, 392–406, https://doi.org/10.6084/M9.FIGSHARE.1225842.V1, 2013.

Menne, M. J., Durre, I., Vose, R. S., Gleason, B. E., and Houston, T. G.: An overview of the global historical climatology network-daily database, J. Atmos. Ocean. Tech., 29, 897–910, https://doi.org/10.1175/JTECH-D-11-00103.1, 2012.

Merino, A., García-Ortega, E., Navarro, A., Fernández-González, S., Tapiador, F. J., and Sánchez, J. L.: Evaluation of gridded rain-gauge-based precipitation datasets: Impact of station density, spatial resolution, altitude gradient and climate, Int. J. Climatol., 41, 3027–3043, https://doi.org/10.1002/joc.7003, 2021.

Mitchell, T. D. and Jones, P. D.: An improved method of constructing a database of monthly climate observations and associated high-resolution grids, Int. J. Climatol., 25, 693–712, https://doi.org/10.1002/joc.1181, 2005.

Morrissey, M. L., Maliekal, J. A., Greene, J. S., and Wang, J.: The uncertainty of simple spatial averages using rain gauge networks, Water Resour. Res., 31, 2011–2017, https://doi.org/10.1029/95WR01232, 1995.

Myhre, G., Samset, B. H., Hodnebrog, Ø., Andrews, T., Boucher, O., Faluvegi, G., Fläschner, D., Forster, P. M., Kasoar, M., Kharin, V., Kirkevåg, A., Lamarque, J. F., Olivié, D., Richardson, T. B., Shawki, D., Shindell, D., Shine, K. P., Stjern, C. W., Takemura, T., and Voulgarakis, A.: Sensible heat has significantly affected the global hydrological cycle over the historical period, Nat. Commun., 9, 1922, https://doi.org/10.1038/s41467-018-04307-4, 2018.

New, M., Hulme, M., and Jones, P.: Representing twentieth-century space–time climate variability. part II: development of 1901–96 monthly grids of terrestrial surface climate, J. Climate, 13, 2217–2238, https://doi.org/10.1175/1520-0442(2000)013<2217:RTCSTC>2.0.CO;2, 2000.

Peng, S., Ding, Y., Liu, W., and Li, Z.: 1 km monthly temperature and precipitation dataset for China from 1901 to 2017, Earth Syst. Sci. Data, 11, 1931–1946, https://doi.org/10.5194/essd-11-1931-2019, 2019.

Qin, R., Zhao, Z., Xu, J., Ye, J.-S., Li, F.-M., and Zhang, F.: HRLT: a high-resolution (1 d, 1 km) and long-term (1961–2019) gridded dataset for surface temperature and precipitation across China, Earth Syst. Sci. Data, 14, 4793–4810, https://doi.org/10.5194/essd-14-4793-2022, 2022.

Rodell, M., Famiglietti, J. S., Wiese, D. N., Reager, J. T., Beaudoing, H. K., Landerer, F. W., and Lo, M. H.: Emerging trends in global freshwater availability, Nature, 557, 651–659, https://doi.org/10.1038/s41586-018-0123-1, 2018.

Schamm, K., Ziese, M., Becker, A., Finger, P., Meyer-Christoffer, A., Schneider, U., Schröder, M., and Stender, P.: Global gridded precipitation over land: a description of the new GPCC First Guess Daily product, Earth Syst. Sci. Data, 6, 49–60, https://doi.org/10.5194/essd-6-49-2014, 2014.

Shen, Y. and Xiong, A.: Validation and comparison of a new gauge-based precipitation analysis over mainland China, Int. J. Climatol., 36, 252–265, https://doi.org/10.1002/joc.4341, 2016.

Shen, Y., Feng, M., Zhang, H., and Gao, F.: Interpolation Methods of China Daily Precipitation Data, J. Appl. Meteorol. Sci., 21, 279–286, 2010 (in Chinese).

Shen, Y., Zhao, P., Pan, Y., and Yu, J.: A high spatiotemporal gauge-satellite merged precipitation analysis over China, J. Geophys. Res.-Atmos., 119, 3063–3075, https://doi.org/10.1002/2013JD020686, 2014.

Shepard, D.: A two-dimensional interpolation function for irregularly-spaced data, Proceedings of the 1968 23rd ACM national conference, 27–29 August 1968, New York, United States, 517–524, https://doi.org/10.1145/800186.810616, 1968.

Shepard, D. S.: Computer Mapping: The SYMAP Interpolation Algorithm, in: Spatial Statistics and Models, edited by: Gaile, G. L., and Willmott, C. J., Springer Netherlands, Dordrecht, 133–145, https://doi.org/10.1007/978-94-017-3048-8_7, 1984.

Sibson, R.: Locally equiangular triangulations, Comput. J., 21, 243–245, https://doi.org/10.1093/comjnl/21.3.243, 1978.

Sun, Q., Miao, C., Duan, Q., Ashouri, H., Sorooshian, S., and Hsu, K.-L.: A review of global precipitation data sets: Data sources, estimation, and intercomparisons, Rev. Geophys., 56, 79–107, https://doi.org/10.1002/2017RG000574, 2018.

Tait, A., Henderson, R., Turner, R., and Zheng, X.: Thin plate smoothing spline interpolation of daily rainfall for New Zealand using a climatological rainfall surface, Int. J. Climatol., 26, 2097–2115, https://doi.org/10.1002/joc.1350, 2006.

Thiessen, A. H.: Precipitation averages for large areas, Mon. Weather Rev., 39, 1082–1089, https://doi.org/10.1175/1520-0493(1911)39<1082b:PAFLA>2.0.CO;2, 1911.

Trenberth, K. E., Dai, A., Rasmussen, R. M., and Parsons, D. B.: The changing character of precipitation, B. Am. Meteorol. Soc., 84, 1205–1218, https://doi.org/10.1175/BAMS-84-9-1205, 2003.

Vivoni Enrique, R., Ivanov Valeri, Y., Bras Rafael, L., and Entekhabi, D.: Generation of triangulated irregular networks based on hydrological similarity, J. Hydrol. Eng., 9, 288–302, https://doi.org/10.1061/(ASCE)1084-0699(2004)9:4(288), 2004.

Wahba, G. and Wendelberger, J.: Some new mathematical methods for variational objective analysis using splines and cross validation, Mon. Weather Rev., 108, 1122–1143, https://doi.org/10.1175/1520-0493(1980)108<1122:SNMMFV>2.0.CO;2, 1980.

Wu, J. and Gao, X.: A gridded daily observation dataset over China region and comparison with the other datasets, Chinese J. Geophys., 56, 1102–1111, https://doi.org/10.6038/cjg20130406, 2013 (in Chinese).

Xie, P., Chen, M., Yang, S., Yatagai, A., Hayasaka, T., Fukushima, Y., and Liu, C.: A gauge-based analysis of daily precipitation over east Asia, J. Hydrometeorol., 8, 607–626, https://doi.org/10.1175/JHM583.1, 2007.

Yatagai, A., Kamiguchi, K., Arakawa, O., Hamada, A., Yasutomi, N., and Kitoh, A.: APHRODITE: constructing a long-term daily gridded precipitation dataset for Asia based on a dense network of rain gauges, B. Am. Meteorol. Soc., 93, 1401–1415, https://doi.org/10.1175/BAMS-D-11-00122.1, 2012.

Zhang, Y., Ren, Y., Ren, G., and Wang, G.: Precipitation trends over mainland China from 1961–2016 after removal of measurement biases, J. Geophys. Res.-Atmos., 125, e2019JD031728, https://doi.org/10.1029/2019JD031728, 2020.

Zhao, Y., Zhu, J., and Xu, Y.: Establishment and assessment of the grid precipitation datasets in China for recent 50 years (in Chinese), J. Meteorol. Sci., 34, 414–420, https://doi.org/10.3969/2013jms.0008, 2014.

Zou, C., Zhang, H., Yang, Y., Liu, Y., Sun, H., and Gao, X.: Effects of rainfall characteristics on bromide leaching in a typical tobacco field in China's Yunnan Province, Soil Sci. Soc. Am. J., 87, 231–245, https://doi.org/10.1002/saj2.20502, 2022.

Articles

Download

Article (13391 KB)
Full-text XML

Short summary

Constructing a high-quality, long-term daily precipitation dataset is essential to current hydrometeorology research. This study aims to construct a long-term daily precipitation dataset with different spatial resolutions based on 2839 gauge observations. The constructed precipitation dataset shows reliable quality compared with the other available precipitation products and is expected to facilitate the advancement of drought monitoring, flood forecasting, and hydrological modeling.

A new daily gridded precipitation dataset for the Chinese mainland based on gauge observations

2.1 Raw gauge data used for interpolation

2.2 High-density gauge observations used for validation

2.3 SRTM-DEM

2.4 PRISM

3.1 Interpolation scheme

3.2 Building the daily climatology field (Cd)

3.3 Constructing a field of the ratio between daily precipitation and daily climatology (P/Cd)

3.3.1 ADW

3.3.2 IDW

3.3.3 TPS

3.3.4 TNNI

3.4 Eight combination schemes for the daily climatology field and ratio field

3.5 Validation

4.1 Best interpolation scheme derived by validation

4.2 Comparison with other gauge-based datasets

3.2 Building the daily climatology field (C_d)

3.3 Constructing a field of the ratio between daily precipitation and daily climatology ( $P / C_{d}$ )