the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
WHU-SGCC: a novel approach for blending daily satellite (CHIRP) and precipitation observations over the Jinsha River basin
Gaoyun Shen
Nengcheng Chen
Wei Wang
Accurate and consistent satellite-based precipitation estimates blended with rain gauge data are important for regional precipitation monitoring and hydrological applications, especially in regions with limited rain gauges. However, the existing fusion precipitation estimates often have large uncertainties over mountainous areas with complex topography and sparse rain gauges, and most of the existing data blending algorithms are not good at removing the day-by-day errors. Therefore, the development of effective methods for high-accuracy precipitation estimates over complex terrain and at a daily scale is of vital importance for mountainous hydrological applications. This study aims to offer a novel approach for blending daily precipitation gauge data and the Climate Hazards Group Infrared Precipitation (CHIRP; daily, 0.05∘) satellite-derived precipitation developed by UC Santa Barbara over the Jinsha River basin from 1994 to 2014. This method is called the Wuhan University Satellite and Gauge precipitation Collaborated Correction (WHU-SGCC). The results show that the WHU-SGCC method is effective for liquid precipitation bias adjustments from points to surfaces as evaluated by multiple error statistics and from different perspectives. Compared with CHIRP and CHIRP with station data (CHIRPS), the precipitation adjusted by the WHU-SGCC method has greater accuracy, with overall average improvements of the Pearson correlation coefficient (PCC) by 0.0082–0.2232 and 0.0612–0.3243, respectively, and decreases in the root mean square error (RMSE) by 0.0922–0.65 and 0.2249–2.9525 mm, respectively. In addition, the Nash–Sutcliffe efficiency coefficient (NSE) of the WHU-SGCC provides more substantial improvements than CHIRP and CHIRPS, which reached 0.2836, 0.2944, and 0.1853 in the spring, autumn, and winter. Daily accuracy evaluations indicate that the WHU-SGCC method has the best ability to reduce precipitation bias, with average reductions of 21.68 % and 31.44 % compared to CHIRP and CHIRPS, respectively. Moreover, the accuracy of the spatial distribution of the precipitation estimates derived from the WHU-SGCC method is related to the complexity of the topography. The validation also verifies that the proposed approach is effective at detecting major precipitation events within the Jinsha River basin. In spite of the correction, the uncertainties in the seasonal precipitation forecasts in the summer and winter are still large, which might be due to the homogenization attenuating the extreme rain event estimates. However, the WHU-SGCC approach may serve as a promising tool to monitor daily precipitation over the Jinsha River basin, which contains complicated mountainous terrain with sparse rain gauge data, based on the spatial correlation and the historical precipitation characteristics. The daily precipitation estimations at the 0.05∘ resolution over the Jinsha River basin during all four seasons from 1990 to 2014, derived from WHU-SGCC, are available at the PANGAEA Data Publisher for Earth & Environmental Science portal (https://doi.org/10.1594/PANGAEA.905376, Shen et al., 2019).
- Article
(28910 KB) - Full-text XML
- BibTeX
- EndNote
Accurate and consistent estimates of precipitation are vital for hydrological modelling, flood forecasting, and climatological studies in support of better planning and decision making (Agutu et al., 2017; Cattani et al., 2018; Roy et al., 2017). In general, ground-based gauge networks include a substantial number of liquid precipitation observations measured with high accuracy, high temporal resolution, and long historical records. However, the sparse distribution and point measurements limit the accurate estimation of spatially gridded rainfall (Martens et al., 2013).
Due to the sparseness and uneven spatial distribution of rain gauges and the high proportion of missing data, satellite-derived precipitation data are an attractive supplement offering the advantage of plentiful information with high spatio-temporal resolution over widespread regions, particularly over oceans, high-elevation mountainous regions, and other remote regions where gauge networks are difficult to deploy. However, satellite estimates are susceptible to systematic biases that can influence hydrological modelling, and the retrieval algorithms are relatively insensitive to light rainfall events, especially in complex terrain, resulting in underestimations of the magnitudes of precipitation events (Behrangi et al., 2014; Thiemig et al., 2013; Yang et al., 2017). Without adjustments, inaccurate satellite-based precipitation estimates will lead to unreliable assessments of risk and reliability (AghaKouchak et al., 2011).
Accordingly, many kinds of precipitation estimates combining multiple sources and datasets are available. Table 1 shows the temporal and spatial resolution of current major satellite-based precipitation datasets. Since 1997, the Tropical Rainfall Measurement Mission (TRMM) has improved satellite-based rainfall retrievals over tropical regions (Kummerow et al., 1998; Simpson et al., 1988). High spatial and temporal resolution multi-satellite precipitation products were developed continuously during the TRMM era (Maggioni et al., 2016), including (1) the TRMM Multisatellite Precipitation Analysis (TMPA) products, which are derived from gauge–satellite fusing (Huffman et al., 2010; Vila et al., 2009); (2) the Climate Prediction Center (CPC) morphing technique (Joyce et al., 2004; Joyce and Xie, 2011; Xie et al., 2017), which integrates geosynchronous infrared (GEO IR) and polar-orbiting microwave (PMW) sensor data and is available 3-hourly on a grid with a spatial resolution of 0.25∘; (3) the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks - Climate Data Record (PERSIANN-CDR) produced by the PERSIANN algorithm, which has daily temporal and spatial resolutions (Ashouri et al., 2015); and (4) the Global Satellite Mapping of Precipitation (GSMaP) project, which produces global rainfall estimates in near-real time and applies the motion vector Kalman filter based on physical models (GSMaP-NRT and GSMaP-MVK, respectively) (Aonashi et al., 2009; Ushio et al., 2009; Ushio and Kachi, 2010). In 2014, the Global Precipitation Measurement (GPM) satellite was launched after the success of the TRMM satellite by a cooperation between the National Aeronautics and Space Administration (NASA) and the Japan Aerospace Exploration Agency (JAXA) (Mahmoud et al., 2018; Ning et al., 2016). The main core observatory satellite (GPM) integrates advanced radar and radiometer systems to obtain the precipitation physics and takes advantages of TMPA, the Climate Prediction Center morphing technique (CMORPH), and PERSIANN algorithms to offer high spatio-temporal resolution products (, half-hourly) of global real-time precipitation estimates (Huffman et al., 2018; Skofronick-Jackson et al., 2017; Hou et al., 2014). Nevertheless, the major aforementioned products have only been available since 1998, which limits long-term climatological studies. Only the PERSIANN-CDR dataset has temporal coverage since 1983. However, the spatial resolution of PERSIANN-CDR is relatively coarse, and the data resolution must be degraded to achieve high accuracy in precipitation monitoring. To fill the gap in high-resolution and long-term global multi-satellite precipitation monitoring, the Multi-Source Weighted-Ensemble Precipitation (MSWEP) product (Beck et al., 2017, 2019) and the Climate Hazards Group Infrared Precipitation with Station data (CHIRPS) product from UC Santa Barbara (Funk et al., 2015a) were developed. MSWEP is a precipitation dataset with global coverage available at 0.1∘ spatial resolution and at 3-hourly, daily, and monthly temporal resolutions. MSWEP is multi-source data that take advantage of the complementary strengths of gauge-,satellite-, and reanalysis-based data. However, to provide precipitation estimates at a higher spatial resolution, the CHIRPS dataset is used in this study.
CHIRPS is a longer-length precipitation data series with a higher spatial resolution (0.05∘) that merges three types of information: global climatology, satellite estimates, and in situ observations. The CHIRPS precipitation dataset with several temporal and spatial scales has been evaluated in Brazil (Nogueira et al., 2018; Paredes-Trejo et al., 2017), Chile (Yang et al., 2016; Zambrano-Bigiarini et al., 2017), China (Bai et al., 2018), Cyprus (Katsanos et al., 2016a, b), India (Ali and Mishra, 2017; Prakash, 2019), and Italy (Duan et al., 2016). However, the temporal resolutions of these applications were mainly at seasonal and monthly scales, lacking the evaluation and correction of daily precipitation. Additionally, despite the great potential of gauge–satellite fusing products for large-scale environmental monitoring, there are still large discrepancies with ground observations at the sub-regional level where these data have been applied. Furthermore, the CHIRPS product's reliability has not been analysed in detail over the Jinsha River basin in China, particularly at a daily scale. The Jinsha River basin is a typical study area with complex and varied terrain, an uneven spatial distribution of precipitation, and a sparse spatial distribution of rain gauges, which limit high-accuracy precipitation monitoring. The existing research indicates that estimations over mountainous areas with complex topography often have large uncertainties and bias due to the topography, seasonality, climate impact, and sparseness of rain gauges (Derin et al., 2016; Maggioni and Massari, 2018; Zambrano-Bigiarini et al., 2017). Moreover, Bai et al. (2018) evaluated CHIRPS over mainland China and indicated that the performance of CHIRPS is poor over the Sichuan basin and the northern China Plain, which have complex terrain with substantial variations in elevation. Additionally, Trejo et al. (2016) show that CHIRPS overestimates low monthly rainfall and underestimates high monthly rainfall using several numerical metrics and that the rainfall event frequency is overestimated outside the rainy season.
To overcome these limitations, many studies have focused on proposing effective methodologies for blending rain gauge observations, satellite-based precipitation estimates, and sometimes radar data to take advantage of each dataset. Many numerical models have been established with these datasets for high-accuracy precipitation estimations, such as bias adjustment by a quantile mapping (QM) approach (Yang et al., 2016), Bayesian kriging (BK) (Verdin et al., 2015), and a conditional merging technique (Berndt et al., 2014). The QM approach is a distribution-based approach, which works with historical data for bias adjustment and is effective at reducing the systematic bias of regional climate model precipitation estimates at monthly or seasonal scales (Chen et al., 2013). However, the QM approach offers very limited improvement in removing day-by-day errors. The BK approach provides very good model fit with precipitation observations, but the Gaussian assumption of the BK model is invalid for daily scales. Overall, there is a lack of effective methods for high-accuracy precipitation estimates over complex terrain at a daily scale.
As such, due to the poor performance at the sub-regional scale, the gauge-satellite fusing algorithms can be assumed to limit high-accuracy estimations in the process of CHIRPS data production. Therefore, the aim of this article is to present a novel approach for reblending daily liquid precipitation gauge data and the Climate Hazards Group Infrared Precipitation (CHIRP) satellite-derived precipitation estimates developed by UC Santa Barbara over the Jinsha River basin. We use precipitation to denote liquid precipitation throughout the text. The CHIRP data are the raw data of CHIRPS before blending with the rain gauge data. The objective is to build corresponding precipitation models that consider terrain factors and precipitation characteristics to produce high-quality precipitation estimates. This novel method is called the Wuhan University Satellite and Gauge precipitation Collaborated Correction (WHU-SGCC) method. We present this method by applying it to daily precipitation over the Jinsha River basin in the different seasons from 1990 to 2014. The results support the validity of the proposed approach for producing refined satellite gauge precipitation estimates over mountainous areas.
The remainder of this paper is organized as follows: Sect. 2 describes the study region, rain gauges, and CHIRPS dataset used in this study. Section 3 presents the principle of the WHU-SGCC approach for high-accuracy daily precipitation estimates. The results and discussion are analysed in Sect. 4, the data available are described in Sect. 5, and the conclusions and future work are presented in Sect. 6.
2.1 Study region
The Yangtze River is one of the largest and most important rivers in South-east Asia, originating on the Tibetan Plateau and extending approximately 6300 km eastward to the East China Sea. The river's catchment covers an area of approximately km2 and the average annual precipitation is approximately 1100 mm (Zhang et al., 2019). The Yangtze River is divided into nine sub-basins, and the upper drainage basin is the Jinsha River basin, which flows through the provinces of Qinghai, Sichuan, and Yunnan in western China. Within the Jinsha River basin, the total river length is 3486 km, accounting for 77 % of the length of the upper Yangtze River and covering a watershed area of 460×103 km2. The location of the Jinsha River basin is shown in Fig. 1, and it covers the eastern part of the Tibetan Plateau and part of the Hengduan Mountains. The southern portion of the river basin is the northern Yunnan Plateau, and the eastern portion includes a wide area of the south-western margin of the Sichuan basin. Crossing complex and varied terrains, the elevation of the Jinsha River ranges from 263 to 6575 m above sea level, which results in significant temporal and spatial climate and weather variations inside the basin. The average annual precipitation of the Jinsha River basin is approximately 710 mm, the average annual precipitation of the lower reaches is approximately 900–1300 mm, and the average annual precipitation of the middle and upper reaches is approximately 600–800 mm (Yuan et al., 2018). The Jinsha River basin has four seasons: spring (March–April–May), summer (June–July–August), autumn (September–October–November), and winter (December–January–February). Therefore, the blending of satellite estimations with gauge observations during the different seasons is the main focus of this research.
2.2 Study data
2.2.1 Precipitation gauge observations
Daily rain gauge observations at 30 national standard rain stations within the Jinsha River basin from 1 March 1990 to February 2015 were provided by the National Climate Centre (NCC) of the China Meteorological Administration (CMA) (http://data.cma.cn/data/cdcdetail/dataCode/SURF_CLI_CHN_MUL_DAY_V3.0.html, last access: 10 December 2018), which imposes strict quality control at the station, provincial, and state levels. The process of quality control conducted by the CMA is as follows: (1) climate threshold or allowable value check; (2) extreme values at gauge stations check; (3) internal consistency check between fixed value, daily average value, and daily extreme value; (4) time consistency check; and (5) manual verification and correction. The station identification numbers and relevant geographical characteristics are shown in Appendix A, and their uneven spatial distribution is shown in Fig. 2. The selected rain gauges are located in Qinghai, Tibet, Sichuan, and Yunnan provinces, but are mainly scattered in Sichuan Province, and the northern river basin contains fewer rain gauges than the southern river basin. In this study, the daily rain gauge observations were used as the reference data for the bias correction of satellite precipitation estimations.
The multi-annual (1990–2014) average seasonal precipitation over the Jinsha River basin increases from north to south (Fig. 2). The dynamic and uneven distribution of precipitation is influenced distinctly by the seasonal climate. Most of the precipitation falls in the summer, with the average seasonal precipitation ranging from less than 250 mm to more than 600 mm, while the average seasonal precipitation during the winter is no more than 50 mm. The average seasonal precipitation and spatial distribution in the spring are similar to those in the autumn, with values concentrated in the range of 50 to 200 mm.
2.2.2 CHIRPS satellite-gauge fusion precipitation estimates
The CHIRPS v.2 dataset, a satellite-based daily rainfall product, is available online at ftp://ftp.chg.ucsb.edu/pub/org/chg/products/CHIRPS-2.0/global_daily/tifs/p05/ (last access: 10 December 2018). It covers a quasi-global area (land only, 50∘ S–50∘ N) at several temporal scales (daily, pentad, decad, monthly, and annual temporal resolutions) and a high spatial resolution (0.05∘) (Rivera et al., 2018). This dataset contains a wide variety of satellite-based rainfall products derived from multiple data sources and incorporates five data types: (1) the monthly precipitation from CHPClim v.1.0 (Climate Hazards Group's Precipitation Climatology version 1) derived from a combination of satellite fields, gridded physiographic indicators, and in situ climate normal with the geospatial modelling approach based on moving window regressions and inverse distance weighting interpolation (Funk et al., 2015b); (2) quasi-global geostationary thermal infrared (IR) satellite observations; (3) the TRMM 3B42 product (Huffman et al., 2007); (4) the CFS (Climate Forecast System, version 2) atmospheric model rainfall fields from NOAA; and (5) surface-based precipitation observations from various sources including national and regional meteorological services. The differences from other frequently used precipitation products are the higher resolution of 0.05∘, wider coverage, and longer length data series from 1981 to near-real time (Funk et al., 2015a).
CHIRPS is the blended product of a two-part process. First, IR precipitation (IRP) pentad rainfall estimates are fused with corresponding CHPClim pentad data to produce an unbiased gridded estimate, called CHIRP, which is available online at ftp://ftp.chg.ucsb.edu/pub/org/chg/products/CHIRP/daily/ (last access: 10 December 2018). In the second part of the process, the CHIRP data are blended with ground-based precipitation observations obtained from a variety of sources, including national and regional meteorological services by means of a modified inverse-distance weighting algorithm to create the final blended product, CHIRPS (Funk et al., 2014). The daily CHIRP satellite-based data over the Jinsha River basin from February 1990 to February 2015 were selected as the input for WHU-SGCC blending with rain observations, and the corresponding daily CHIRPS blended data was used for comparisons of the precipitation accuracy.
The blended in situ daily precipitation observations of the CHIRPS data come from a variety of sources, such as the daily GHCN archive (Durre et al., 2010), the Global Summary of the Day dataset (GSOD) provided by NOAA's National Climatic Data Center, the World Meteorological Organization's Global Telecommunication System (GTS) daily archive provided by NOAA CPC, and more than a dozen national and regional meteorological services. However, the stations for daily CHIRPS data have a different spatial distribution than those downloaded from the CMA, and the precipitation values used for CHIRPS production are the monthly values available online (ftp://ftp.chg.ucsb.edu/pub/org/chg/products/CHIRPS-2.0/diagnostics/monthly_station_data/, last access: 15 August 2019). For the daily precipitation adjustments over the Jinsha River basin, the daily gauge observations from the CMA are blended with the daily CHIRP data due to the unknown spatial distribution and precipitation values of gauge stations used in the process of daily CHIRPS merging.
3.1 The WHU-SGCC approach
In this study, the WHU-SGCC approach estimates the precipitation at every pixel by blending satellite estimates and rain gauge observations considering the terrain factors and precipitation characteristics. Due to the significant seasonal difference of precipitation, the WHU-SGCC method was applied in the different seasons. Four steps were used to establish the numerical relationship between the gauge stations and the corresponding satellite pixels and for the interpolation of the remaining pixels. The WHU-SGCC method identifies the geographical locations and topographical features of each pixel and applies the four classification and blending rules. A flowchart of the WHU-SGCC method is shown in Fig. 3. The proposed approach was evaluated over the Jinsha River basin based on 30 gauge stations and CHIRP satellite-based precipitation estimations in the different seasons from 1990 to 2014. The leave-one-out cross-validation step was applied to compute the out-of-sample adjusted bias with the gauge stations. The WHU-SGCC algorithm was repeated 30 times, each time leaving one station as the validation station.
The basic description of the WHU-SGCC method is given below, and the details are illustrated separately in later sections.
-
Classify all regional pixels into four types: C1 (pixels including one gauge station in their area), C2 (pixels statistically similar to C1), C3 (pixels statistically similar to C2), and C4 (remaining pixels).
-
Analyse the relationships between the precipitation observations and the C1, C2, and C3 pixel types, and interpolate for the C4 pixels. These relationships are described by four rules, which are described below as Rules 1 through 4.
-
Establish statistical models and screen the target pixels based on the four rules.
-
Correct all of the precipitation pixels in the daily regional precipitation images.
3.1.1 Assumptions
-
Gauge observations are the most accurate, or “true”, values for reference purposes. However, the sparseness of the gauges, their uneven spatial distribution, and the high proportion of missing data may limit high-accuracy estimation in rainfall monitoring.
-
No major terrain changes occurred during the 20 years (Appendix B).
-
There are no abnormal values at one pixel in the CHIRP dataset during the long time series, so Pearson's correlation coefficient (PCC) can represent the statistical similarity of the rainfall characteristics among the pixels in a certain spatial area at a seasonal scale.
3.1.2 Rule 1 of the WHU-SGCC method
In general, the satellite precipitation estimations deviated from the ground-based measurements, which were assumed to be the true values. Rule 1 aims to establish a regression model between the historical observations at each gauge and the corresponding CHIRP grid cell values. The regression relationship was derived by random forest regression (RFR) at each gauge station. RFR is a machine-learning algorithm for a predictive model with a large set of regression trees in which each tree in the ensemble is grown from a bootstrap sample (Johnson, 1998) drawn with a replacement from the training set. In the process of establishing regression trees, a subset of variables for each node is selected to avoid overfitting. The final prediction is obtained by combining the results of the prediction methods applied to each bootstrap sample (Genuer et al., 2017). The predicted value is calculated by the average of the values from all of the decision trees. Each tree can be expressed as
where Yo denotes the historical observations at each gauge at the C1 pixels, is a randomly selected vector from Ys, Ys denotes the corresponding CHIRP grid cell values at the C1 pixels, n is the number of trees, and fRFR is constructed from the time series Yo (dependent variable) and Ys (independent variable) by means of RFR. The bootstrap sample will be the training set used for growing the tree. The error rate (out-of-bag, OOB) left out of one-third of the training data is also monitored to determine the number of decision trees. In this study, the minimum OOB error rate was reached when the number of decision trees n was less than 500 (Appendix C).
Rule 1 builds the statistical relationships between the gauge observations and the corresponding CHIRP grid cell values, which is the key idea in correcting the satellite-based precipitation estimations in the entire study area. In the process of Rule 1, the regression relationships at the C1 pixels were established at 30 gauge stations separately. The values of the C1 pixels are not corrected in Rule 1, but are interpolated in Rule 4.
3.1.3 Rule 2 of the WHU-SGCC method
It is reasonable to assume that some pixels are statistically similar to the historical precipitation characteristics of the C1 pixels within a certain area. Therefore, it is feasible to adjust the satellite estimation bias of the C2 pixels by referring to the appropriate regression relationships at the corresponding C1 pixels based on Rule 1.
First, the spatial area in which pixels may have highly similar characteristics is established. Several studies indicate that the geographical location, elevation, and other terrain information influence the spatial distribution of rainfall, especially in mountainous areas with complex topography (Anders et al., 2006; Long and Singh, 2013). The size of the spatial range is an important parameter to distinguish the spatial similarity and heterogeneity. In the WHU-SGCC method, the fuzzy c-means (FCM) clustering approach was used to determine the spatial range considered for each pixel's terrain factors, including longitude, latitude, elevation, slope, aspect, and curvature. The FCM method was developed by J. C. Dunn in 1973 (Dunn, 1973) and improved in 1983 (Wang, 1983). It is an unsupervised fuzzy clustering method and its steps are as follows (Pessoa et al., 2018).
-
Choose the number of clusters c. The optimum number of clusters is determined by L(c), which is derived from the inter-distance and inner distance of the samples in Eq. (2). It is ensured that the distance between similar samples is smaller, while the distance between different samples is larger.
In Eq. (2), the denominator is the inner distance, and the numerator is the inter-distance. The initial value of c is 1 and the maximum value of c is the number of gauge stations in the study area. The optimum number of clusters was optimized to maximize L(c). For this reason, the value of c is varied from 1 to the number of gauge stations with an increment of 1 in this study.
-
Assign coefficients randomly to each data point xi for the degree to which it belongs in the ith cluster wij(xi):
where x is a finite collection of n elements that will be partitioned into a collection of c fuzzy clusters, ci is the centre of each cluster, m is the hyper-parameter that controls the level of cluster fuzziness, wij is the degree to which element xi belongs to ci, and is the centre vector of the collection. In Eq. (3), represents the cluster centre in iteration t. If the minimum improvement in the objective function between two consecutive iterations satisfies the following equation, the algorithm terminates with iteration t (Eq. 6):
-
Minimize the objective function Fc to achieve data partitioning.
The results of the FCM are the degree of membership of each pixel to the cluster centre as represented by numerical values. The pixels in each cluster have similar terrain features and precipitation characteristics.
Second, as mentioned above, the aim of Rule 2 is to derive an adjustment method for the C2 pixels based on learning from Rule 1. With the establishment of a regression relationship between the gauge observations and the corresponding CHIRP grid cell values of the C1 pixels by the RFR method, the determination of the C2 pixels follows a complicated procedure. With the exception of the C1 pixels, the remaining pixels in each cluster represent potential C2 pixels, which are called R pixels. The PCC and p values between the satellite estimations (multi-annual daily CHIRP grid cell values) at the R pixels and the C1 pixels are the criteria for the final determination of the C2 pixels. The PCC is defined as follows:
where n is the number of samples, xi and yi are individual samples (CHIRP grid cell values at the C1 and C2 pixels, respectively), is the arithmetic mean of x calculated by , and is the arithmetic mean of y calculated by .
The PCC ranges between −1 and +1. If there are no repeated data values, a perfect PCC of +1 or −1 occurs when each of the variables is a perfect monotonic function of the other. However, if the value is close to zero, there is zero correlation. In addition, the correlation is determined not only by the value of the correlation coefficient, but also by the correlation test's p value. The critical values for the PCC and p value are 0.5 and 0.05, respectively; thus, a PCC value higher than 0.5 and a p value lower than 0.05 indicate that the data are significantly correlated (Zhang and Chen, 2016). Therefore, the final determination of the C2 pixels must meet the following criteria:
Each R pixel has m PCC and p values (the number of C1 pixels in the cluster), and the subset of C2 pixels is identified by excluding the data that failed the correlation test and retaining both the data with a maximum PCC of at least 0.5 and a p value lower than 0.05, and the corresponding index of C1 pixels. The selected C2 pixels can then be considered statistically similar to the precipitation characteristics of the corresponding C1 pixels in their defined spatial area.
Third, after identifying the C2 pixels and their corresponding C1 pixels, the adjustment method for the C2 pixels is derived from the regression model for the C1 pixels:
where Tree is the decision tree derived from the RFR algorithm at the corresponding C1 pixel, is the CHIRP grid cell value at the C2 pixels, and C2as is the adjusted satellite precipitation estimate calculated by the average of the values from the RFR decision trees.
3.1.4 Rule 3 of the WHU-SGCC method
Recognizing that precipitation has a spatial distribution, the assumption that the C3 pixels are statistically similar to the precipitation characteristics of the C2 pixels is adopted to establish the adjustment method for the C3 pixels.
First, the determination of the C3 pixels in each spatial cluster is based on the selection of C2 pixels. The satellite-based estimation values at the pixels other than the C1 and C2 pixels are used to calculate the PCC and p value with the satellite-based estimation values at the C2 pixels in the same cluster. The results of each pixel's k PCC and p value (the number of C2 pixels in the cluster) are evaluated based on the correlation test (Eq. 9) that the pixels have a maximum PCC of at least 0.5 and a p value is no more than 0.05, and the corresponding index of C2 pixels is retained. The selected pixels are called C3 pixels, which are statistically similar to the precipitation characteristics of the corresponding C2 pixels in the defined spatial area.
After identifying the C3 pixels, a method for merging the CHIRP grid cell values at the C3 pixels (Ys) and the target reference values of C2as at the corresponding C2 pixels is applied to estimate the adjusted precipitation values at the C3 pixels. This method combines the Ys and C2as values into one variable, as shown in Eq. (11):
where λ is a positive constant set to 10 mm (Sokol, 2003), C2as is the adjusted precipitation values at the C2 pixels, is extracted from the CHIRP grid cell values at the corresponding location of the C2 pixels, and n is the number of C2 pixels in each spatial cluster.
Each w of the C3 pixels is assigned the same value as the corresponding C2 pixel. Therefore, the values of the C3 pixels are derived from Eq. (12):
where C3as is the adjusted target precipitation value at one C3 pixel, and Ys is the corresponding CHIRP grid cell value. To avoid precipitation estimates below 0, Eq. (12) sets negative values to 0.
3.1.5 Rule 4 of the WHU-SGCC method
The pixels other than the C1, C2, and C3 pixels are called C4 pixels, and they are adjusted by inverse distance weighting (IDW). IDW is based on the concept of the first law of geography from 1970, which was defined as everything is related to everything else, but near things are more related than distant things. Therefore, the attribute value of an unsampled point is the weighted average of the known values within the neighbourhood, and the distance weighting can be determined by means of IDW (Lu and Wong, 2008). In Rule 4, IDW is used to interpolate the unknown spatial precipitation data from the adjusted precipitation values at the C2 and C3 pixels. The IDW formulas are given as Eqs. (13) and (14).
where Ras is the unknown spatial precipitation data, Ri is the adjusted precipitation values at the C2 and C3 pixels, n is the number of C2 and C3 pixels, di is the distance from each C2 or C3 pixel to the unknown grid cell, and α is the power which is generally specified as a geometric form for the weight. Several studies (Simanton and Osborn 1980; Tung, 1983) have experimented with variations in the power; a small α tends to estimate values with the averages of sampled grids in the neighbourhood, while a large α tends to give larger weights to the nearest points and increasingly down-weights points farther away (Chen and Liu, 2012; Lu and Wong, 2008). The value of α has an influence on the spatial distribution of the information from precipitation observations. For this reason, α is varied in the range of 0.1 to 3 (0.1, 0.3, 0.5, 1.0, 1.5, 2.0, 2.5, and 3.0) in this study.
Note that the unknown spatial precipitation data include C1 and C4 pixels because the C1 pixel values were not adjusted in Rule 1.
After applying these four rules, we obtained complete daily adjusted regional precipitation maps for the four seasons over the Jinsha River basin.
3.2 Accuracy assessment
The performance of the WHU-SGCC adjusted precipitation estimates was evaluated by eight mathematic metrics: the PCC, root mean square error (RMSE), mean absolute error (MAE), relative bias (BIAS), Nash–Sutcliffe efficiency coefficient (NSE), probability of detection (POD), false alarm ratio (FAR), and critical success index (CSI). The results of the accuracy assessment are the average values validated by the leave-one-out cross method. Each validated pixel will probably be a C2, C3, or C4 pixel in the process of the WHU-SGCC algorithm. The PCC, RMSE, MAE, and BIAS were used to evaluate how well the WHU-SGCC method adjusted the satellite estimation bias, while POD, FAR, and CSI were used to evaluate the performance of precipitation forecasting (Su et al., 2011). The PCC measures the strength of the correlation relationship between the satellite estimations and observations. The RMSE is an absolute measurement used to compare the difference between the satellite estimations and observations, and the MAE represents the average magnitude of error estimations considering both systematic and random errors. The NSE (Nash and Sutcliffe, 1970) determines the relative magnitude of the variance of the residuals compared to the variance of the observations, bounded by minus infinity and 1; a negative value indicates a poor precipitation estimate and a value of 1 indicates an optimal estimate. The BIAS measures the mean tendency of the estimated precipitation to be larger (positive values) or smaller (negative values) than the observed precipitation and has an optimal value of 0. The POD, also known as the hit rate, represents the probability of rainfall detection, and the FAR is defined as the ratio of the false alarm of rainfall to the total number of rainfall events. All of the accuracy assessment metrics are shown in Table 2.
Note: Yoi is the observation data; Ci is the adjusted value using the WHU-SGCC method for the test sample pixel; is the arithmetic mean of Yo and is given by ; is the arithmetic mean of C and is given by ; H represents the number of both observed and estimated precipitation events (successfully forecasted); F is the number of false alarms when the observed precipitation was below the threshold and the estimated precipitation was above the threshold (false alarms); and M is the number of events in which the estimated precipitation was below the threshold and the observed precipitation was above the threshold (missed forecasts). The POD and FAR values are dimensionless numbers ranging from 0 to 1. The precipitation threshold (event/no event) was set to 0.1 mm d−1.
A total of 18 482 daily pixels were adjusted by blending the satellite estimations (CHIRP) and observations (rain gauge stations) using the WHU-SGCC approach over the Jinsha River basin from 1990 to 2014. The percentage of pixels adjusted by each rule in the WHU-SGCC method is shown in Table 3. The number of C1 pixels was the number of training gauge stations, which accounted for 0.16 % of the total pixels (18 482) within the basin. Due to the leave-one-out cross-validation step, the different training samples will have different numbers of C2, C3, and C4 pixels within the Jinsha River basin. The percentages of C2 and C3 pixels are highest in autumn, followed by summer, spring, and winter. In the spring, the average percentage of C2 pixels was approximately 21.27 %, the average percentage of C3 pixels was approximately 17.12 %, and the percentage of C4 pixels was approximately 61.46 %. In the summer, the percentage of C2 pixels was approximately 17.86 %, the percentage of C3 pixels was approximately 23.43 %, and the percentage of C4 pixels was approximately 58.55 %. In the autumn, the average percentage of C2 pixels was approximately 31.40 %, the average percentage of C3 pixels was approximately 21.77 %, and the average percentage of C4 pixels was approximately 46.68 %. In the winter, the average percentage of C2 pixels was approximately 15.60 %, the average percentage of C3 pixels was approximately 19.23 %, and the average percentage of C4 pixels was approximately 65.01 %. In addition, the pixel type of the validation gauge station is shown in Table D1 and the spatial distribution of C1–C3 pixels in Fig. D1, with the most uniform in the autumn and the sparsest in the winter. Each validation gauge station could be identified as either C2, C3, or C4 pixels to evaluate the performances of all the rules in the WHU-SGCC method.
4.1 Model performance based on overall accuracy evaluations
The multi-annual (1990–2014) average seasonal precipitation over the Jinsha River basin interpolated from WHU-SGCC, CHIRP, and CHIRPS is shown in Fig. 4. There exist some differences in the spatial pattern of precipitation estimates. Overall, the WHU-SGCC method exhibits a similar spatial distribution of precipitation to the CHIRP and CHIRPS, while the WHU-SGCC method attenuated the intense rain in the central area. The statistical accuracy evaluations are needed to further analyse the performance of the WHU-SGCC method.
To test the performance of the WHU-SGCC method for precipitation estimates, the PCC, RMSE, BAE, BIAS, NSE, POD, FAR, and CSI were calculated and are presented in Table 4 (the results were derived from the 22 clusters for the FCM in Rule 2, as shown in Appendix E, and α=0.1 for the IDW in Rule 4 after the comparison of the RMSE). After the correction, the PCC in the WHU-SGCC method shows an improvement relative to the CHIRP and CHIRPS estimates. The spring and autumn have better correlations than the summer and winter. In addition, the NSE of the WHU-SGCC provides substantial improvements over CHIRP and CHIRPS, especially in the spring and autumn, which were better than the summer and winter. The RMSE and MAE are the largest in the summer, followed by the autumn, spring, and winter; however, the performances of the BIAS in the summer and autumn are better than those in the spring and winter, which might be influenced by the greater precipitation in the summer and autumn than in the spring and winter. The assessments of the POD and CSI are lowest, and the FAR is largest in the winter due to the overestimation of no-rain events estimated by the satellite-based dataset.
Compared with the estimates of CHIRP and CHIRPS, the PCCs of the WHU-SGCC method are improved to more than 0.5 in the spring and autumn and to approximately to 0.5 in the winter, with overall average improvements of the Pearson correlation coefficient (PCC) by 0.0082–0.2232 and 0.0612–0.3243, respectively. In addition, the RMSE and MAE of the WHU-SGCC were all lower than those of CHIRP and CHIRPS, with overall average decreases in the root mean square error (RMSE) by 0.0922–0.65 and 0.2249–2.9525 mm, respectively. The absolute values of the BIAS of the WHU-SGCC are substantial improved in the spring, followed by the summer, winter, and autumn. Although the absolute values of the BIAS of the WHU-SGCC in autumn are not significantly better than those of CHIRP and CHIRPS, all of the values are approximately 0. The NSEs of the WHU-SGCC reached 0.2836, 0.2944, and 0.1853 in the spring, autumn, and winter, respectively, which are substantially better than the negative or zero values of CHIRP and CHIRPS. In the summer, the NSE of the WHU-SGCC is still negative, but it is improved to be nearly zero, which indicates that the adjusted results are similar to the average level of the rain gauge observations. It is worth noting that in the spring, summer and autumn, the POD values of the WHU-SGCC are in the range of 0.95 to 1, better than CHIRP and CHIRPS, and the FAR values of the WHU-SGCC are no more than 0.3, lower than CHIRP and CHIRPS; these results represent the better ability of the WHU-SGCC method to predict precipitation events. The rainfall detection ability is the worst in the winter compared to the other seasons. This can be explained by the seasonal distribution of precipitation in the Jinsha River basin, in which the most rainfall occurs in the summer, followed by the autumn, spring, and winter. In addition, the spatial distribution of C2 and C3 pixels might slightly impact the overall accuracy in different seasons that are the sparsest in the winter but more uniform in the summer. However, the performances of PCC, RMSE, MAE, and NSE in the winter are better than those in the summer. The worst errors of forecasting performance in the summer may be attributed to the highest precipitation. The limited precipitation event detection in the winter could also be explained by the lowest precipitation (Xu et al., 2019).
The spatial distributions of the statistical comparisons between the observations and the WHU-SGCC precipitation estimations are shown in Figs. 5 and 6. Overall, the variation in the PCC shows low correlations in areas with lower elevation, particularly in the south-eastern Jinsha River basin, where there is higher precipitation and a greater density of rain gauges. The PCC is highest in the autumn, followed by the spring and winter, and finally by summer. The higher correlations are located in the northern-central area along the Tongtian River, Jinsha River, and upstream part of the Yalong River, which has complex terrain and few rain gauges. The RMSE is lowest in the winter than in the spring, autumn, and summer, which can be attributed to the lower precipitation in the winter and the greatest in the summer. The spatial distribution of the RMSE shows that the smaller errors are scattered in the north-western area of the river basin, with values lower than 5 mm, while the highest errors are located along the border between the lower reaches of the Jinsha Jiang River and the river basin. This is related to the climate regimes of the Jinsha River basin, which includes more rainfall in the southern and south-eastern areas than in the north and north-west.
The results show that the WHU-SGCC method improves the correlation relative to CHIRP and CHIRPS, especially in the central and south-eastern river basin during the spring, autumn, and winter, with most of the PCC values falling between 0.4 and 0.8 (Fig. 5). As shown by the RMSE (Fig. 6), the WHU-SGCC can also correct the precipitation bias in the central and south-eastern river basin, especially along the downstream part of the Yalong River. In addition, the WHU-SGCC slightly improved the RMSE around the convergence of the rivers, where it is less than 5 mm in the spring and autumn, and most of the RMSE values are less than 1 mm in the winter. In spite of the correction, the RMSE values in the summer are still substantial.
All of the spatial distribution statistics indicate that the statistical relationships established during the process of the WHU-SGCC method are susceptible to the mode values of the rain gauge station data, especially in the summer. Although the average summer precipitation in the southern Jinsha River basin was more than 600 mm (Fig. 2), days of light rain still represent a large percentage, which causes large biases and limits the performance over the south, while there are sufficient data with similar precipitation features for the WHU-SGCC in the north. Nevertheless, the WHU-SGCC approach is still effective at adjusting the satellite biases by blending the data with the observations, particularly in the complicated mountainous regions, where higher PCCs correspond to lower RMSEs.
4.2 Model performance based on daily accuracy evaluations
After the overall accuracy evaluations were conducted, further evaluations of the daily accuracy in the four seasons were conducted, and the results are shown in Fig. 7. The evaluation of the daily accuracy indicates that the PCCs of the WHU-SGCC were slightly better than those of CHIRP and CHIRPS in the spring, autumn, and winter but were not as good in the summer and winter. The WHU-SGCC had lower RMSEs and MAEs than CHIRP and CHIRPS, especially compared to CHIRPS. The daily RMSE and MAE in the summer are the highest, although the WHU-SGCC still corrects the bias. Figure 7 indicates that there is a slight increase in the PCC, with average improvements of 0.0249–0.0405 and 0.0456–0.1355, respectively; however, the PCC is a relative metric of the magnitude of the association between paired variables, and a relative consistency may not indicate absolute proximity. Thus, the absolute measure indicated by the RMSE may be more reasonable. In this study, the RMSE and MAE derived from the WHU-SGCC are reduced by approximately 14.47 % and 33.87 % on average compared to CHIRP and CHIRPS, respectively. As for BIAS, WHU-SGCC method can correct the CHIRP precipitation bias in the spring, autumn, and winter, but the results are not as good compared with CHIRPS. The larger BIAS values and higher PCCs in the spring and autumn may be attributed to the seasonal variations, when the CHIRP is highly consistent with the observations but subject to large biases. After the correction, a substantial decrease in BIAS occurs in the winter, and there is no significant reduction in the summer; all of the median and average adjusted values are approximately 0. The WHU-SGCC method provides an obvious improvement in the NSE, with average improvements of 0.1742–13.8322 and 2.0131–14.7052 relative to CHIRP and CHIRPS, though the median and average values are still less than 0, which may be due to the inherent uncertainty in the CHIRP. Moreover, in terms of the POD, FAR, and CSI, except for the results in winter, the WHU-SGCC method appears to be better at detecting precipitation than CHIRP and CHIRPS; the results of POD and CSI are closest to 1, although FAR is worse than CHIRPS on some days. However, the overall result of FAR is the best in the WHU-SGCC. The POD and FAR results are the worst in the winter, and the CSI is slightly higher, which may be attributed to the overestimation of no-rain events and the inherent uncertainty in the CHIRP.
Overall, the WHU-SGCC approach can be regarded as an effective tool for daily precipitation adjustments.
4.3 Model performance in rain event predictions
To measure the WHU-SGCC performance in predicting rain events, daily precipitation thresholds of 0.1, 10, 25, and 50 mm were considered, and the results are shown in Tables 5 and 6. The average percentages of each class of rain event at the validation gauge station during the four seasons from 1990 to 2014 are shown in Table 5. The major rain events within the Jinsha River basin were no rain (< 0.1 mm) and light rain (0.1–10 mm), which accounted for more than 80 % of the total days (the average percentage of rain event days of the total days at each gauge station), while the number of days with daily precipitation greater than 50 mm was the smallest (no more than 1 % of the total days), and fewer than 5 % of the days had daily precipitation in the range of 25 to 50 mm. In the spring, autumn, and summer, significantly more no-rain days occurred than rainy days, and approximately 5 % of the days had daily precipitation of 10–50 mm. The seasonal distribution of rainfall was concentrated in the summer, and 54.76 %, 14.01 %, and 3.62 % of the days had daily precipitation of 0.1–10, 10–25, and 25–50 mm, respectively. The results indicated that the average daily precipitation was less than 10 mm throughout the years of the study.
The WHU-SGCC approach had lower errors than CHIRP and CHIRPS, as indicated by the RMSE, MAE, and BIAS, but the performance of WHU-SGCC is not promising for events with total rainfall greater than 25 mm in the summer (Fig. 8). This negative performance for total rainfall higher than 25 mm in the summer might be attributed to the overestimation of rainfall by CHIRP and CHIRPS. For the seasonal distribution of precipitation (Table 5), the average daily precipitation within the basin was less than 10 mm over the study period, which results in numerous rain gauge station data with values lower than 10 mm, which had a significant impact on the establishment of statistical relationships for the WHU-SGCC. Besides, the WHU-SGCC dataset has almost always a negative bias, while CHIRP and CHIRPS have a positive bias in the different rain events. After bias correction of the WHU-SGCC, some precipitation estimates are lower than observations. The estimates of extreme rain events might also be attenuated during the process of WHU-SGCC adjustment.
Besides, the POD and CSI results of CHIRPS are the worst, while the results of the WHU-SGCC are the highest, which indicate its superiority for the detection of precipitation events. As for the results of the WHU-SGCC, the assessments of POD and CSI are the best in the summer, followed by the autumn, spring, and winter, which are related to the seasonal rainfall pattern of more rain in the summer and less in the winter.
Therefore, the WHU-SGCC approach is applicable for the detection of rainfall events in the Jinsha River basin, while in the summer it is better, with rainfall less than or approximately equal to the average daily precipitation. Due to the homogenization of the WHU-SGCC method, its performance for short intense and extreme rain events was poorer than those of CHIRP and CHIRPS, which should be improved in a future study.
All the resulting datasets derived from the WHU-SGCC approach are available on PANGAEA, with the following: https://doi.org/10.1594/PANGAEA.905376 (Shen et al., 2019). The high-resolution (0.05∘) daily precipitation estimation data over the Jinsha River basin from 1990 to 2014 can be downloaded in TIFF format.
This study provides a novel approach, the WHU-SGCC method, for merging daily satellite-based precipitation estimates with observations. A case study of the Jinsha River basin was conducted to verify the effectiveness of the WHU-SGCC approach during all four seasons from 1990 to 2014, and the adjusted precipitation estimates were compared to CHIRP and CHIRPS. The WHU-SGCC method aims to reduce the bias and uncertainties in CHIRP data over regions with complicated mountainous terrain and sparse rain gauges. To the best of the authors' knowledge, this study is the first to use daily CHIRP and CHIRPS data in this area.
According to our findings, the following conclusions can be drawn. (1) The WHU-SGCC method is effective for the adjustment of precipitation biases from points to surfaces. The precipitation adjusted by the WHU-SGCC method can achieve greater accuracy compared with CHIRP and CHIRPS, with average improvements of Pearson's correlation coefficient (PCC) of 0.0082–0.2232 and 0.0612–0.3243, respectively. The PCCs were improved to more than 0.5 in the spring and autumn and to approximately 0.5 in the winter, and they were the worst in the summer, which may be attributed to the greater precipitation in the summer and lower precipitation in the winter. In addition, the NSE of the WHU-SGCC provides substantial improvements over CHIRP and CHIRPS, which reached 0.2836, 0.2944, and 0.1853 in the spring, autumn, and winter, respectively. In the summer, the NSE of the WHU-SGCC is still negative, but it is improved to be nearly 0, which indicates that the adjusted results are similar to the average level of the rain gauge observations. All of the measured errors were reduced except for the BIAS, which showed no significant improvement in the summer but was approximately 0. Overall, the WHU-SGCC approach achieves good performance in error correction of CHIRP and CHIRPS. (2) The spatial distribution of the precipitation estimate accuracy derived from the WHU-SGCC method is related to the topographic complexity. These errors over the lower-elevation regions and the large size of light precipitation events with short durations resulted in a limited improvement in accuracy, with PCC values less than 0.3. However, higher PCCs and lower errors were observed over the northern-central part of the river basin, which is a drier region with complex terrain and sparse rain gauges. The spatial distribution statistics indicate that the WHU-SGCC method is promising for the adjustment of satellite biases by blending with the observations over regions of complex terrain. (3) The leave-one-out cross validation of WHU-SGCC on daily rain events confirmed that the model is effective in the detection of precipitation events that are less than or approximately equal to the average annual precipitation in the Jinsha River basin. The WHU-SGCC approach achieves reductions of the RMSE, MAE, and BIAS metrics, while on rain events less than 25 mm in the summer. Specifically, the WHU-SGCC has the best ability to reduce precipitation bias for daily accuracy evaluations, with average reductions of 21.68 % and 31.44 % compared to CHIRP and CHIRPS, respectively. As for the results of the WHU-SGCC, the assessments of POD and CSI are the best in the summer, followed by the autumn, spring, and winter, which are related to the seasonal rainfall pattern of more rain in the summer and less in the winter. In spite of the corrections, the performance of the WHU-SGCC for short intense and extreme rain events was poorer than those of CHIRP and CHIRPS, and the biases in the precipitation forecasts in the summer are still large, which may be due to the homogenization attenuating the extreme rain event estimates.
In conclusion, the WHU-SGCC approach can help adjust the biases of daily satellite-based precipitation estimates over the Jinsha River basin, which contains complicated mountainous terrain with sparse rain gauges. This approach is a promising tool to monitor daily precipitation over the Jinsha River basin, considering the spatial correlation and historical precipitation characteristics between raster pixels in regions with similar topographic features. Future development of the WHU-SGCC approach will focus on the following three aspects: (1) the improvement of the adjusted precipitation quality to better monitor extreme rainfall events by blending multiple data sources for different rain events; (2) the introduction of more climatic factors and multi-model ensembles to achieve more accurate spatial distributions of precipitation; and (3) investigations of the performance over other areas and for particular hydrological cases to validate the applicability of the WHU-SGCC approach.
The station identification numbers and relevant geographical characteristics are shown in Appendix Table A1.
The multi-annual land cover types in the Jinsha River basin from 2001 to 2013 are shown in Fig. B1. All of the land cover type maps were derived from the MODIS/Terra+Aqua Land Cover Type Yearly L3 Global 500 m SIN Grid V051 dataset, which is available online at https://search.earthdata.nasa.gov/search/granules?p=C200106111-LPDAAC_ECS&q=MCD12&ok=MCD12 (last access: 23 July 2019). Figure B1 shows that the land use had no obvious changes over the study period. In addition, the upstream area of the Jinsha River is an untraversed region that has not been affected significantly by human activities. Thus, the land use in the study area has hardly changed.
This Appendix shows how to set the number of clusters in the FCM method.
To adjust the pixels other than those of the gauge stations, the pixels that are statistically similar to the C1 pixels were selected. According to Rule 2, the C2 pixels were identified in a spatial area defined by the FCM method. In the following experiments of Rule 2, we set the parameters , and the maximum number of iterations was set to 1000 (a sufficiently large value considering the algorithm efficiency). To determine the optimal numbers of clusters, the value of c was varied from 1 to 30 with an increment of 1. The values of L(c) during the running of the FCM are shown in Fig. E1. The optimum number of clusters was 22, and the number of iterations was 690 less than the maximum number of iterations.
Therefore, the number of clusters was set to 22, and the number of iterations was set to 1000 for full operation by means of the FCM. The spatial clustering results considering the terrain factors are shown in Fig. E2. In general, the spatial results of the FCM have many of the same characteristics as the areas defined by the terrain variations, especially with respect to the slope and runoff directions, which may influence the regional rainfall.
All the authors contributed extensively to the work presented in this paper. SG, CZ, and CN conceived of and designed the research. SG, CZ, and CN developed the approach, and SG wrote the WHU-SGCC code and performed most of the computations with WW. SG and CZ drafted and coordinated the work on the paper and revised the paper. CZ and CN contributed to the analysis of the results and writing of the paper. CZ, CN, and WW acquired project funds and supervised the project.
The authors declare that they have no conflict of interest.
The authors would like to thank data support from the Climate Hazards Group at the University of California, Santa Barbara, for providing CHIRP and CHIRPS datasets (http://chg.ucsb.edu/data/, last access: 10 July 2019), and the National Climate Center (NCC) of the China Meteorological Administration (CMA) for providing the daily rain gauged observations (http://data.cma.cn/, last access: 10 December 2018). The authors also thank the PANGAEA Data Publisher for Earth & Environmental Science platform for providing the storage to disseminate the data generated in this experiment.
The authors are grateful to the editor and anonymous reviewers for their useful suggestions that clearly improved this paper.
This research was supported by the National Key R&D program (grant no. 2018YFB2100500), National Natural Science Foundation of China programme (grant nos. 41890822, 41771422, and 41971351) and Creative Research Groups of Natural Science Foundation of Hubei Province of China (grant no. 2016CFA003).
This paper was edited by Giulio G. R. Iovine and reviewed by Yaping Zhou and six anonymous referees.
AghaKouchak, A., Behrangi, A., Sorooshian, S., Hsu, K., and Amitai, E.: Evaluation of satellite-retrieved extreme precipitation rates across the central United States, J. Geophys. Res.-Atmos., 116, D02115, https://doi.org/10.1029/2010jd014741, 2011.
Agutu, N. O., Awange, J. L., Zerihun, A., Ndehedehe, C. E., Kuhn, M., and Fukuda, Y.: Assessing multi-satellite remote sensing, reanalysis, and land surface models' products in characterizing agricultural drought in East Africa, Remote Sens. Environ., 194, 287–302, https://doi.org/10.1016/j.rse.2017.03.041, 2017.
Ali, H. and Mishra, V.: Contrasting response of rainfall extremes to increase in surface air and dewpoint temperatures at urban locations in India, Sci. Rep.-UK, 7, 1228, https://doi.org/10.1038/s41598-017-01306-1, 2017.
Anders, A. M., Roe, G. H., Hallet, B., Montgomery, D. R., Finnegan, N. J., and Putkonen, J.: Spatial patterns of precipitation and topography in the Himalaya, Tectonics, Climate, and Landscape Evolution, 398, 39–53, https://doi.org/10.1130/2006.2398(03), 2006.
Aonashi, K., Awaka, J., Hirose, M., Kozu, T., Kubota, T., Liu, G., Shige, S., Kida, S., Seto, S., Takahashi, N., and Takayabu, Y. N.: GSMaP Passive Microwave Precipitation Retrieval Algorithm: Algorithm Description and Validation, J. Meteorol. Soc. Jpn., 87, 119–136, https://doi.org/10.2151/jmsj.87A.119, 2009.
Ashouri, H., Hsu, K.-L., Sorooshian, S., Braithwaite, D. K., Knapp, K. R., Cecil, L. D., Nelson, B. R., and Prat, O. P.: PERSIANN-CDR: Daily Precipitation Climate Data Record from Multisatellite Observations for Hydrological and Climate Studies, B. Am. Meteorol. Soc., 96, 69–83, https://doi.org/10.1175/bams-d-13-00068.1, 2015.
Bai, L., Shi, C. X., Li, L. H., Yang, Y. F., and Wu, J.: Accuracy of CHIRPS Satellite-Rainfall Products over Mainland China, Remote Sens., 10, 362, https://doi.org/10.3390/rs10030362, 2018.
Beck, H. E., Vergopolan, N., Pan, M., Levizzani, V., van Dijk, A. I. J. M., Weedon, G. P., Brocca, L., Pappenberger, F., Huffman, G. J., and Wood, E. F.: Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling, Hydrol. Earth Syst. Sci., 21, 6201–6217, https://doi.org/10.5194/hess-21-6201-2017, 2017.
Beck, H. E., Wood, E. F., Pan, M., Fisher, C. K., Miralles, D. G., van Dijk, A. I. J. M., McVicar, T. R., and Adler, R. F.: MSWEP V2 Global 3-Hourly 0.1 degrees Precipitation: Methodology and Quantitative Assessment, B. Am. Meteorol. Soc., 100, 473–502, https://doi.org/10.1175/bams-d-17-0138.1, 2019.
Behrangi, A., Andreadis, K., Fisher, J. B., Turk, F. J., Granger, S., Painter, T., and Das, N.: Satellite-Based Precipitation Estimation and Its Application for Streamflow Prediction over Mountainous Western US Basins, J. Appl. Meteorol. Clim., 53, 2823–2842, https://doi.org/10.1175/jamc-d-14-0056.1, 2014.
Berndt, C., Rabiei, E., and Haberlandt, U.: Geostatistical merging of rain gauge and radar data for high temporal resolutions and various station density scenarios, J. Hydrol., 508, 88–101, https://doi.org/10.1016/j.jhydrol.2013.10.028, 2014.
Cattani, E., Merino, A., Guijarro, J. A., and Levizzani, V.: East Africa Rainfall Trends and Variability 1983-2015 Using Three Long-Term Satellite Products, Remote Sens., 10, 931, https://doi.org/10.3390/rs10060931, 2018.
Chen, F. W. and Liu, C. W.: Estimation of the spatial rainfall distribution using inverse distance weighting (IDW) in the middle of Taiwan, Paddy Water Environ., 10, 209–222, https://doi.org/10.1007/s10333-012-0319-1, 2012.
Chen, J., Brissette, F. P., Chaumont, D., and Braun, M.: Finding appropriate bias correction methods in downscaling precipitation for hydrologic impact studies over North America, Water Resour. Res., 49, 4187–4205, https://doi.org/10.1002/wrcr.20331, 2013.
Derin, Y., Anagnostou, E., Berne, A., Borga, M., Boudevillain, B., Buytaert, W., Chang, C.-H., Delrieu, G., Hong, Y., Hsu, Y. C., Lavado-Casimiro, W., Manz, B., Moges, S., Nikolopoulos, E. I., Sahlu, D., Salerno, F., Rodriguez-Sanchez, J.-P., Vergara, H. J., and Yilmaz, K. K.: Multiregional Satellite Precipitation Products Evaluation over Complex Terrain, J. Hydrometeorol., 17, 1817–1836, https://doi.org/10.1175/jhm-d-15-0197.1, 2016.
Duan, Z., Liu, J. Z., Tuo, Y., Chiogna, G., and Disse, M.: Evaluation of eight high spatial resolution gridded precipitation products in Adige Basin (Italy) at multiple temporal and spatial scales, Sci. Total Environ., 573, 1536–1553, https://doi.org/10.1016/j.scitotenv.2016.08.213, 2016.
Dunn, J. C.: A fuzzy relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters, J. Cybernetics, 3, 32–57, 1973.
Durre, I., Menne, M. J., Gleason, B. E., Houston, T. G., and Vose, R. S.: Comprehensive Automated Quality Assurance of Daily Surface Observations, J. Appl. Meteorol. Clim., 49, 1615–1633, https://doi.org/10.1175/2010jamc2375.1, 2010.
Funk, C., Peterson, P., Landsfeld, M., Pedreros, D., Verdin, J., Rowland, J., Bo, E., Husak, G. J., Michaelsen, J. C., and Verdin, A. P.: A Quasi-Global Precipitation Time Series for Drought Monitoring Data Series 832, Usgs Professional Paper, Data Series, 2014.
Funk, C., Peterson, P., Landsfeld, M., Pedreros, D., Verdin, J., Shukla, S., Husak, G., Rowland, J., Harrison, L., Hoell, A., and Michaelsen, J.: The climate hazards infrared precipitation with stations-a new environmental record for monitoring extremes, Sci. Data, 2, 150066, https://doi.org/10.1038/sdata.2015.66, 2015a.
Funk, C., Verdin, A., Michaelsen, J., Peterson, P., Pedreros, D., and Husak, G.: A global satellite-assisted precipitation climatology, Earth Syst. Sci. Data, 7, 275–287, https://doi.org/10.5194/essd-7-275-2015, 2015b.
Genuer, R., Poggi, J. M., Tuleau-Malot, C., and Villa-Vialaneix, N.: Random Forests for Big Data, Big Data Res., 9, 28–46, https://doi.org/10.1016/j.bdr.2017.07.003, 2017.
Hou, A. Y., Kakar, R. K., Neeck, S., Azarbarzin, A. A., Kummerow, C. D., Kojima, M., Oki, R., Nakamura, K., and Iguchi, T.: The Global Precipitation Measurement Mission, B. Am. Meteorol. Soc., 95, 701–722, https://doi.org/10.1175/bams-d-13-00164.1, 2014.
Huffman, G. J., Adler, R. F., Bolvin, D. T., Gu, G., Nelkin, E. J., Bowman, K. P., Hong, Y., Stocker, E. F., and Wolff, D. B.: The TRMM multisatellite precipitation analysis (TMPA): Quasi-global, multiyear, combined-sensor precipitation estimates at fine scales, J. Hydrometeorol., 8, 38–55, https://doi.org/10.1175/jhm560.1, 2007.
Huffman, G. J., Adler, R. F., Bolvin, D. T., and Nelkin, E. J.: The TRMM Multi-Satellite Precipitation Analysis (TMPA), in: Satellite Rainfall Applications for Surface Hydrology, edited by: Gebremichael, M. and Hossain, F., Springer Netherlands, Dordrecht, 3–22, 2010.
Huffman, G. J., Bolvin, D. T., Braithwaite, D., Hsu, K., Joyce, R., and Xie, P.: NASA Global Precipitation Measurement Integrated Multi-satellitE Retrievals for GPM (IMERG), NASA Algorithm theoretical basis document (ATBD) version 5.2, 35 pp., available at: https://pmm.nasa.gov/sites/default/files/document_files/IMERG_ATBD_V5.2.pdf (last access: 18 July 2019), 2018.
Johnson, R. W.: An Introduction to the Bootstrap, Chapman & Hall/CRC Press, 49–54, 1998.
Joyce, R. J. and Xie, P.: Kalman Filter-Based CMORPH, J. Hydrometeorol., 12, 1547–1563, https://doi.org/10.1175/jhm-d-11-022.1, 2011.
Joyce, R. J., Janowiak, J. E., Arkin, P. A., and Xie, P. P.: CMORPH: A method that produces global precipitation estimates from passive microwave and infrared data at high spatial and temporal resolution, J. Hydrometeorol., 5, 487–503, https://doi.org/10.1175/1525-7541(2004)005<0487:camtpg>2.0.co;2, 2004.
Katsanos, D., Retalis, A., and Michaelides, S.: Validation of a high-resolution precipitation database (CHIRPS) over Cyprus for a 30-year period, Atmos. Res., 169, 459–464, https://doi.org/10.1016/j.atmosres.2015.05.015, 2016a.
Katsanos, D., Retalis, A., Tymvios, F., and Michaelides, S.: Analysis of precipitation extremes based on satellite (CHIRPS) and in situ dataset over Cyprus, Nat. Hazards, 83, 53–63, https://doi.org/10.1007/s11069-016-2335-8, 2016b.
Kummerow, C., Barnes, W., Kozu, T., Shiue, J., and Simpson, J.: The Tropical Rainfall Measuring Mission (TRMM) sensor package, J. Atmos. Ocean. Tech., 15, 809–817, https://doi.org/10.1175/1520-0426(1998)015<0809:ttrmmt>2.0.co;2, 1998.
Long, D. and Singh, V. P.: Assessing the impact of end- member selection on the accuracy of satellite-based spatial variability models for actual evapotranspiration estimation, Water Resour. Res., 49, 2601–2618, https://doi.org/10.1002/wrcr.20208, 2013.
Lu, G. Y. and Wong, D. W.: An adaptive inverse-distance weighting spatial interpolation technique, Comput. Geosci., 34, 1044–1055, https://doi.org/10.1016/j.cageo.2007.07.010, 2008.
Maggioni, V. and Massari, C.: on the performance of satellite precipitation products in riverine flood modeling: A review, J. Hydrol., 558, 214–224, https://doi.org/10.1016/j.jhydrol.2018.01.039, 2018.
Maggioni, V., Meyers, P. C., and Robinson, M. D.: A Review of Merged High-Resolution Satellite Precipitation Product Accuracy during the Tropical Rainfall Measuring Mission (TRMM) Era, J. Hydrometeorol., 17, 1101–1117, https://doi.org/10.1175/jhm-d-15-0190.1, 2016.
Mahmoud, M. T., Al-Zahrani, M. A., and Sharif, H. O.: Assessment of global precipitation measurement satellite products over Saudi Arabia, J. Hydrol., 559, 1–12, https://doi.org/10.1016/j.jhydrol.2018.02.015, 2018.
Martens, B., Cabus, P., De Jongh, I., and Verhoest, N. E. C.: Merging weather radar observations with ground-based measurements of rainfall using an adaptive multiquadric surface fitting algorithm, J. Hydrol., 500, 84–96, https://doi.org/10.1016/j.jhydrol.2013.07.011, 2013.
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual models, Part I – A discussion of principles, J. Hydrol., 10, 282–290, https://doi.org/10.1016/0022-1694(70)90255-6, 1970.
Ning, S., Wang, J., Jin, J., and Ishidaira, H.: Assessment of the Latest GPM-Era High-Resolution Satellite Precipitation Products by Comparison with Observation Gauge Data over the Chinese Mainland, Water, 8, 481–497, https://doi.org/10.3390/w8110481, 2016.
Nogueira, S. M. C., Moreira, M. A., and Volpato, M. M. L.: Evaluating Precipitation Estimates from Eta, TRMM and CHRIPS Data in the South-Southeast Region of Minas Gerais State-Brazil, Remote Sens., 10, 313, https://doi.org/10.3390/rs10020313, 2018.
Paredes-Trejo, F. J., Barbosa, H. A., and Kumar, T. V. L.: Validating CHIRPS-based satellite precipitation estimates in Northeast Brazil, J. Arid. Environ., 139, 26–40, https://doi.org/10.1016/j.jaridenv.2016.12.009, 2017.
Pessoa, F. C. L., Blanco, C. J. C., and Gomes, E. P.: Delineation of homogeneous regions for streamflow via fuzzy c-means in the Amazon, Water Pract. Technol., 13, 210–218, https://doi.org/10.2166/wpt.2018.035, 2018.
Prakash, S.: Performance assessment of CHIRPS, MSWEP, SM2RAIN-CCI, and TMPA precipitation products across India, J. Hydrol., 571, 50–59, https://doi.org/10.1016/j.jhydrol.2019.01.036, 2019.
Rivera, J. A., Marianetti, G., and Hinrichs, S.: Validation of CHIRPS precipitation dataset along the Central Andes of Argentina, Atmos. Res., 213, 437–449, https://doi.org/10.1016/j.atmosres.2018.06.023, 2018.
Roy, T., Gupta, H. V., Serrat-Capdevila, A., and Valdes, J. B.: Using satellite-based evapotranspiration estimates to improve the structure of a simple conceptual rainfall–runoff model, Hydrol. Earth Syst. Sci., 21, 879–896, https://doi.org/10.5194/hess-21-879-2017, 2017.
Shen, G. Y., Chen, N. C., Wang, W., and Chen, Z. Q.: Improving the Climate Hazards Group Infrared Precipitation (CHIRP) using WHU-SGCC method over the Jinsha River Basin from 1990 to 2014, PANGAEA, https://doi.org/10.1594/PANGAEA.905376, 2019.
Simpson, J., Adler, R. F., and North, G. R.: A PROPOSED TROPICAL RAINFALL MEASURING MISSION (TRMM) SATELLITE, B. Am. Meteorol. Soc., 69, 278–295, https://doi.org/10.1175/1520-0477(1988)069<0278:aptrmm>2.0.co;2, 1988.
Simanton, J. R. and Osborn, H. B.: RECIPROCAL-DISTANCE ESTIMATE OF POINT RAINFALL, J. Hydraul. Eng. Div.-ASCE, 106, 1242–1246, 1980.
Skofronick-Jackson, G., Petersen, W. A., Berg, W., Kidd, C., Stocker, E. F., Kirschbaum, D. B., Kakar, R., Braun, S. A., Huffman, G. J., Iguchi, T., Kirstetter, P. E., Kummerow, C., Meneghini, R., Oki, R., Olson, W. S., Takayabu, Y. N., Furukawa, K., and Wilheit, T.: The Global Precipitation Measurement (GPM) Mission for Science and Society, B. Am. Meteorol. Soc., 98, 1679–1695, https://doi.org/10.1175/bams-d-15-00306.1, 2017.
Sokol, Z.: The use of radar and gauge measurements to estimate areal precipitation for several Czech River basins, Stud. Geophys. Geod., 47, 587–604, https://doi.org/10.1023/a:1024715702575, 2003.
Su, F. G., Gao, H. L., Huffman, G. J., and Lettenmaier, D. P.: Potential Utility of the Real-Time TMPA-RT Precipitation Estimates in Streamflow Prediction, J. Hydrometeorol., 12, 444–455, https://doi.org/10.1175/2010jhm1353.1, 2011.
Thiemig, V., Rojas, R., Zambrano-Bigiarini, M., and De Roo, A.: Hydrological evaluation of satellite-based rainfall estimates over the Volta and Baro-Akobo Basin, J. Hydrol., 499, 324–338, https://doi.org/10.1016/j.jhydrol.2013.07.012, 2013.
Trejo, F. J. P., Barbosa, H. A., Penaloza-Murillo, M. A., Moreno, M. A., and Farias, A.: Intercomparison of improved satellite rainfall estimation with CHIRPS gridded product and rain gauge data over Venezuela, Atmosfera, 29, 323–342, https://doi.org/10.20937/atm.2016.29.04.04, 2016.
Tung, Y. K.: Point Rainfall Estimation for a Mountainous Region, J. Hydraul. Eng.-ASCE, 109, 1386–1393, https://doi.org/10.1061/(asce)0733-9429(1983)109:10(1386), 1983.
Ushio, T. and Kachi, M.: Kalman filtering applications for global satellite mapping of precipitation (GSMaP), in: Satellite rainfall applications for surface hydrology, Springer, 105–123, 2010.
Ushio, T., Sasashige, K., Kubota, T., Shige, S., Okamoto, K. i., Aonashi, K., Inoue, T., Takahashi, N., Iguchi, T., Kachi, M., Oki, R., Morimoto, T., and Kawasaki, Z.-I.: A Kalman Filter Approach to the Global Satellite Mapping of Precipitation (GSMaP) from Combined Passive Microwave and Infrared Radiometric Data, J. Meteorol. Soc. Jpn., 87, 137–151, https://doi.org/10.2151/jmsj.87A.137, 2009.
Verdin, A., Rajagopalan, B., Kleiber, W., and Funk, C.: A Bayesian kriging approach for blending satellite and ground precipitation observations, Water Resour. Res., 51, 908–921, 2015.
Vila, D. A., de Goncalves, L. G. G., Toll, D. L., and Rozante, J. R.: Statistical Evaluation of Combined Daily Gauge Observations and Rainfall Satellite Estimates over Continental South America, J. Hydrometeorol., 10, 533–543, https://doi.org/10.1175/2008jhm1048.1, 2009.
Wang, P. H.: Pattern Recognition with Fuzzy Objective Function Algorithms (James C. Bezdek), SIAM Rev., 25, 442–442, 1983.
Xie, P., Joyce, R., Wu, S., Yoo, S.-H., Yarosh, Y., Sun, F., and Lin, R.: Reprocessed, Bias-Corrected CMORPH Global High-Resolution Precipitation Estimates from 1998, J. Hydrometeorol., 18, 1617–1641, https://doi.org/10.1175/jhm-d-16-0168.1, 2017.
Xu, L., Chen, N., Zhang, X., Chen, Z., Hu, C., and Wang, C.: Improving the North American multi-model ensemble (NMME) precipitation forecasts at local areas using wavelet and machine learning, Clim. Dynam., 53, 601–615, https://doi.org/10.1007/s00382-018-04605-z, 2019.
Yang, T. T., Asanjan, A. A., Welles, E., Gao, X. G., Sorooshian, S., and Liu, X. M.: Developing reservoir monthly inflow forecasts using artificial intelligence and climate phenomenon information, Water Resour. Res., 53, 2786–2812, https://doi.org/10.1002/2017wr020482, 2017.
Yang, Z., Hsu, K., Sorooshian, S., Xu, X., Braithwaite, D., and Verbist, K. M. J.: Bias adjustment of satellite-based precipitation estimation using gauge observations: A case study in Chile, J. Geophys. Res.-Atmos., 121, 3790–3806, https://doi.org/10.1002/2015jd024540, 2016.
Yuan, Z., Xu, J. J., and Wang, Y. Q.: Projection of Future Extreme Precipitation and Flood Changes of the Jinsha River Basin in China Based on CMIP5 Climate Models, Int. J. Environ. Res. Pu., 15, 2491, https://doi.org/10.3390/ijerph15112491, 2018.
Zambrano-Bigiarini, M., Nauditt, A., Birkel, C., Verbist, K., and Ribbe, L.: Temporal and spatial evaluation of satellite-based rainfall estimates across the complex topographical and climatic gradients of Chile, Hydrol. Earth Syst. Sci., 21, 1295–1320, https://doi.org/10.5194/hess-21-1295-2017, 2017.
Zhang, X. and Chen, N. C.: Reconstruction of GF-1 Soil Moisture Observation Based on Satellite and In Situ Sensor Collaboration Under Full Cloud Contamination, IEEE T. Geosci. Remote, 54, 5185–5202, https://doi.org/10.1109/tgrs.2016.2558109, 2016.
Zhang, Y. R., Sun, A., Sun, H. W., Gui, D. W., Xue, J., Liao, W. H., Yan, D., Zhao, N., and Zeng, X. F.: Error adjustment of TMPA satellite precipitation estimates and assessment of their hydrological utility in the middle and upper Yangtze River Basin, China, Atmos. Res., 216, 52–64, https://doi.org/10.1016/j.atmosres.2018.09.021, 2019.
- Abstract
- Introduction
- Study region and data
- Methods
- Results and discussion
- Data availability
- Conclusions
- Appendix A: Geographical characteristics of rain stations
- Appendix B: Multi-annual land cover type
- Appendix C: Selection of decision trees for random forest regression
- Appendix D: Spatial distribution of C1, C2, and C3 pixels
- Appendix E: Spatial clustering from the FCM method
- Author contributions
- Competing interests
- Acknowledgements
- Financial support
- Review statement
- References
- Abstract
- Introduction
- Study region and data
- Methods
- Results and discussion
- Data availability
- Conclusions
- Appendix A: Geographical characteristics of rain stations
- Appendix B: Multi-annual land cover type
- Appendix C: Selection of decision trees for random forest regression
- Appendix D: Spatial distribution of C1, C2, and C3 pixels
- Appendix E: Spatial clustering from the FCM method
- Author contributions
- Competing interests
- Acknowledgements
- Financial support
- Review statement
- References