A vegetation phenology dataset by integrating multiple sources using the Reliability Ensemble Averaging method
Abstract. Global change has substantially shifted vegetation phenology, with important implications in the carbon and water cycles of terrestrial ecosystems. Various vegetation phenology datasets have been developed using remote sensing data; however, the significant uncertainties in these datasets limit our understanding of ecosystem dynamics in terms of phenology. It is therefore crucial to generate a reliable large-scale vegetation phenology dataset, by fusing various existing vegetation phenology datasets, to provide comprehensive and accurate estimation of vegetation phenology with fine spatiotemporal resolution. In this study, we merged four widely used vegetation phenology datasets to generate a new dataset using the Reliability Ensemble Averaging fusion method. The spatial resolution of the new dataset is 0.05° and its temporal scale spans 1982–2022. The evaluation using the ground-based PhenoCam dataset from 280 sites indicated that the accuracy of the newly merged dataset was improved substantially. The start of growing season and the end of growing season in the newly merged dataset had the largest correlation (0.84 and 0.71, respectively) and accuracy in terms of the root mean square error (12 and 17 d, respectively). Using the new dataset, we found that the start of growing season exhibits a significant (p < 0.01) advanced trend with a rate of approximately 0.24 d yr−1, and that the end of growing season exhibits a significant (p < 0.01) delayed trend with a rate of 0.16 d yr−1 over the period 1982–2022. This dataset offers a unique and novel source of vegetation phenology data for global ecology studies.