A full-coverage daily XCO2 dataset in China from 2015 to 2020 based on DSC-DF-LGB
Abstract. Carbon dioxide (CO2), as a major greenhouse gas, is one of the important causes of global warming. In recent years, the atmospheric CO2 concentration in China has been increasing year by year. Satellite observation is the main means of obtaining atmospheric CO2 concentration. However, the current onboard sensors used for measuring atmospheric CO2 have a narrow observation range and cannot obtain spatiotemporal continuous atmospheric CO2 concentrations. Therefore, this paper proposes a daily full-coverage XCO2 dataset generation method based on the DSC-DF-LGB (Deep Separable Convolutional Neural Network and Deep Forest concatenated with LightGBM) model to obtain the spatiotemporal distribution of atmospheric CO2 in China. The DSC-DF-LGB model was established to train the mapping relationship between OCO-2 XCO2 retrieval and related variables (reanalysis XCO2, vegetation parameters, human factors, elevation, and meteorological parameters). The model was used to generate a daily 0.1° full-coverage XCO2 dataset for China from 2015 to 2020. The cross validation (CV) result indicates that the model has strong performance in estimating XCO2, with R2 and RMSE of 0.9633 and 0.9761 ppm. The TCCON independent site validation result indicates that the estimated XCO2 has high consistency with in-situ measurements, with R2 and RMSE of 0.8786 and 1.5452 ppm. The full-coverage and high-resolution XCO2 dataset can provide data support for research on carbon sources and sinks. The dataset is available at https://zenodo.org/doi/10.5281/zenodo.12696674 (Huang, 2024).