the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Reconstructing two-decadal global daily high-resolution XCO2 records using a hybrid Transformer–BiLSTM model
Abstract. Accurate and temporally continuous global observations of atmospheric carbon dioxide (XCO2) are essential for climate monitoring and emission assessment. However, satellite-based XCO2 observations are often spatially incomplete and temporally discontinuous, while existing products typically suffer from coarse spatial resolutions, hindering the detection of fine-scale emission changes. Here, we developed a novel spatiotemporal Transformer-BiLSTM deep-learning network, which integrates the local temporal feature extraction capability of bidirectional long short-term memory with the global spatial dependency modeling strength of Transformer via self-attention mechanisms. The network assimilates multisource data, from satellite observations, meteorological reanalysis, and precursor gases, to reconstruct global, daily, and seamless XCO2 at 0.1° resolution from 2003 to 2022. Validation against independent Total Carbon Column Observing Network (TCCON) measurements shows excellent agreement, with a correlation coefficient (R2) value of 0.99, a root mean square error (RMSE) of 1.10 ppm, and a mean bias of 0.01 ppm. A subsequent bias correction scheme further improves cross-satellite consistency, achieving a cross-validation coefficient of determination (CV-R2) of 0.99 and an RMSE of 0.97 ppm. Our dataset enables accurate characterization of daily XCO2 concentrations over global land surfaces, facilitating the detection of spatial heterogeneity associated with emission hotspots and point-source activities. The record reveals a persistent global increase in atmospheric XCO2 over the past two decades, with a mean growth rate of 2.24 ppm/yr (p < 0.001). It reliably resolves global XCO2 variability across a wide range of temporal scales, from day-to-day fluctuations to long-term trends. It consistently captures large-scale climate-driven signals, such as ENSO-related interannual variability, and short-lived XCO2 enhancements associated with major wildfire events, demonstrating its capability to represent both persistent and episodic emission signals. This high-resolution, daily global XCO2 product (GlobalHighXCO2) provides a valuable benchmark for carbon cycle studies, atmospheric model evaluation, and emission monitoring, and is publicly available at https://doi.org/10.5281/zenodo.18220961 (Qu and Wei, 2026).
Competing interests: At least one of the (co-)authors is a member of the editorial board of Earth System Science Data.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.- Preprint
(7602 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on essd-2026-50', Anonymous Referee #1, 09 Mar 2026
-
AC1: 'Reply on RC1', Jing Wei, 14 May 2026
The comment was uploaded in the form of a supplement: https://essd.copernicus.org/preprints/essd-2026-50/essd-2026-50-AC1-supplement.pdf
-
AC1: 'Reply on RC1', Jing Wei, 14 May 2026
-
RC2: 'Comment on essd-2026-50', Anonymous Referee #2, 16 Apr 2026
Qu et al. presents a timely and potentially valuable high-resolution daily XCO2 dataset by combining multi-mission satellite retrievals with a hybrid Transformer–BiLSTM framework, followed by a bias-correction step. The topic is well suited to ESSD, and the dataset has clear potential applications in carbon-cycle analysis, emission monitoring, and climate studies. Overall, the manuscript is well written, and I have several suggestions to further improve the study.
Specific comments:
The spatial scope of the dataset should be described more consistently. The title and several sections of the manuscript give the impression of a fully global XCO2 product, whereas the abstract states that the dataset characterizes XCO2 “over global land surfaces.” This scope should be made consistent across the title, abstract, main text, and dataset description.
The description of the Bi-LSTM architecture needs clarification. The statement that the model “consists of 64 hidden layers with a dimension of 128” is difficult to interpret and seems unlikely in its current form. It would be helpful to clarify whether this refers to hidden units, hidden states, or stacked layers.
To enhance readability, Figure 5 should include a corresponding legend.
Several language and wording issues should be refined. For example, “till 2012” should be revised to “until 2012,” and “Electric power plant in Algerian” should be corrected to “in Algeria.” A careful language edit would improve overall readability.
The ENSO discussion contains a minor numerical inconsistency. A reported correlation of R = 0.55 corresponds to approximately 30% explained variance rather than ~34%. Please check the correlation coefficient or revise the corresponding statement accordingly.
The performance metrics reported in the abstract and main text would benefit from clearer interpretation. Different statistics are presented for independent TCCON validation, bias-corrected results, and various cross-validation strategies, but the abstract may give the impression that all values correspond to a single evaluation setting. It would be helpful to explicitly indicate which metric corresponds to which validation framework.
In the caption for Figure 7, the years are listed as “May 30, 2009, 2013, 2020.” For grammatical correctness, this should be revised to “May 30, 2009, 2013, and 2020.”
The seasonal averages should be presented in a consistent format, for example (MAM; average = 395.36 ± 2.20 ppm).
The terminology used for the reconstructed product should be standardized. Terms such as “seamless,” “gap-free,” and “gapless” are used interchangeably; selecting one consistent term would improve clarity.
Some minor grammatical issues should also be corrected, for example “Sample-based CV show” should be revised to “Sample-based CV shows.”
The model name should be used consistently throughout the manuscript. In most sections it is referred to as “Transformer–BiLSTM,” whereas in the XAI section it appears as “4D-STransformer-BiLSTM,” which may confuse readers.
It would also be helpful to clarify whether coarser-resolution products were regridded prior to comparison in Figure 7.
The interpretation of localized XCO2 hotspots should be phrased more cautiously. Some descriptions appear overly definitive for a column-integrated quantity that can also be influenced by atmospheric transport and meteorological conditions. Expressions such as “consistent with localized enhancements associated with…” would be more appropriate unless supported by stronger independent evidence.
The formatting of several references should be carefully checked, as there are minor issues such as missing spaces or compressed publisher/year formatting.
In Table 2, it would be beneficial to include comparisons with the most recent relevant studies to better position the dataset within the current literature.
Finally, the names of datasets and networks should be presented consistently. For example, “ObsPack” and “Obspack” appear in different places and should be unified.
Citation: https://doi.org/10.5194/essd-2026-50-RC2 -
AC2: 'Reply on RC2', Jing Wei, 14 May 2026
The comment was uploaded in the form of a supplement: https://essd.copernicus.org/preprints/essd-2026-50/essd-2026-50-AC2-supplement.pdf
-
AC2: 'Reply on RC2', Jing Wei, 14 May 2026
Data sets
Global daily high-resolution XCO2 product (GlobalHighXCO2) Yu Qu and Jing Wei https://doi.org/10.5281/zenodo.18220961
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 385 | 318 | 30 | 733 | 39 | 39 |
- HTML: 385
- PDF: 318
- XML: 30
- Total: 733
- BibTeX: 39
- EndNote: 39
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
The authors reconstructed two-decade global daily high-resolution XCO2 data based on a hybrid Transformer–BiLSTM model. However, the topic is usual, and the method should be more innovative in this study, which requires large improvements. Specific comments are as follows:
Major comments:
Minor comments:
References:
[1] Wang, J. (2026). Global daily 1 km gapless XCO₂ (2003− 2023) derived from multi-satellite observations and a spatiotemporal deep learning framework. Environmental Impact Assessment Review, 117, 108146.
[2] Wang, Z., Zhang, C., Shi, K., Shangguan, Y., Hu, B., Chen, X., ... & Zhang, Q. (2025). A full-coverage satellite-based global atmospheric CO 2 dataset at 0.05° resolution from 2015 to 2021 for exploring global carbon dynamics. Earth System Science Data, 17(10), 5355-5375.
[3] Li, J., Zhang, Z., Li, T., Yuan, Q., & Zhang, L. (2026). Global daily seamless XCO2 Mapping (2016–2020): Spatio-temporal trends and variations during wildfire events. International Journal of Applied Earth Observation and Geoinformation, 146, 105092.
[4] Hwang, S., Choi, H., Kang, Y., & Im, J. (2026). Reconstructing long-term (2003–2019) global high-resolution XCO2: bridging observational gaps with machine learning. GIScience & Remote Sensing, 63(1), 2627042.
[5] Yu, Y., Tian, W., Zhang, L., Yu, T., Wu, Y., & Cheng, T. (2026). MCF-XCO2: A cross-mission consistency and fusion framework for integrating multi-satellite XCO2 observations. Atmospheric Research, 108747.