https://doi.org/10.5194/essd-2025-286
28 May 2025
Status: this preprint is currently under review for the journal ESSD.

A surface ocean pCO2 product with improved representation of interannual variability using a vision transformer-based model

Xueying Zhang, Enhui Liao, Wenfang Lu, Zelun Wu, Guansuo Wang, Xueming Zhu, and Shiyu Liang

Abstract. The ocean plays a crucial role in regulating the global carbon cycle and mitigating climate change, and the spatial distribution and temporal variations of the ocean surface partial pressure of CO2 (spCO2) directly determine the air-sea CO2 flux. However, constructing a global spCO2 data product that resolves interannual and decadal variability remains a challenge because observational data are spatially sparse and temporally discontinuous. This study presents an approach based on the Vision Transformer (ViT) model that combines high-quality observational data from the Surface Ocean CO2 Atlas (SOCAT) with results from multiple advanced global ocean biogeochemical models to reconstruct a global monthly spCO2 dataset (SJTU-AViT) at 1° resolution from 1982 to 2023. The approach uses the self-attention mechanism of the ViT model to improve the modeling of spatial and temporal variations of spCO2, and it incorporates physical-biogeochemical constraints, in the form of derivatives of spCO2 with respect to key controlling factors, as additional features. Incorporating advanced ocean biogeochemical models during training allows the ViT-based model to capture spCO2 variability more accurately in data-sparse regions. Evaluations demonstrate that the new data product effectively captures spCO2 variability at both global and regional scales and is consistent with SOCAT observations, long-term ocean station data, and global atmospheric CO2 trends. The reconstructed spCO2 reproduces spCO2 anomalies during El Niño-Southern Oscillation (ENSO) events well, particularly in the eastern Pacific Ocean, where it shows a correlation of 0.81 with the Niño 3.4 index and high consistency with cruise data. The global air-sea CO2 flux patterns estimated from the SJTU-AViT dataset are consistent with known regional features such as strong uptake in the Southern Ocean and outgassing in the tropical Pacific.
This study not only provides a new 42-year data product for advancing understanding of the ocean carbon cycle and for global carbon budget assessments, but also introduces a new Transformer-based deep learning framework for Earth system data reconstruction. The data product is publicly accessible at https://doi.org/10.5281/zenodo.15331978 (Zhang et al., 2025) and will be updated regularly.
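
The abstract reports a correlation between reconstructed spCO2 anomalies and the Niño 3.4 index. The sketch below illustrates one common way such a comparison could be set up: remove the mean seasonal cycle from a monthly series, then compute a Pearson correlation with a climate index. It is not the authors' evaluation code. The function names, the synthetic series, and every numeric constant are assumptions made for illustration, and a real analysis would also need to handle the long-term trend and the choice of region.

```python
import numpy as np


def monthly_anomalies(series: np.ndarray) -> np.ndarray:
    """Subtract the mean seasonal cycle from a monthly series.

    Assumes the series starts in January and its length is a multiple of 12.
    """
    by_year = series.reshape(-1, 12)        # rows: years, columns: calendar months
    climatology = by_year.mean(axis=0)      # mean value for each calendar month
    return (by_year - climatology).ravel()


def pearson_r(x: np.ndarray, y: np.ndarray) -> float:
    """Pearson correlation coefficient between two equal-length series."""
    return float(np.corrcoef(x, y)[0, 1])


# Synthetic stand-ins only (hypothetical values, NOT SJTU-AViT or Niño 3.4 data).
rng = np.random.default_rng(0)
n_years = 42                                # matches the 1982-2023 span
months = np.arange(n_years * 12)

seasonal_cycle = 10.0 * np.sin(2 * np.pi * months / 12)
enso_like = np.sin(2 * np.pi * months / 48)  # arbitrary 4-year oscillation

index_series = enso_like + 0.1 * rng.standard_normal(months.size)
spco2_series = (
    380.0 + seasonal_cycle + 8.0 * enso_like
    + 2.0 * rng.standard_normal(months.size)
)

spco2_anom = monthly_anomalies(spco2_series)
r = pearson_r(spco2_anom, index_series)
print(f"correlation between synthetic anomalies and index: {r:.2f}")
```

The sign and size of the coupling in the synthetic series are arbitrary, so the printed value says nothing about the real ocean; only the deseasonalize-then-correlate workflow carries over.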

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Status: open (until 04 Jul 2025)


Data sets

A surface ocean pCO2 product with improved representation of interannual variability using a vision transformer-based model. Xueying Zhang, Enhui Liao, Wenfang Lu, Zelun Wu, Guansuo Wang, and Shiyu Liang. https://doi.org/10.5281/zenodo.15331978


Short summary
We created a new global dataset that reveals how ocean surface carbon dioxide has changed each month over the past four decades. By applying a deep learning model trained on both observational data and model simulations, we improved the representation of interannual variability and more accurately captured ocean responses to climate events like El Niño. This work supports global efforts to understand the ocean’s role in the carbon cycle and its response to climate change.