Preprints
https://doi.org/10.5194/essd-2025-699
https://doi.org/10.5194/essd-2025-699
01 Dec 2025
 | 01 Dec 2025
Status: this preprint is currently under review for the journal ESSD.

GEOXYGEN: a global long-term dissolved oxygen dataset based on biogeochemistry-aware machine learning framework and multi-source observations

Zhenguo Wang, Weiwei Fu, Cunjin Xue, and Guihua Wang

Abstract. Dissolved oxygen (DO) serves as an essential indicator of marine ecosystem health. However, sparse and uneven observations have limited our ability to characterize its full spatiotemporal variability, underscoring the continued need for long-term, high-resolution, and physically consistent global DO datasets. Here, we present GEOXYGEN, a global dataset of monthly DO fields at 0.5° × 0.5° resolution spanning 1960–2024 and depths from the surface to 5500 m (Wang et al., 2025, https://doi.org/10.5281/zenodo.17615657). GEOXYGEN is generated with a hierarchical modeling framework that accounts for regional and vertical heterogeneity. By integrating physical and biogeochemical predictors with an adaptive feature-selection strategy, GEOXYGEN achieves high predictive accuracy across all depth layers on an independent out-of-time test (R² > 0.92). The reconstructed spatial patterns align closely with the World Ocean Atlas 2023 climatology, and in subsurface and deep waters, GEOXYGEN demonstrates superior generalization relative to existing data-driven products. A sensitivity analysis further reveals that including coastal data in model training increases basin-wide uncertainty by approximately 7.5 %, underscoring that current observing systems remain insufficient to reliably resolve nearshore DO dynamics. GEOXYGEN provides a consistent, physically informed baseline for analyzing global and regional variability of DO. It also offers a valuable benchmark for evaluating and improving the representation of DO in climate and Earth system models and can support future studies on long-term deoxygenation trends and regional hotspots.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Zhenguo Wang, Weiwei Fu, Cunjin Xue, and Guihua Wang

Status: open (until 07 Jan 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Zhenguo Wang, Weiwei Fu, Cunjin Xue, and Guihua Wang

Data sets

GEOXYGEN: a global long-term dissolved oxygen dataset (V1.0) Z. Wang et al. https://doi.org/10.5281/zenodo.17615657

Model code and software

GEOXYGEN-code Z. Wang https://github.com/layne1202/GEOXYGEN-code

Zhenguo Wang, Weiwei Fu, Cunjin Xue, and Guihua Wang

Viewed

Total article views: 55 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
49 5 1 55 2 2
  • HTML: 49
  • PDF: 5
  • XML: 1
  • Total: 55
  • BibTeX: 2
  • EndNote: 2
Views and downloads (calculated since 01 Dec 2025)
Cumulative views and downloads (calculated since 01 Dec 2025)

Viewed (geographical distribution)

Total article views: 55 (including HTML, PDF, and XML) Thereof 55 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 02 Dec 2025
Download
Short summary
Oxygen in the ocean is vital for marine life and for the global climate, but long records are patchy. We combine millions of ship and float measurements with advanced computer methods to create GEOXYGEN, a new monthly map of global ocean dissolved oxygen from 1960 to 2024. It shows where and how fast oxygen is changing, highlights blind spots near coasts and in deep water, and offers a solid basis to test and improve climate and ecosystem models.
Share
Altmetrics