the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A Novel Global Gridded Ocean Oxygen Product Derived from a Neural Network Emulator and in-situ observations
Abstract. Ocean deoxygenation, driven by climate change, poses significant challenges to marine ecosystems and can profoundly alter nutrient and carbon cycling. Quantifying the rate and regional patterns of deoxygenation relies on spatio-temporal interpolation tools to fill gaps in observational coverage of dissolved oxygen. However, this task is challenging due to the sparsity of observations, and classical interpolation methods often lead to high uncertainty and biases, typically underestimating long-term deoxygenation trends. In this work, we develop a novel gridded dissolved oxygen product by integrating direct oxygen observations with machine-learning-based emulated oxygen estimates derived from temperature and salinity profiles. The gridded product is then generated through optimal interpolation of both the observed and emulated data. The resulting product shows strong agreement with baseline climatology and captures well-known patterns of seasonal variability and long-term deoxygenation trends. It also outperforms current state-of-the-art products by more accurately capturing dissolved oxygen variability at synoptic and decadal scales, and by reducing uncertainty around long-term changes. This study highlights the potential of combining machine learning with classical interpolation methods to generate improved gridded biogeochemical products, enhancing our ability to study and understand ocean biogeochemical processes and their variability under a changing climate.
- Preprint
(17149 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 01 Sep 2025)
-
RC1: 'Comment on essd-2025-288', Anonymous Referee #1, 14 Jul 2025
reply
I would like to thank the authors for the interesting and timely work. The paper presents a novel approach to generating a gridded dissolved oxygen product by integrating direct observations with ML–based emulations derived from temperature and salinity profiles, followed by optimal interpolation. The methodology is simple to follow and technically sound, the results are compelling, and the product demonstrates clear improvements over existing datasets, especially in capturing long-term trends and reducing uncertainties. I recommend acceptance with minor revisions, but I would like to note that my review is primarily focused on the ML aspect.
Comments:
* The training/test split was done randomly, how the authors ensure there is no data leakage? It would have been more interesting if the trains was done in a temporal way.
* It would have been also more robust to use a validation dataset, instead of only train/test
* Any reason why the test locations do not include any points near Europe?
* Any reason why using Month of the year + Day of the month in the MLP inputs instead of just using Day of the year?
* Can the authors describe the hyper parameter search procedure to tune the MLP?
* Figure 1 would have been more informative if the plots where done per test region
* Any explanation of what's happening at depth 500 in test region J (Figure 2)?
* It would be interesting to use any XAI method to study feature importance for the MLP
* Any plans to share the code used and not only the dataset?Typos:
* Line 35: "weather forecasting" instead of "forecasting"
* Many citations are badly formatted, /citet vs /citepCitation: https://doi.org/10.5194/essd-2025-288-RC1 -
AC1: 'Reply on RC1', Said Ouala, 12 Aug 2025
reply
Please see attached file.
-
AC1: 'Reply on RC1', Said Ouala, 12 Aug 2025
reply
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
260 | 41 | 29 | 330 | 14 | 28 |
- HTML: 260
- PDF: 41
- XML: 29
- Total: 330
- BibTeX: 14
- EndNote: 28
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1