Articles | Volume 16, issue 9
https://doi.org/10.5194/essd-16-4267-2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.A 20-year (1998–2017) global sea surface dimethyl sulfide gridded dataset with daily resolution
Download
- Final revised paper (published on 19 Sep 2024)
- Supplement to the final revised paper
- Preprint (discussion started on 12 Dec 2023)
- Supplement to the preprint
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on essd-2023-249', Murat Aydin, 27 Jan 2024
- AC2: 'Reply on RC1', Shengqian Zhou, 07 Apr 2024
-
RC2: 'Comment on essd-2023-249', Anonymous Referee #2, 29 Jan 2024
- AC1: 'Reply on RC2', Shengqian Zhou, 07 Apr 2024
Peer review completion
AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Shengqian Zhou on behalf of the Authors (07 Apr 2024)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (09 Apr 2024) by François G. Schmitt
RR by Murat Aydin (02 May 2024)

ED: Reconsider after major revisions (04 May 2024) by François G. Schmitt

AR by Shengqian Zhou on behalf of the Authors (26 Jun 2024)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (07 Jul 2024) by François G. Schmitt
RR by Murat Aydin (22 Jul 2024)

ED: Publish as is (22 Jul 2024) by François G. Schmitt

AR by Shengqian Zhou on behalf of the Authors (25 Jul 2024)
Manuscript
Post-review adjustments
AA: Author's adjustment | EA: Editor approval
AA by Shengqian Zhou on behalf of the Authors (13 Sep 2024)
Author's adjustment
Manuscript
EA: Adjustments approved (18 Sep 2024) by François G. Schmitt
The manuscript by Zhou et al. offers a 20-year (1998-2017) global sea surface dimethyl sulfide (DMS) dataset with daily resolution. The new dataset is developed with an artificial neural network (ANN) ensemble model based on 9 environmental parameters. DMS is produced biogenically in the ocean and its emissions contribute to aerosol radiative forcing in the troposphere. There are a few other global ocean DMS emissions datasets, including one based on an artificial neural network. What makes this dataset unique is that it offers a high time resolution data product covering a 20 year period. The authors claim it is an improved emission inventory of oceanic DMS which can facilitate improved simulations of aerosols derived from DMS. This is a useful dataset with unique features that suits the Earth System Science Data Journal goals of publishing articles on original datasets. However, I would like to see the authors address my comments and questions below before the publication of the manuscript.
My primarily concerns are centered around the comparisons between the ANN product and actual data displayed in Fig. 3. The statistical metrics chosen for arguing good agreement between the ANN product and the observations are R2 and root mean square error (RMSE). These metrics are appropriate for testing the predictive capability of linear regressions, in other words the accuracy of a linear model, but they do not necessarily address the fidelity of the model data to reality. If we are to prefer the ANN data product over the observations to estimate DMS flux, the manuscript needs to present convincing evidence that the model vs. observations relationship is not only linear but also has a slope that centers around 1:1. In this context, it is not enough to show that there is a strong linear relationship between the ANN product and the observations, rather the slope of the linear relationship should be quantified and ideally shown to be statistically indistinguishable from 1.
Fig. 3 offers only qualitative information about the value of the slopes. I found the data density color scale helpful in trying to estimate what the slope of the best fits to these scatter plots might be, but the slopes should really be quantified in the manuscript. The manuscript includes a passing reference to a potential bias issue with regards to the coastal region. I agree that, if the bias is limited to high concentrations in that region alone, this would not be a big deal. However, looking at Fig. 3, I suspect that the slopes might be different than unity for multiple regions, although I cannot be not certain without seeing a proper analysis. The fact that the entire analysis is in log-log space makes me more worried because small looking deviations in a log-log linear relationship can result in significant biases in actual concentration and flux calculations.
One can think of many different ways to conduct this type of analyses, but I would advise investigating the residuals of the scatter plots in Fig. 3 from the 1:1 slope versus the DMS_obs. Fitting linear regression lines to these residuals-plots would be a good way to test for biases; ideally these slopes should equal zero within uncertainties, meaning the residuals do not have a positive or negative relationship with DMS_obs. For example, Figs. 4b,c display linear fits to the data and these slopes are different from unity. This is also noticeable in Fig. 4a as most of the higher concentrations during July-Aug are underestimated by the model and conversely the lower concentrations in the winter tend to be overestimated, leading to a damped seasonal cycle. I grant that the differences look small in Fig. 4a, but given that this figure too is on a logarithmic scale, it would be good to see a formal quantification of fluxes generated with observed data versus the simulated ones. Do under and over estimations at either end cancel each other out or does one win out over the other, leading to biases in the annual fluxes? I have a cautionary note when conducting linear fits. For your raw data, I’m guessing the errors for individual DMS_obs will be very small compared with the dynamic range of the dataset, therefore the x errors can be ignored. Likewise, it is probably reasonable to assume y errors are uniform, meaning standard (least-squares in y direction) linear regression analyses could be safely implemented. For the regionally-averaged data show in Fig. 4c, the x errors look very large and both x and y errors look nonuniform, meaning a standard linear regression approach will yield inaccurate estimates of the slope and its confidence band.
Some other shorter general comments and questions:
I’m confused about how the data from different time periods are treated during the training and validation steps of ANN model development. As far as I can tell, you use all data from all periods in training and validation. Once you have the ANN model, you input time variable parameters to estimate temporal changes in concentrations and fluxes, is this correct? Your criticism of previous work for using data from different time periods to estimate a global average flux does not seem justified because you seem to develop your model in the same fashion, or am I missing something?
It would be good see how much data each region contributes to the full dataset. The coastal region appears to contribute the most even though the emissions from the coastal regions constitute only 3% of the global DMS flux, and conversely the trades regions have little data even though the integrated fluxes in these regions are high. Did you try training the model without the coastal data to see if the model results change?
What are the contributions of the 9 different model parameters to the final outcome? Which parameters carry more important information according to your ANN model?
Was the ANN allowed to freely chose model equations, did you impose any restrictions or try other models?
Minor comments/corrections as they appear in the manuscript:
Line 81-82: This sentence here gives the impression that you are not using all data from all years with equal weight.
Line 113: Are you using exactly the same data that went into Hulswar et al (2022)?
Lines 128-131: What happens in SI covered areas? What level of SI cover lead to zero emissions?
Line 143: Are the SeaWiFS and Aqua-MODIS data in reasonable agreement?
Fig. 4a: The markers look quite faint on my screen. I suggest sharper colors.
Lines 339-344: Refer to Fig. 9 somewhere.
Lines 388-390: What drives the trends in Kt?