Preprints
https://doi.org/10.5194/essd-2025-725
https://doi.org/10.5194/essd-2025-725
13 Feb 2026
 | 13 Feb 2026
Status: this preprint is currently under review for the journal ESSD.

MLAWind: A Monthly Sea Surface Wind Dataset Derived from an Interpretable Machine Learning Approach Integrating In-Situ Observations and Satellite Data

Weihao Guo, Rongwang Zhang, Xin Wang, and Dongxiao Wang

Abstract. A gridded sea surface wind dataset with long temporal coverage is crucial for understanding atmospheric circulation changes and air-sea interactions at different time scales. This study employs an interpretable machine learning model based on random forest algorithm to generate a 1°×1° monthly sea surface wind dataset (MLAWind) from 1950 to 2023, covering the near-global ocean within 60° S–60° N. The data reconstruction model integrates the Cross-Calibrated Multi-Platform (CCMP) satellite data and the spatially sparse long-term International Comprehensive Ocean-Atmosphere Data Set (ICOADS), exhibiting robust interpretability and generalization capability. Evaluations demonstrate that the MLAWind dataset exhibits better agreement with remote sensing observations than existing reanalysis datasets during the training period (1993–2022), while maintaining robust performance during the independent testing period in 2023. Moreover, the performance of MLAWind since 1950 is assessed across multiple time scales. Its characteristics in climatology, annual cycle, and inter-annual variability are comparable to those of existing reanalysis datasets, even during the non-satellite period prior to 1993. Uncertainties remain in the long-term trends of different datasets. The trend derived from MLAWind is corroborated by independent coral records during 1950–1982, which demonstrates its strong capability in reconstructing historical sea surface wind variations. The results indicate that MLAWind serves as a reliable data resource for global climate change research. The reconstructed MLAWind dataset is publicly accessible at https://doi.org/10.5281/zenodo.17354864 (Guo et al., 2025b).

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Weihao Guo, Rongwang Zhang, Xin Wang, and Dongxiao Wang

Status: open (until 11 Apr 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on essd-2025-725', Anonymous Referee #1, 17 Feb 2026 reply
Weihao Guo, Rongwang Zhang, Xin Wang, and Dongxiao Wang

Data sets

Machine learning-assisted sea surface wind dataset (MLAWind) W. Guo et al. https://doi.org/10.5281/zenodo.17354864

Weihao Guo, Rongwang Zhang, Xin Wang, and Dongxiao Wang

Viewed

Total article views: 222 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
146 64 12 222 12 15
  • HTML: 146
  • PDF: 64
  • XML: 12
  • Total: 222
  • BibTeX: 12
  • EndNote: 15
Views and downloads (calculated since 13 Feb 2026)
Cumulative views and downloads (calculated since 13 Feb 2026)

Viewed (geographical distribution)

Total article views: 215 (including HTML, PDF, and XML) Thereof 215 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 05 Mar 2026
Download
Short summary
This study develops a monthly near-global sea surface wind dataset (MLAWind) from 1950–2023 through an interpretable machine learning framework integrating in-situ observations and satellite data. Evaluations show that MLAWind achieves comparable performance to the widely-used reanalysis datasets at different time scales. It provides reliable historical wind data during both satellite and non-satellite periods, demonstrating broad application prospects in ocean and climate research.
Share
Altmetrics