The first consistent global gap-free daily 4 km dataset of particulate organic carbon, particulate organic nitrogen and their ratio (1998–2023)
Abstract. Particulate organic carbon (POC), particulate organic nitrogen (PON) and their ratios (POC:PON) are key to understanding particulate organic matter (POM) and marine biogeochemical cycles. However, global observations remain insufficient, with existing products suffering from spatial and temporal gaps, inconsistent methodologies, and interruptions from clouds and satellite sampling limitations. Here, we present a global gap-free daily 4 km dataset of consistent POC, PON and POC:PON from January 1998 to December 2023 (GGFD-POM), generated using a concise retrieval then reconstruction workflow. POC and PON concentrations were retrieved from ocean color data (OC-CCI v6.0) and Copernicus reanalysis physics data using newly developed Gaussian Process Regression (GPR) models. The models were trained on ~3110 matchups between in-situ observations, OC-CCI bio-optical properties, and Copernicus physical properties, achieving R2 of 0.87 and 0.89, and RMSE of 1.47 mg m-3 and 1.41 mg m-3 for POC and PON, respectively. The Discrete Cosine Transform–Penalized Least Squares (DCT-PLS) approach was subsequently applied to reconstruct missing values in the satellite-retrieved POC and PON fields, resulting in gap-free global datasets. Based on these reconstructed products, a gap-free global POC:PON dataset was further derived. Validation using an independent in-situ dataset confirmed high accuracy of both satellite retrievals and reconstructions for POC, PON and POC:PON. Triple-collocation analysis (TCA) exhibited that the GGFD POC data outperform existing MODIS-Aqua and MULTIOBS POC datasets, reflecting the distinct strengths of the GGFD-POM product. Comparative analysis of spatiotemporal variations in POC, PON, and POC:PON further demonstrated that the gap-free dataset better captures trends and magnitudes, enhancing understanding of their roles in the global carbon and nitrogen cycles. The complete GGFD POM product dataset (1998–2023) is openly available at https://doi.org/10.11888/Ocean.tpdc.303488 (Zhang and Liu, 2026).