Reprocessing of eXpendable BathyThermograph (XBT) profiles from the Ligurian and Tyrrhenian seas over the time period 1999–2019 with a full metadata upgrade

Simoncelli, Simona; Reseghetti, Franco; Fratianni, Claudia; Cheng, Lijing; Raiteri, Giancarlo

doi:https://doi.org/10.5194/essd-16-5531-2024

Articles | Volume 16, issue 12


Data description paper

03 Dec 2024



Abstract

The advent of open science and the United Nations Decade of Ocean Science for Sustainable Development are revolutionizing the ocean-data-sharing landscape, enabling efficient and transparent generation of ocean information and knowledge. This blue revolution raised awareness of the importance of metadata and community standards to activate interoperability of the digital assets (data and services) and guarantee that data-driven science preserves provenance, lineage and quality information for its replicability. Historical data are frequently not compliant with these criteria, lacking metadata information that was not considered crucial, and hence was not retained, at the time of data generation and further ingestion into marine data infrastructures. The present data review is an example attempt to fill this gap through a thorough data reprocessing starting from the original raw data and operational log sheets. The data gathered using XBT (eXpendable BathyThermograph) probes during several monitoring activities in the Tyrrhenian and Ligurian seas between 1999 and 2019 have first been formatted and standardized according to the latest community best practices, and all available metadata have been inserted, including calibration information never applied, uncertainty specification and bias correction from Cheng et al. (2014). Secondly, a new automatic quality control (QC) procedure has been developed and a new interpolation scheme applied. The reprocessed (REP) dataset has been compared to the data version presently available from the SeaDataNet (SDN) data access portal, processed according to the pioneering work of Manzella et al. (2003) conducted in the framework of the European Union Mediterranean Forecasting System Pilot Project (Pinardi et al., 2003). The comparison between the REP and SDN datasets aims to highlight the main differences derived from the new data processing. The maximum discrepancy between the REP and SDN data versions always resides within the surface layer (REP profiles are warmer than SDN ones), extending down to 150 m depth, generally when the thermocline is established (from June to November). The overall bias and root mean square difference are equal to 0.002 and 0.041 °C, respectively. Such differences are mainly due to the new interpolation technique (Barker and McDougall, 2020) and the application of the calibration correction in the REP dataset.

The REP dataset (Reseghetti et al., 2024; https://doi.org/10.13127/rep_xbt_1999_2019.2) is available and accessible through the INGV (Istituto Nazionale di Geofisica e Vulcanologia, Bologna) ERDDAP (Environmental Research Division's Data Access Program) server, which allows for machine-to-machine data access in compliance with the FAIR (findable, accessible, interoperable and reusable) principles (Wilkinson et al., 2016).


Received: 16 Dec 2023 – Discussion started: 03 Jan 2024 – Revised: 30 Sep 2024 – Accepted: 04 Oct 2024 – Published: 03 Dec 2024

1 Introduction

The open-science paradigm boosted the sharing of data through different pathways, determining the generation of different versions of the same datasets. This might depend on the timeliness of data delivery in either near-real time (NRT) or delayed mode (DM), with the data center managing the dataset, and the data assembly center or the marine data infrastructure collating it. The awareness of the importance of a complete metadata description is increasing among the scientific community since it allows for interoperability, traceability of the data life cycle, transparency and replicability of the knowledge generation process. In particular, some key information is crucial in climate science because it allows for the reanalysis of historical data and quantifying and reducing uncertainties, which are used to derive accurate scientific knowledge (Simoncelli et al., 2022).

The data provider should define the overall quality assurance strategy along with the data life cycle to guarantee the availability of the best data product, which implies the possibility of reprocessing the dataset according to the state-of-the-art quality control (QC) procedures and standards. Data-driven research should use the most extensive datasets with complete metadata information passed through a trustworthy QC procedure. These are also the basic requirements to guarantee data reusability once the data are made openly accessible. The complete set of metadata assures transparency of the data provenance and avoids the circulation of multiple versions.

The integration in global databases of data not compliant with these principles emerged recently for measurements gathered in the last century when the importance of storing data with complete ancillary information was not yet clear. A striking example is provided by the XBT (eXpendable BathyThermograph) probes, the oceanographic instruments that recorded the largest number of temperature profiles in the ocean from the 1970s to the 1990s (Meyssignac et al., 2019). The complete metadata information is crucial for QC, data reprocessing (Cheng et al., 2014, 2018; Goni et al., 2019) and integration with other data types to estimate key ocean monitoring indicators, such as the trend of global ocean heat content (Cheng et al., 2020, 2021, 2022), one of the most important climate change indicators. According to the literature (Cheng et al., 2016, 2017; Parks et al., 2022), the crucial metadata information that must be associated with XBT data includes the probe type and manufacturer, fall rate equation, launch height, and recording system. This information was not mandatory for the data ingestion in the main marine data infrastructure; thus, most historical data miss it. For example, 50 % of XBT profiles in the World Ocean Database (WOD) have no information about manufacturer or probe type (Cowley et al., 2021), necessitating the application of intelligent metadata techniques to complement it (Palmer et al., 2018; Leahy et al., 2018; Haddad et al., 2022).

This data review originated from the recognition that the historical XBTs from the Ligurian and Tyrrhenian seas, presently available in the main marine data infrastructures – SDN (https://www.seadatanet.org/, last access: 28 November 2024), WOD (https://www.ncei.noaa.gov/products/world-ocean-database, last access: 28 November 2024) and Copernicus Marine Service (CMS, https://marine.copernicus.eu/, last access: 28 November 2024) – have incomplete metadata descriptions, and the data might also differ. Our objective was to recover the raw data together with the full metadata description and secure them for further use by future generations of scientists. This awareness arose alongside the evolution of open science and the FAIR (findable, accessible, interoperable and reusable) data management principles, which motivated us to adopt the latest community standards and QC procedures and to implement an ERDDAP server as a data dissemination strategy. ERDDAP is an open-source environmental data server software developed by NOAA and used throughout the ocean observing community (Pinardi et al., 2019; Tanhua et al., 2019), which allows us to become a node of the present data digital ecosystem in line with one of the expected societal outcomes (transparent and accessible ocean) of the United Nations Decade of Ocean Science 2021–2030 (Ryabinin et al., 2019; Simoncelli et al., 2022).

The paper describes the reprocessing of temperature profiles from expendable probes deployed between 1999 and 2019 in the Ligurian and Tyrrhenian seas, with most of them from vessels operating a commercial line between the Italian ports of Genoa and Palermo within the Ships Of Opportunity Program (SOOP) of the Global Ocean Observing System (GOOS), currently identified as the MX04 line. Additional XBT data were collected through ancillary monitoring surveys with commercial and research vessels. The dataset also contains some XCTD (eXpendable Conductivity–Temperature–Depth) probe profiles (less than 1 %). The reprocessed dataset (REP) is obtained from the original raw XBT profiles, the readable output of the data acquisition system (DAQ). A correction based on the DAQ calibration (when available) is applied to each recorded temperature value and is also provided as separate information, so that users can subtract it if needed. Automated QC tests, specifically tuned for western Mediterranean basins and based on the latest documented QC procedures (Cowley et al., 2021; Parks et al., 2022; Good et al., 2023; Tan et al., 2023) and best practices to assign a quality flag (QF), are applied, followed by interpolation of raw profiles at each meter depth. All available information collected during data taking has been added in the metadata section according to SeaDataNet standards (https://www.seadatanet.org/Standards, last access: 28 November 2024) and IQuOD (International Quality-controlled Ocean Database, https://www.iquod.org/index.html, last access: 28 November 2024) recommendations. The uncertainty specification for both depth and temperature is also provided, as it is crucial information for assimilating data in ocean reanalysis or for utilizing them in downstream applications. Cheng et al. (2014) demonstrated that XBT data are characterized by systematic bias when compared with data gathered from CTD and computed the commonly used correction scheme for both temperature and depth records, which is very important to derive integrated data products or ocean indicators from multiple data sources and instruments (Cheng et al., 2016). The REP dataset includes the Cheng et al. (2014) correction scheme applied to the calibrated profiles at original depth and then interpolated at each meter depth.

The REP data product allows the user to select among the original, validated, interpolated and corrected profiles, filtering on the basis of the required quality level by selecting the associated QF. Furthermore, the dataset is accessible through the ERDDAP (Environmental Research Division's Data Access Program) data server (http://oceano.bo.ingv.it/erddap/index.html, last access: 28 November 2024) installed at the INGV (https://ror.org/029w2re51, last access: 28 November 2024), which provides a simple and consistent way to download it in several common file formats.
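As an illustration of machine-to-machine access, the following minimal sketch builds a request URL following the general ERDDAP tabledap convention (base URL, dataset identifier, file type, comma-separated variables, then constraints). Only the server address comes from the text; the dataset identifier and the variable names are hypothetical placeholders, and it is an assumption that the REP dataset is exposed through tabledap. The actual identifiers should be read from the server catalogue.

```python
from urllib.parse import quote

# Server address quoted in the text.
ERDDAP_BASE = "http://oceano.bo.ingv.it/erddap"
# Hypothetical dataset identifier: check the server catalogue for the real one.
DATASET_ID = "REP_XBT_1999_2019"


def tabledap_csv_url(base, dataset_id, variables, constraints):
    """Build an ERDDAP tabledap CSV request URL.

    variables   -- names of the columns to return (hypothetical here)
    constraints -- strings such as "depth<=100"; '<' and '>' are percent-encoded
    """
    query = ",".join(variables)
    for constraint in constraints:
        query += "&" + quote(constraint, safe="=")
    return f"{base}/tabledap/{dataset_id}.csv?{query}"


if __name__ == "__main__":
    url = tabledap_csv_url(
        ERDDAP_BASE,
        DATASET_ID,
        ["time", "latitude", "longitude", "depth", "temperature"],  # assumed names
        ["time>=2015-01-01T00:00:00Z", "depth<=100"],
    )
    print(url)
```

The function only assembles the URL and performs no network request, so it can be adapted and inspected before any download is attempted.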

This study was conducted in the framework of the MACMAP (Multidisciplinary Analysis of Climate change indicators in the Mediterranean And Polar regions) project (https://progetti.ingv.it/it/progetti-dipartimentali/ambiente/macmap, last access: 28 November 2024) funded by INGV (https://ror.org/00qps9a02, last access: 28 November 2024) (2020–2024) in technical collaboration with ENEA (Italian National Agency for New Technologies, Energy and Sustainable Economic Development) and GNV (Grandi Navi Veloci) shipping company. In fact, the reprocessing of the historical XBTs was preparatory to the automatic validation, management and publication of new XBT data gathered on the MX04 line from September 2021 after a 2-year interruption in the monitoring activity.

The paper is organized as follows: Sect. 2 describes the main characteristics of an XBT system. Sect. 3 describes the original dataset and the monitoring activities that sustained it. Sect. 4 describes the methodology applied for the automatic QC and the correction derived from calibration, Sect. 5 is about the results, Sect. 6 describes the REP dataset findability and accessibility, and Sect. 7 summarizes the main results and draws conclusions.

2 The XBT system

In the early 1960s, following a request from the US Navy looking for a seawater temperature profiler for military applications, engineers from Francis Associates developed an early version of an XBT probe. The prototype was improved within Sippican Corp. (now part of Lockheed Martin Co., hereinafter Sippican) and then adopted by the US Navy (Reid, 1964; Little, 1965, 1966). Within a few years Sippican optimized the original project and marketed different XBT types with specifications suitable for various depths and ship speed. XBTs became very popular within the oceanographic community (Flierl and Robinson, 1977), allowing for the gathering of temperature (T) profiles through the use of commercial vessels (ships of opportunity) and not just research vessels.

The XBT system consists of an expendable ballistic probe falling into seawater; a device (DAQ) that records an electrical signal and converts it into usable numerical data (in combination with a computer unit); and the connection between the falling probe and the DAQ (e.g., Goni et al., 2019; Parks et al., 2022). The sensing component is an NTC (negative temperature coefficient) thermistor that changes its resistance according to the temperature of seawater flowing through the central hole of the probe nose where it is located. Its thermal time constant, τ (the time needed to detect 63 % of a thermal step signal), is ∼0.11 s (Magruder, 1970, and references therein), so a time of ∼0.6 s (about 5τ) is needed to fully detect a step temperature change. Technical characteristics required by Sippican for the NTC thermistor, reading circuit and resistance-to-temperature conversion procedure (e.g., Sippican, 1991, and Appendix A) put some limits on the accuracy of XBT measurements.
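The relation between the time constant and the time needed to resolve a temperature step can be illustrated with a short sketch. It assumes an ideal first-order sensor response, 1 − exp(−t/τ), which is consistent with the 63 % definition of τ given above but is a simplification of the real thermistor behavior.

```python
import math

TAU_S = 0.11  # thermal time constant quoted in the text (s)


def step_response_fraction(t_s, tau_s=TAU_S):
    """Fraction of a temperature step registered t_s seconds after the step.

    Assumes an ideal first-order lag, which is an idealization.
    """
    return 1.0 - math.exp(-t_s / tau_s)


if __name__ == "__main__":
    for t in (0.11, 0.3, 0.6):
        pct = 100.0 * step_response_fraction(t)
        print(f"t = {t:.2f} s: {pct:.1f} % of the step detected")
```

Under this assumption, one time constant recovers about 63 % of the step, while 0.6 s recovers more than 99 % of it.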

Another essential component is the thin twin copper wire, which is part of the acquisition circuit and is unwound from two spools simultaneously (clockwise from the ship and counterclockwise from the falling probe), a technique that decouples the XBT vertical motion from the translational motion of the ship. The electric current that runs through the wire during acquisition, albeit weak, turns the wire into a large antenna sensitive to nearby electromagnetic phenomena. Non-uniform coating application and defective winding on one of the spools cause a significant part of the faulty or prematurely terminated acquisitions.

XBT probes do not house any pressure sensors, and the depth associated with a temperature measurement is not measured directly but estimated by a fall rate equation (FRE) provided by the manufacturer with coefficients that depend on the probe type and are valid for the world ocean. The software transforms a time series of resistance values sensed by the thermistor into a series of depth and T values using first a resistance-to-temperature conversion relationship (identical for all XBT types because it is specific for the thermistor used; see Appendix A) and then calculating the corresponding depth values by applying a specific FRE for each probe type. Sippican has preset conservative values for the recording time in its acquisition software, but these values can be freely modified in order to use all the wire wound on the probe spools. The first column of Table 1 shows the nominal values and the maximum recorded depth in the same areas for each specific probe type.

Each component of an XBT system contributes to the overall uncertainty in depth and T measurements. Recently, the IQuOD group (Cowley et al., 2021) released a summary of T uncertainty specifications for different oceanographic devices determined using available knowledge (type-B uncertainty). The uncertainty estimate associated with XBT probes adopts the accuracy values provided by the manufacturer:

for depth, 4.6 m up to 230 m depth and 2 % at greater depths;
for T, within the range of 0.1–0.2 °C, with small variations depending on the manufacturer and the manufacturing date. The value associated with the XBT probes in the REP dataset is equal to 0.10 °C.
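The uncertainty specification listed above can be written as a small helper. This is only a sketch of the stated rule (4.6 m down to 230 m, 2 % of depth below, and 0.10 °C for temperature); the function and constant names are illustrative and do not come from the REP dataset.

```python
# Temperature uncertainty assigned to XBT probes in the REP dataset (deg C).
XBT_TEMPERATURE_UNCERTAINTY_C = 0.10


def xbt_depth_uncertainty_m(depth_m):
    """Manufacturer-based depth uncertainty: 4.6 m down to 230 m, 2 % of depth below."""
    if depth_m < 0:
        raise ValueError("depth must be non-negative (positive downward)")
    return 4.6 if depth_m <= 230.0 else 0.02 * depth_m


if __name__ == "__main__":
    for z in (50.0, 230.0, 500.0, 1000.0):
        print(f"depth {z:6.1f} m: +/- {xbt_depth_uncertainty_m(z):.1f} m")
```

The two branches meet at 230 m, where 2 % of the depth equals 4.6 m, so the rule is continuous.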

Bordone et al. (2020) compared XBT profiles from SOOP activities in the Ligurian and Tyrrhenian seas with quasi-contemporaneous (±1 d) and co-located (distance smaller than 12 km) Argo profiles. The XBT profiles used by Bordone et al. (2020) are included in the REP dataset, but they went through a different QC and interpolation procedure that could slightly modify their results. In the 0–100 m layer, the mean T difference was 0.24 °C (the median 0.09 °C), and the standard deviation (SD) was 0.67 °C. Below 100 m depth, the XBT measurements were on average 0.05 °C warmer than the corresponding Argo values (mean and median were almost coincident), and the SD was 0.10 °C. This last SD value agrees with the manufacturer specification and the T uncertainty value reported by Cowley et al. (2021), which has been assigned to the REP data. The values estimated by Bordone et al. (2020) for the surface and sub-surface layer (depth <100 m) are instead affected by both XBT (4.6 m) and Argo (2.4 dbar) depth uncertainty estimation, meaning that a small variation in depth could correspond to a large variation in temperature, especially when the seasonal thermocline develops, so the comparison with Argo values would not be significant. The specified uncertainties are independent of the systematic error or bias affecting the XBT temperature and depth measurements that have been corrected in the REP dataset applying the Cheng et al. (2014) correction scheme.

In fact, the first part of the XBT motion is critical, meaning that the T and depth values in the surface layer must be considered very carefully, especially if the launch height (which influences the entry velocity of the probe and consequently the time and depth at which it reaches the terminal velocity, i.e., the value used in the FRE) differs from 3 m above sea level, the value suggested by Sippican. Very high launch platforms make the initial depth values calculated through the FRE incorrect (Bringas and Goni, 2015, and references therein). In addition, the time constant of the thermistor (Magruder, 1970, and references therein), the thermal mass of the XBT probe (e.g., Roemmich and Cornuelle, 1987) and the storage temperature influence the reliability of the first T records. For these reasons, careful data validation in the near-surface layer and where the seasonal thermocline occurs (i.e., depths shallower than 100 m in the study region) is crucial.

The depth resolution depends on both the DAQ sampling rate and the FRE of the XBT probe. All DAQ models used in this dataset work at 10 Hz (i.e., a sample every 0.1 s, a time interval nearly coincident with the time constant of the NTC thermistor) so that the depth resolution has actual values close to 0.6 m. The T resolution is usually 0.01 °C when using the standard Sippican software, while 0.001 °C is the standard output for Devil and Quoll DAQs and some old Sippican software versions. Throughout the work, three decimal digits are always used for T values and the derived quantities (i.e., vertical gradient). The computer clock (always updated to the UTC value shortly before the start and after the end of operations) provides the time coordinate of each profile with a sensitivity of 1 s. The differences recorded with respect to the standard UTC time have always been smaller than 1 s over a 24 h time frame.
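The dependence of the depth resolution on the sampling rate and the FRE can be checked with a short calculation. The sketch below uses the quadratic form D(t) = At − Bt² and, as an assumption, the coefficients commonly cited for T4/T6/T7/Deep Blue probes (the manufacturer's original values and those of Hanawa et al., 1995); the coefficients actually applied to each probe type in this dataset are those listed in Table 1.

```python
# Commonly cited FRE coefficients (A in m/s, B in m/s^2) for T4/T6/T7/Deep Blue
# probes; assumed here for illustration, Table 1 remains the reference.
FRE_COEFFICIENTS = {
    "manufacturer": (6.472, 0.00216),
    "hanawa_1995": (6.691, 0.00225),
}


def fre_depth_m(t_s, a, b):
    """Depth estimated from elapsed time since water entry: D(t) = A t - B t^2."""
    return a * t_s - b * t_s ** 2


def depth_step_m(t_s, a, b, sample_rate_hz=10.0):
    """Depth increment between two consecutive samples starting at time t_s."""
    dt = 1.0 / sample_rate_hz
    return fre_depth_m(t_s + dt, a, b) - fre_depth_m(t_s, a, b)


if __name__ == "__main__":
    for name, (a, b) in FRE_COEFFICIENTS.items():
        for t in (0.0, 50.0, 100.0):
            step = depth_step_m(t, a, b)
            print(f"{name}: t = {t:5.1f} s, depth = {fre_depth_m(t, a, b):6.1f} m, "
                  f"step = {step:.3f} m")
```

With these coefficients, the spacing between consecutive 10 Hz samples stays between 0.6 and 0.7 m and shrinks slowly as the probe decelerates.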

Sippican's manuals released over the years (e.g., Sippican, 1968, 1980, 1991; Lockheed Martin Sippican 2006, 2010, 2014) and reports (e.g., Sy, 1991; Cook and Sy, 2001; Sy and Wright, 2001; Parks et al., 2022) describe the best practices for XBT use well. The checking of the XBT system with a tester before and after data collection as well as the complete description of the system characteristics in the metadata is highly recommended for an optimal use of XBT measurements. When strip chart recorders were used, a preliminary and accurate calibration of the acquisition unit with a tester was mandatory (e.g., Sippican, 1968, 1980; Plessey Company Limited, 1975). With the advent of digital systems, this procedure was also recommended (Bailey et al., 1994). Only since July 2010 has the tester check been introduced in the monitoring activity along the MX04 line and a few other subsets of profiles contained in the REP dataset. Reseghetti et al. (2018) found a reduction in the (XBT-CTD) temperature difference after introducing a correction based on the tester check. This was also confirmed by the comparison between XBT and Argo profiles described in Bordone et al. (2020). Based on these findings, a specific correction has been developed, and it represents a key component of the information never used in previous data versions and unlocked in the REP dataset (Sect. 4.3).

The first XCTD models were developed by Sippican (Sippican Ocean Systems Inc., 1983) in the 1980s and were analog. They were completely replaced in the last years of the last century by digital versions produced by the Japanese company TSK (Tsurumi-Seiki Co.). XCTD-1 probes present some differences compared to XBTs in terms of resolution and accuracy, and a completely different recording circuitry. The manufacturer claims an accuracy of 0.02 °C on T (a factor of 5 better than XBTs) and a resolution of 0.01 °C, while the depth accuracy is the same as for XBT probes. These accuracy values can be considered type-B uncertainties, as in Cowley et al. (2021), and they are included in the REP dataset metadata. The sampling frequency is 25 Hz (i.e., a reading of the thermistor resistance value every 0.04 s), with a falling speed that is just over half that of the XBT probes (see Table 1), and the depth resolution for the model XCTD-1 is about 0.14 m.

3 The dataset

In total, 3782 temperature profiles, collected from September 1999 to September 2019 in operations managed by ENEA (S. Teresa Marine Environment Research Centre; STE hereafter), mainly through the use of commercial ships, are included in the REP dataset. They come from XBT probes, with a few dozen XCTDs. Figure 1 shows the XBT profiles' temporal and spatial distribution, highlighting their sparseness, mainly influenced by the irregular monitoring activity and data concentration along the MX04 Genoa–Palermo line. The vertical data distribution (Fig. 1c) is also non-homogeneous due to the local bathymetry, the use of different probe types and the ship speed.

https://essd.copernicus.org/articles/16/5531/2024/essd-16-5531-2024-f01

Figure 1. (a) Temporal distribution of the REP (reprocessed) XBT profiles, (b) geographical location, and (c) vertical distribution in layers of 50 m in depth.

Table 1 shows some of the characteristics of the expendable probes used in this dataset, the FRE coefficients applied to calculate the depth and the mass of the various components of each probe type (ZAMAK – zinc, aluminum, magnesium, and copper – for the nose, plastic for the body and spool, and copper wire, considering the total quantity that can unwind from the on-board spool), which allows for the evaluation of the overall quantity of material left at sea as a result of the data collection underlying the REP dataset. We have no information regarding the components of the XCTD-1 probes, but their nose is made of plastic material. Sippican is the manufacturer of all the XBT probes used, while the XCTD-1 probes are manufactured by TSK – Tsurumi-Seiki Co. – and marketed in Italy by Sippican.

The profiles were gathered during the following monitoring activities:

SOOP monitoring on the Genoa–Palermo MX04 line, which provides the greatest contribution in terms of both campaigns (1999–2000, 2004–2006, and 2010–2019) and quantity of profiles;
SOOP monitoring in collaboration with CSIRO (Commonwealth Scientific and Industrial Research Organization), from 2007 to 2011;
Sporadic additional SOOP monitoring by ENEA-STE in the Mediterranean (2012–2014);
An agreement between ENEA and IIM (Italian Hydrographic Institute of the Navy) (2006–2019);
An operational collaboration between ENEA-STE and National Research Council of Italy – Institute of Marine Sciences (CNR-ISMAR, Lerici) (2000–2017).

The main characteristics of the vessels and the instrumentation used for data collection are summarized in Table B1 in Appendix B.

Table 1. Characteristics of the different probes used: nominal depth suggested (and guaranteed) by Sippican and experienced maximum depth in the Mediterranean; maximum ship speed suggested by Sippican for an optimal drop; coefficients of the fall rate equation, $D(t) = A t - B t^{2}$, used for depth calculation (provided by the manufacturer or by the Integrated Global Ocean Services System, IGOSS; Hanawa et al., 1995); per-probe amount of ZAMAK, copper and plastic; and the number of probes included in the dataset for each probe type.

NA: not available


The first SOOP in the Mediterranean Sea (September 1999–December 2000) started in the framework of the European Mediterranean Forecasting System Pilot Project (MFSPP; Pinardi et al., 2003; Manzella et al., 2003; Pinardi and Coppini, 2010) under INGV coordination to support the development of operational oceanography forecasting activities through the NRT provision of ocean observations. XBT profiles were collected along transects crossing the Mediterranean Sea designed to monitor the variability of the main circulation features. The raw profiles were subsampled on board by Argos software (15 inflection points) and quickly inserted into the Global Telecommunication System (GTS), while the full-resolution profiles were sent to the ENEA-STE assembly center for QC, interpolation and NRT provision to the forecasting center (e.g., Fusco et al., 2003; Manzella et al., 2003; Zodiatis et al., 2005; Millot and Taupier-Letage, 2005a, b). The MX04 line is the only SOOP line still active in the Mediterranean Sea on a seasonal basis, thanks to the MACMAP project and the collaboration with GNV, whose ships connect (just under 20 h sailing at about 22 knots) Genoa (44.40° N, 8.91° E) to Palermo (38.13° N, 13.36° E) daily.

Starting from September 1999, 20 campaigns were carried out in collaboration between CNR-ISMAR and ENEA-STE with initially monthly monitoring frequency, then every 15 d (December 1999–May 2000), and again monthly frequency until December 2000. T4 probes (with some T6 probes) were launched at fixed intervals of time (every 30 min), corresponding to a sampling distance of about 11 nm. A Sippican MK12 card inserted into the motherboard of a desktop running Windows 98 Second Edition and with the software set to stop acquisition at 460 m depth was used. All the campaigns were carried out using the MV Excelsior; its route was always the same and almost coincident with track 44 of the altimetric satellites (Vignudelli et al., 2003).
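The sampling distance follows directly from the launch interval and the ship speed. The sketch below assumes a cruising speed of about 22 knots, the value quoted for the vessels currently serving the line; the speed of the MV Excelsior during these campaigns is not stated in the text, so this is an illustrative assumption.

```python
NMI_TO_KM = 1.852  # one international nautical mile in kilometers


def sampling_distance_nmi(ship_speed_kn, launch_interval_min):
    """Along-track distance between launches, in nautical miles."""
    return ship_speed_kn * launch_interval_min / 60.0


if __name__ == "__main__":
    d_nmi = sampling_distance_nmi(22.0, 30.0)  # assumed speed, stated interval
    print(f"{d_nmi:.1f} nautical miles, i.e. {d_nmi * NMI_TO_KM:.1f} km")
```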

After a hiatus of more than 3 years and a campaign in May 2004 to check slightly different operational procedures, monitoring along the MX04 line resumed on a monthly basis from September 2004 to December 2005 (no cruises in July and August 2005), with two additional cruises in May and October 2006 for a total of 17 campaigns within the European Union MFS Toward Environmental Predictions project (MFS-TEP, Manzella et al., 2007; Pinardi and Coppini, 2010). The ships (always GNV vessels) followed a route with marginal differences compared to the previous one due to the introduction of nature conservation limitations in the Tuscan Archipelago. In November 2004 and February and December 2005, the route was significantly different due to bad weather and sea conditions. The campaigns were planned to travel as close as possible to the passage date of the Jason-1 altimetric satellite along track 44, and for this reason, some were carried out on the route traveled in the opposite direction independently of weather and sea conditions. T4 and DB XBT probes were usually deployed (with a few XCTD-1 and some T6), and the sampling distance was variable, from 8 to 12 nm. After a few months, the DAQ (a Sippican MK21 ISA), despite excellent operating conditions and good ground connection, began to record profiles with rapid oscillations (amplitude of ≃0.05 °C) not attributable to the known water mass characteristics (not shown). Only at the end of the MFS-TEP data taking did careful laboratory checks identify a pair of capacitors on the ISA board as responsible for this malfunction. Unlike MFS-PP, the acquisition software was set to use all the wire available on the probe spool (i.e., 600 m for T4 and 1000 m for DB probes).

Monitoring on the MX04 line resumed in July 2010, managed directly by ENEA-STE, and until January 2013 was widely variable in terms of both frequency and sampling distance (due to the uncertainty in the supply of XBT probes). A regular sampling scheme was then adopted with a launch every 10′ of latitude (corresponding to 11–12 nm depending on the ship's course), excluding the Tuscan Archipelago, with five to six annual repetitions, following the same route as in 2004–2006 (excluding February 2013 and April 2014 because of very bad weather and sea conditions). It was also decided to carry out monitoring campaigns only with good weather and sea conditions. From June 2015, the ships moved to a more westerly route in the northern part of the transect crossing the Corsica Channel (this allows for the monitoring of the water exchange between the Tyrrhenian Sea and the Ligurian Sea) to rejoin the previous one at around 39° N. The number of drops at fixed positions increased to 37, mainly of DB probes, while other XBT types were used in particular areas with reduced bathymetry (T10) or with interesting deep thermal structures (T5/20). Based on the experience from comparison tests between XBT and CTD since March 2011, the XBT probes were placed in the open air (but always in the shade) for at least half an hour before the deployment to allow them to thermalize with the atmosphere and reduce the temperature difference with the sea surface layer as much as possible.

A short SOOP activity in collaboration with CSIRO was completed between December 2007 and March 2011 (19 campaigns) using containerships from Hapag Lloyd (namely Canberra Express, Stadt Weimar and Wellington Express) and CMA CGM (CMA CGM Charcot) shipping companies, operating between northern European ports and Australia. These campaigns were characterized by irregular frequency throughout the year, a very high launching platform (25 m over the sea level or more), and a sampling distance between 20 and 35 nm. XBT launches began near the Aegadian Islands (west of Sicily) and terminated in the Corsica Channel following a path halfway between the MX04 transect and the island of Sardinia. CSIRO installed a Turo Devil DAQ on each vessel, while ENEA-STE provided the DB probes.

Some additional XBT profiles (mainly of the DB type) were gathered in the Ligurian Sea between May 2012 and March 2014 on board the GNV ship Excellent (in five campaigns) and, in 2014, during two separate cruises using a Sippican MK21 USB on board the containership Daniel A from the Turkish shipping company Arkas.

From 2006 to 2019, 10 campaigns were carried out in collaboration between ENEA and IIM using the ships Ammiraglio Magnaghi, Aretusa and Galatea, collecting a total of about 200 profiles using different XBT types deployed from different heights and using different DAQs.

Finally, an operational collaboration between ENEA-STE and CNR-ISMAR allowed them to carry out 29 campaigns between 2000 and 2017 using vessels managed by the CNR (mainly RV Urania, but also RV Minerva Uno and Ibis), gathering several hundred profiles with different XBT probe types deployed from different heights and recorded using four different Sippican DAQ units.

The total amount of material abandoned at sea, due to the launch of the XBT and XCTD probes which constitute the REP dataset, is provided using the per-probe values reported in Table 1: over 2300 kg of ZAMAK, 220 kg of plastic material and 1060 kg of copper wire. Furthermore, there was no additional contribution to greenhouse gas emission since mainly commercial vessels were used and, in the case of research vessels, the launch of XBT probes was ancillary to the main activities of the cruise.

4 Methodology

Specific QC procedures for XBT profiles in the Mediterranean Sea were first developed by Manzella et al. (2003) within the MFS-PP project and later improved in Manzella et al. (2007). Temperature observations in the Mediterranean Sea, due to its thermohaline circulation, water mass characteristics and large temperature variability, might present peculiar features like thermal inversions or zero thermal gradient in areas of deep water formation, thus necessitating regional tuning of QC tests. The prior QC procedures included the detection of profile's end, gross range check, position control, elimination of spikes, interpolation at 1 m intervals, Gaussian smoothing, general malfunctioning control, comparison with climatology and final visual check by operator. Some additional constraints were applied: elimination of the initial part of each profile (the first acceptable value is at 4 m depth following the standard international procedure), allowed temperature values within the 10–30 °C interval, maximum temperature inversion of 4.5 °C in the 0–200 m layer, 1.5 °C below 200 m and 3 °C m⁻¹ as the maximum thermal gradient. This QC has not been applied to the data released in NRT through the GTS (Global Telecommunication System, https://community.wmo.int/en/activity-areas/global-telecommunication-system-gts, last access: 28 November 2024) but only to the data made available in DM through the SDN infrastructure (accessible through the relative saved query from the SDN Common Data Index portal at https://cdi.seadatanet.org/search/welcome.php?query=1866 &query_code={4E510DE6-CB22-47D5-B221-7275100CAB7F}, last access: 28 November 2024). The raw data for the GTS dissemination were provided to NOAA, and in the early 2000s, the profiles were also heavily sub-sampled due to the low-bit-rate satellite system provided by Argos, the basic GTS data transmission system (Manzella et al., 2003). 
These different dissemination channels contributed to the existence of several versions of the same profile in different blue data infrastructures (i.e., WOD and SDN).

A new automated QC procedure, written in Python and structured as a package, has been implemented in the framework of the MACMAP project starting from the original raw XBT profiles and considering the scientific progress made in the field in the last 2 decades and the full metadata information available. The aim was twofold: first, to secure the best and most complete version of the dataset for further use by the scientific community, and second, to implement an automated QC workflow for the seasonal XBT campaigns started in September 2021 thanks to the MACMAP project. This also allowed for the refinement and standardization of the quality assurance procedures on board the vessels to record all ancillary information in a pre-defined format and minimize the impact of different operators on the data quality. The calibration correction, detailed in Sect. 4.3, has been added, when available, to the raw data before the QC analysis. However, it is provided as a separate variable associated with each XBT profile, and the user can remove it, if required. None of the original data have been deleted; they have instead been complemented with quality indexes. The only exception concerns the drops repeated during data taking: the operator decided to repeat a drop during the sampling activity when the observed profile was affected by serious acquisition problems, either external (e.g., electrical discharge) or probe-specific (wire break or anomalous stretching, insulation penetration, leakage and so on).

A final visual check has also been performed using the ODV software (Schlitzer, 2023), which highlighted the presence of anomalous behavior in some T profiles that the automatic QC tests could not detect. Some examples are discussed in Sect. 5 (Fig. 10). This visual check suggested assigning a general QF to each profile, choosing between these two options: (1) excellent, indicating all QC done, and (2) mixed, indicating some problems, with comments to warn the user about the anomalous features.

4.1 Automatic quality control procedure

The XBT raw profiles have been QCed using a sequence of independent tests, checking for invalid information on geographic characteristics and for known signatures of spurious measurements. The results of each test are recorded by inserting the relative exit values to the corresponding measurement in ancillary variables (POSITION_SEADATANET_QC, DEPTH_TEST_QC, and TEMPET01_TEST_QC) according to the scheme shown in Table 2, while Fig. 2 provides an example of the QC tests applied to a profile.

The independent QC tests are described hereafter.

4.1.1 Position on land check

The profile position should be located at sea; thus, the latitude and longitude of each profile are checked against the gridded General Bathymetric Chart of the Oceans (GEBCO) bathymetry (GEBCO Compilation Group, 2021) on a 15 arcsec interval grid to determine whether the profile is located on land (test 1): if the height is negative, i.e., below sea level, the position is flagged as GOOD (profile is at sea); otherwise, it is flagged as BAD (profile is on land). The ancillary variable, POSITION_SEADATANET_QC, contains the exit value of the position check. However, there are no data flagged as BAD due to the position on land in the REP dataset since the operators checked both the position and the launch time before the data transmission to the data assembly center (ENEA-STE). Since we did not encounter specific issues with date and time, we did not implement additional checks.
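The logic of test 1 can be illustrated with a minimal sketch. It assumes a regular elevation grid already loaded in memory and a nearest-node lookup; the function names, the toy grid and its origin are hypothetical and do not reproduce the code of the QC package or the actual GEBCO file layout.

```python
def nearest_index(value, start, step, count):
    """Index of the grid node closest to `value` on a regular axis."""
    idx = round((value - start) / step)
    return min(max(idx, 0), count - 1)


def position_on_land_flag(lat, lon, elevation, lat0, lon0, step):
    """Return 'GOOD' if the nearest grid elevation is negative (below sea level), else 'BAD'."""
    i = nearest_index(lat, lat0, step, len(elevation))
    j = nearest_index(lon, lon0, step, len(elevation[0]))
    return "GOOD" if elevation[i][j] < 0 else "BAD"


# Toy 3 x 3 elevation grid in metres (hypothetical values, not GEBCO data).
STEP = 15 / 3600  # 15 arcsec expressed in degrees
toy_elevation = [
    [-850.0, -420.0, -15.0],
    [-600.0, -90.0, 12.0],
    [-300.0, 5.0, 140.0],
]

flag_sea = position_on_land_flag(43.0, 9.0, toy_elevation, 43.0, 9.0, STEP)
flag_land = position_on_land_flag(43.0 + 2 * STEP, 9.0 + 2 * STEP, toy_elevation, 43.0, 9.0, STEP)
print(flag_sea, flag_land)
```

A nearest-node lookup is only one possible choice; interpolating the bathymetry to the launch position would be an equally plausible alternative.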

4.1.2 Depth check

The depth values of each XBT profile are compared to the local bottom depth extracted from GEBCO (test 2) and to the last good depth value provided by the operator (test 3). Depth values are flagged as GOOD if they are shallower than the reference depth value (i.e., numerically smaller); otherwise, they are flagged as BAD. The corresponding local bottom depth extracted from GEBCO (BATHYMETRIC_INFORMATION) and the last good depth value provided by the operator (LAST_GOOD_DEPTH_ACCORDING_TO_OPERATOR) are annotated in the metadata as global attributes associated with each profile to facilitate further analysis by expert users.
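As a minimal sketch, tests 2 and 3 reduce to the same comparison against two different reference depths. The function name and the sample values below are hypothetical, and the handling of a depth exactly equal to the reference (flagged BAD here) is an assumption.

```python
def depth_flags(depths, reference_depth):
    """Flag each depth (m, positive downward) against a reference depth.

    Assumption: a depth equal to the reference is treated as BAD.
    """
    return ["GOOD" if z < reference_depth else "BAD" for z in depths]


# Hypothetical profile depths and reference values.
profile_depths = [0.6, 250.0, 760.0, 790.0]
gebco_bottom_depth = 800.0   # stands in for BATHYMETRIC_INFORMATION
operator_last_good = 770.0   # stands in for LAST_GOOD_DEPTH_ACCORDING_TO_OPERATOR

flags_test2 = depth_flags(profile_depths, gebco_bottom_depth)
flags_test3 = depth_flags(profile_depths, operator_last_good)
print(flags_test2)
print(flags_test3)
```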

Table 2. Summary of the automated QC tests, the exit values assigned to each measurement and the ancillary variables containing them.


4.1.3 Gross range check

The gross range check applies a gross filter on observed temperature considering T thresholds that vary on five vertical layers, as reported in Table 3. T thresholds have been defined analyzing the seasonal T distribution in four sub-regions displayed in Fig. 3: (1) the Ligurian Sea; (2) the northern Tyrrhenian Sea; (3) the southwest Tyrrhenian Sea; (4) the southeast Tyrrhenian Sea. The domain subdivision is based on the mean circulation features at 15 and 350 m depth, computed using the Mediterranean Sea reanalysis (Simoncelli et al., 2014) data over the time period 1999–2018 (Fig. 3). A detailed description of the circulation is out of scope here, but its main features are detailed in Pinardi et al. (2015) and von Schuckmann et al. (2016, Sect. 3.1).
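A layer-dependent range check of this kind can be sketched as follows. The layer boundaries and temperature limits in the snippet are placeholders chosen only for illustration; they are not the values of Table 3, and the sub-region dependence is omitted.

```python
from bisect import bisect_right

# Hypothetical layer bottoms (m) and temperature limits (degC): NOT the Table 3 values.
LAYER_BOTTOMS = [80.0, 200.0, 450.0, 1000.0]  # five layers, the last one reaches the bottom
T_LIMITS = [(12.0, 30.0), (12.5, 20.0), (13.0, 16.0), (13.0, 15.0), (12.8, 14.5)]


def layer_index(depth):
    """Index (0-4) of the vertical layer containing `depth`."""
    return bisect_right(LAYER_BOTTOMS, depth)


def gross_range_flags(depths, temps):
    """Flag each temperature against the limits of the layer it belongs to."""
    flags = []
    for z, t in zip(depths, temps):
        t_min, t_max = T_LIMITS[layer_index(z)]
        flags.append("GOOD" if t_min <= t <= t_max else "BAD")
    return flags


gross_flags = gross_range_flags([5.0, 150.0, 600.0, 1500.0], [24.0, 14.2, 35.0, 13.9])
print(gross_flags)
```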

4.1.4 Surface check

In general, a probe needs a couple of seconds from the impact with the sea surface to stabilize its motion and reach the terminal velocity (Bringas and Goni, 2015, and references therein). Different approaches have been followed over the years on how to handle the near-surface values. In the late 1970s, the Intergovernmental Oceanographic Commission of UNESCO (IOC) proposed to isothermally extrapolate the values from 3 to 5 m upward to obtain the surface temperature for encoding (Intergovernmental Oceanographic Commission, 1975), while the FNWC (US Navy Fleet Numerical Weather Central) procedure was to extrapolate from 8 ft (2.4 m) to the surface using the slope at that depth. Wannamaker (1980) suggested reaching the surface starting from 4 m using the slope between 4 and 6 m depth. Afterwards, other authors decided to discard the initial measurements, considering only the values starting from a certain depth to be valid, also depending on the DAQ used (e.g., Bailey et al., 1994; Intergovernmental Oceanographic Commission, 1997; Kizu and Hanawa, 2002; Gronell and Wijffels, 2008; Cowley and Krummel, 2022, and references therein). For example, Manzella et al. (2003) selected the value at 5 m depth as the first acceptable value during the MFS-PP project and then changed it to 4 m during MFS-TEP.

Here, all the original measurements are instead provided to the user, together with a test that analyzes the measurements in the surface layer and annotates the resulting exit value in the ancillary variable. The proposed test chooses as reference the value recorded at time t=0.6 s (the first value currently considered acceptable), calculates the differences between this value and shallower measurements, and classifies them using the T standard uncertainty (SD) associated with an XBT probe (0.10 °C) as a metric. In detail, the temperature difference T(t_0.6)−T(t_i), with $0.0 \leq t_{i} \leq 0.5$ s, is calculated, and the QF is assigned as follows:

GOOD if $| T (t_{0.6}) - T (t_{i}) | \leq 1 \cdot SD$ ,
PROBABLY GOOD if $1 \cdot SD < | T (t_{0.6}) - T (t_{i}) | \leq 2 \cdot SD$ ,
PROBABLY BAD if $2 \cdot SD < | T (t_{0.6}) - T (t_{i}) | \leq 3 \cdot SD$ ,
BAD if $| T (t_{0.6}) - T (t_{i}) | > 3 \cdot SD$ .

The flag GOOD means a value indistinguishable from the record at t=0.6 s, while PROBABLY GOOD defines an excellent compatibility. The PROBABLY BAD and BAD flags simply indicate a difference greater than the established threshold with respect to the reference value at t=0.6 s.
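The classification above can be written compactly as in the following sketch. It assumes that samples are recorded at 10 Hz starting at t = 0.0 s, so that the reference value at t = 0.6 s is the seventh sample; the function name and the temperature values are hypothetical.

```python
SD = 0.10      # T standard uncertainty of an XBT probe (degC)
REF_INDEX = 6  # sample at t = 0.6 s, assuming 10 Hz sampling starting at t = 0.0 s


def surface_flags(temps, ref_index=REF_INDEX, sd=SD):
    """Classify the samples shallower than the reference by |T(t_0.6) - T(t_i)|."""
    t_ref = temps[ref_index]
    flags = []
    for temp in temps[:ref_index]:
        diff = abs(t_ref - temp)
        if diff <= 1 * sd:
            flags.append("GOOD")
        elif diff <= 2 * sd:
            flags.append("PROBABLY GOOD")
        elif diff <= 3 * sd:
            flags.append("PROBABLY BAD")
        else:
            flags.append("BAD")
    return flags


# Hypothetical first seven samples of a profile (t = 0.0, 0.1, ..., 0.6 s).
near_surface = [24.95, 24.60, 24.42, 24.33, 24.25, 24.22, 24.20]
near_surface_flags = surface_flags(near_surface)
print(near_surface_flags)
```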

4.1.5 Inversion and gradient checks

This test is performed to detect unrealistic T oscillations with abrupt T reversals or unusually large T gradients. The vertical gradient is defined from the difference between vertically adjacent measurements, $T_{z} = (T_{2} - T_{1}) / (Z_{2} - Z_{1})$ , where T₂ and T₁ are temperatures at depths Z₂ and Z₁, with level 2 being deeper than level 1. This test is applied three times iteratively, discarding values that failed the test in the following iteration. The acceptable T gradient ranges (Table 3) have been defined through a statistical analysis in five vertical layers and four sub-regions (Fig. 3) through an approach that blends expert decisions with statistical support. Due to the spatial (horizontal and vertical) and temporal sparseness of the data, the 0.0001th and 0.9999th quantiles have been computed in the five layers while considering (1) the whole dataset, (2) the four sub-regions, and (3) the entire domain but for four seasons. The thresholds are the absolute minimum 0.0001th quantile and maximum 0.9999th quantile derived from the three cases. The thresholds of the two deepest levels are from case 1, the upper layer uses values from case 2, and the second and third layers use the results of case 3.
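The iterative character of the test can be illustrated with the sketch below. It makes several assumptions that are not stated in the text: a single gradient range is used instead of the layer-dependent ranges of Table 3, the range values are placeholders, and the deeper point of each failing pair is the one that gets flagged. With this simple pairing rule, the point that follows an isolated spike would also be flagged, so the sketch should not be read as the package's actual behavior.

```python
def gradient_flags(depths, temps, g_min, g_max, n_iter=3):
    """Iterative vertical-gradient check.

    In each pass, gradients are computed between consecutive points that have
    not yet been flagged; points flagged in one pass are discarded in the next.
    """
    bad = [False] * len(depths)
    for _ in range(n_iter):
        kept = [i for i in range(len(depths)) if not bad[i]]
        newly_bad = []
        for upper, lower in zip(kept, kept[1:]):
            t_z = (temps[lower] - temps[upper]) / (depths[lower] - depths[upper])
            if not (g_min <= t_z <= g_max):
                newly_bad.append(lower)  # assumption: flag the deeper point
        if not newly_bad:
            break
        for i in newly_bad:
            bad[i] = True
    return ["BAD" if b else "GOOD" for b in bad]


# Hypothetical profile jumping to a high plateau, as in a wire break on the probe side.
z = [10.0, 10.6, 11.2, 11.8, 12.4]
t = [20.0, 19.9, 35.0, 35.0, 35.0]
one_pass = gradient_flags(z, t, g_min=-1.0, g_max=0.5, n_iter=1)
three_passes = gradient_flags(z, t, g_min=-1.0, g_max=0.5, n_iter=3)
print(one_pass)
print(three_passes)
```

In this example a single pass flags only the first value of the plateau, while the following passes, which compare the remaining plateau values with the last accepted measurement, flag the rest.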

Table 3. Temperature and thermal gradient thresholds defined in five layers.


4.1.6 Wire break/stretch

Results of inversion and gradient checks are used to identify sharp variations toward negative values, indicating that the copper wire breaks on the shipside, or toward high values (close to 35 °C or more), when the wire breaks on the probe side, where there is often a progressive increase in temperature values rather than a step transition to full scale.

4.1.7 Spike detection

This test looks for single value spikes, and it checks T measurements for large differences between adjacent values. A spike is detected by computing the median value (Med_k) in a five-point interval (3 m approximately), with the profile value at the central point of the interval (T_k). The spike is detected, and the consequent flag is applied if T_k is not equal to Med_k and the difference (s_k) between T_k and the mean (Ave_k) in the chosen interval is greater than a threshold value.

\begin{matrix} (1) & {Med}_{k} = median (T_{k - 2} : T_{k + 2}) \\ (2) & {Ave}_{k} = mean (T_{k - 2} : T_{k + 2}) \\ (3) & s_{k} = T_{k} - {Ave}_{k}, c_{k} = T_{k} - {Med}_{k} \neq 0 \end{matrix}

The spike threshold values have been defined for the entire region in five vertical layers as the 0.999th quantile of the s_k distribution, and they are reported in Table 4. Figure 4a shows the probability distribution of the s_k values with c_k not equal to zero in the five layers. The s_k distribution is characterized by large values above 80 m that diminish with depth, as the temperature variability does. The s_k scatterplot (Fig. 4b) shows its values along the water column, with the red dots highlighting the values over the selected thresholds.
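Equations (1)–(3) translate into a short routine, sketched here under a few assumptions: a single threshold is used instead of the layer-dependent values of Table 4, the threshold value is a placeholder, the comparison is made on the absolute value of s_k, and the two points at each end of the profile (where a full five-point window is not available) are left unflagged.

```python
from statistics import mean, median


def spike_flags(temps, threshold):
    """Five-point spike test: flag T_k if c_k != 0 and |s_k| exceeds the threshold."""
    flags = ["GOOD"] * len(temps)
    for k in range(2, len(temps) - 2):
        window = temps[k - 2:k + 3]
        c_k = temps[k] - median(window)  # Eq. (3), difference from the median
        s_k = temps[k] - mean(window)    # Eq. (3), difference from the mean
        if c_k != 0 and abs(s_k) > threshold:
            flags[k] = "BAD"
    return flags


# Hypothetical temperatures (degC) with a single spike at index 2.
spiky = [18.0, 17.9, 19.5, 17.8, 17.7, 17.6, 17.5]
spike_result = spike_flags(spiky, threshold=0.3)
print(spike_result)
```

The condition on c_k prevents the neighbors of a spike from being flagged: their value often coincides with the window median even though the window mean is pulled toward the spike.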

Table 4. Spike detection threshold defined in five vertical layers.


4.1.8 High-frequency noise

This test helps to identify critical T drops in the profile (such as large T differences over a large depth) by checking for continual spiking over a wide range of depths (Cowley and Krummel, 2022). In the case of continual spikes, values before and after a chosen interval (4 m approximately, i.e., seven points) are tested against the same acceptable ranges of T inversion and gradient used in the inversion and gradient checks and are flagged as BAD if they fall outside these ranges.

Figure 2. Example of the QFs generated by the automatic QC tests (Table 2) applied to a temperature profile. The raw profile is on the top left, and the final interpolated profile is on the bottom right.

Figure 3. Maps of the mean circulation computed from the Mediterranean Sea reanalysis dataset (Simoncelli et al., 2014) at (a) 15 m and (b) 350 m depth.

Figure 4. (a) Probability distribution of the spike parameter (s_k) in five layers, with a zoom on probabilities below 0.1 %. (b) Vertical distribution of s_k, with the values above the 0.999th quantile indicated in red.

4.2 Mapping QC test exit values to standard quality flags

Each basic QC test assigns a corresponding exit value to each original depth and T record (Table 2) within the vertical profile in the DEPTH_TEST_QC and TEMPET01_TEST_QC ancillary variables, respectively. The mapping of these ancillary variables to QFs is necessary to allow the user to filter the original data according to the quality requirements for the intended use.

The adopted QFs, whose labels and corresponding definition are reported in Table 5, have been selected from the SDN common vocabulary (Intergovernmental Oceanographic Commission, 2013, 2019; https://www.seadatanet.org/Standards/Common-Vocabularies, last access: 28 November 2024). The QF (Table 5) associated with each original T measurement or depth value summarizes the results of the performed automatic tests, and it is stored in the dedicated ancillary variable (TEMPET01_FLAGS_QC or DEPTH_FLAGS_QC).

Table 5. The quality flags (QFs) selected from the SeaDataNet common vocabulary (Intergovernmental Oceanographic Commission, 2013, 2019) assigned to the reprocessed XBT data.


The DEPTH_TEST_QC variable contains the outcome of two tests, one based on GEBCO local bathymetry (test 2 in Table 2) and one based on the last good depth recorded by the operator (test 3 in Table 2). Since the GEBCO local bathymetry was often in disagreement with the operator information, we decided to keep the output of test 3 in DEPTH_FLAGS_QC, considering the operator's annotation more reliable.

The general rule adopted for mapping the QC test exit values to T QFs is the following:

GOOD (QF = 1) where all the tests are passed,
BAD (QF = 4) where at least one of the checks fails.

We decided to use a higher level of detail, introducing probably good (QF = 2) and probably bad (QF = 3) flags when needed, since the surface test (test 5 in Table 2) and the inversion and gradient tests (test 6 in Table 2) can provide more information on profile behavior. After applying the general rule for GOOD and BAD flags, we consider the flags coming from these two tests and update the QFs as follows:

PROBABLY GOOD (QF = 2) if the surface test returns a probably good flag,
PROBABLY BAD (QF = 3) if the surface and/or the inversion test returns a probably bad flag.
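The mapping rules can be summarized in a small function, given here as a sketch under stated assumptions. The per-test outcomes are encoded with the same numbers as the QFs (a hypothetical encoding, since the actual exit values are those of Table 2), a test is considered failed only when its outcome is BAD, and a BAD flag is never upgraded by the graded tests. The precedence of PROBABLY BAD over PROBABLY GOOD is also an assumption.

```python
GOOD, PROBABLY_GOOD, PROBABLY_BAD, BAD = 1, 2, 3, 4


def map_to_qf(outcomes):
    """Combine per-test outcomes of one measurement into a single quality flag.

    `outcomes` maps a (hypothetical) test name to an outcome in {1, 2, 3, 4}.
    """
    # General rule: BAD if at least one test fails, GOOD otherwise.
    qf = BAD if BAD in outcomes.values() else GOOD
    if qf == GOOD:
        # Refinement based on the surface and inversion/gradient tests.
        graded = (outcomes.get("surface"), outcomes.get("inversion_gradient"))
        if PROBABLY_BAD in graded:
            qf = PROBABLY_BAD
        elif outcomes.get("surface") == PROBABLY_GOOD:
            qf = PROBABLY_GOOD
    return qf


qf_all_passed = map_to_qf({"gross_range": GOOD, "surface": GOOD, "spike": GOOD})
qf_surface_pg = map_to_qf({"gross_range": GOOD, "surface": PROBABLY_GOOD})
qf_inversion_pb = map_to_qf({"surface": PROBABLY_GOOD, "inversion_gradient": PROBABLY_BAD})
qf_spike_failed = map_to_qf({"spike": BAD, "surface": PROBABLY_GOOD})
print(qf_all_passed, qf_surface_pg, qf_inversion_pb, qf_spike_failed)
```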

Only measurements that have associated T and depth QFs equal to 1 or 2 have been used for the interpolation at each meter depth. A relative QF associated with the interpolated profile (interpolated value, QF = 8) has also been generated in order to have a label for when there is a gap of more than five consecutive points in the original profile, which coincides with the number of points used to detect spikes (∼3 m).

4.3 Calibration of the XBT system and correction

As previously highlighted, checking with a tester provides an assessment of the efficiency of an XBT system. Once a tester is connected to an XBT system in a simulated drop, the tester's measurement indicates how the XBT system's reading differs from nominal values at some reference temperatures. These differences, which can be constant or variable over the time interval of data acquisition, can then be used to correct the values of the XBT profiles. Each tester used during the campaigns on the MX04 line after July 2010 has two reference temperatures (see Appendix A for details).

Checks were routinely performed immediately before the first drop and after the last drop. Further checks were carried out whenever the computer or DAQ had failures. The differences measured at the reference temperatures at the start and end of each MX04 cruise are shown in Fig. 5a, while their drift during a cruise is shown in Fig. 5b. The values vary only slightly over time, but large anomalies occurred in September 2013 (cruise 14) and June 2014 (cruise 18) for unknown reasons. The DAQ used in those campaigns showed an initial offset followed by a random and oscillating variability throughout the day: for example, the recorded values during the checks in June 2014 were 26.678 °C (start), 26.649, 26.668 and 26.666 °C (end) instead of 26.758 °C. This type of anomaly was also found by Reseghetti et al. (2018) during comparison tests between XBT and CTD, where it was pointed out that the T differences between the XBT and CTD profiles were heavily affected by the DAQ functioning.

Figure 5. (a) Temperature difference (XBT system tester) obtained from the checks at the reference temperatures before starting and at the end of each MX04 cruise. (b) Difference between initial and final measurements with the tester during the same cruise at the reference temperatures.

4.3.1 Correction algorithm

The measurements with a tester are used to correct the T values of each XBT profile of a campaign under the assumption that the difference between the initial and final tester readings at reference temperatures varies linearly over time from the beginning to the end of the campaign. The reference values are obtained by calculating the average resistance value over the last 30 consecutive recorded values at each temperature in the simulated drop (i.e., 3 s of acquisition, with a sampling frequency of 10 Hz) and then converted into T values (for details, see Appendix A). The differences between the nominal temperatures and the read values are linearly interpolated as a function of the time elapsed since the first launch to calculate their hypothetical value in correspondence with each XBT probe during the campaign. In the case of a single-point tester, a constant correction is added to each value of the XBT profile. In the case of a two-point tester, the correction is obtained by a further linear interpolation based on the differences at the upper and lower temperatures of this tester.

The notation is as follows:

N is the number of XBT probes deployed during the campaign;
T₊ and T₋ are the nominal upper and lower temperatures on the tester;
$Δ T_{+, i}$ and $Δ T_{+, f}$ are the initial and final temperature differences at the value T₊;
$Δ T_{-, i}$ and $Δ T_{-, f}$ are the initial and final temperature differences at the value T₋;
t_i and t_f are the initial and final times of the XBT drops (usually, t_i is set to 0);
t_k is the time elapsed from the initial check with the tester, which is assumed to be coincident with the first XBT drop ( $1 \leq k \leq N$ );
$T_{+, k}$ and $T_{-, k}$ are the theoretical upper and lower temperatures that the tester should read at the kth drop.

These last values can be calculated as

\begin{matrix} (4) & T_{+, k} = T_{+, i} + Δ T_{+, k} \end{matrix}

and

\begin{matrix} (5) & T_{-, k} = T_{-, i} + Δ T_{-, k}, \end{matrix}

where the estimated differences at upper and lower reference T corresponding to the kth drop are

\begin{matrix} (6) & Δ T_{+, k} = - [Δ T_{+, i} + (\frac{Δ T_{+, f} - Δ T_{+, i}}{t_{f} - t_{i}}) (t_{k} - t_{i})] \end{matrix}

and

\begin{matrix} (7) & Δ T_{-, k} = - [Δ T_{-, i} + (\frac{Δ T_{-, f} - Δ T_{-, i}}{t_{f} - t_{i}}) (t_{k} - t_{i})] . \end{matrix}

The so calculated contributions are combined in the correction term for the specific kth XBT:

\begin{matrix} (8) & Δ T_{corr, k} = (\frac{Δ T_{+, k} - Δ T_{-, k}}{T_{+} - T_{-}}) (T_{read, k} - T_{-}) + Δ T_{-, k} . \end{matrix}

Then, the original value T_read,k recorded by the DAQ is added:

\begin{matrix} (9) & T_{corr, k} = T_{read, k} + Δ T_{corr, k} . \end{matrix}

T_corr,k is thus the value that best represents the actual seawater temperature measured by the kth XBT probe, assuming that the calculated correction (based on the initial and final measurements provided by the tester) is the best way to describe how the XBT system operates when the probe is deployed. Obviously, ΔT_corr,k is not related to the measurement quality due to the probe characteristics or to possible issues during data acquisition.

When the calibration is available, the correction calculated in this way is applied to the raw data prior to the QC analysis, but it is also provided as a separate variable (CALIB) so that the user might decide to remove it. This correction must absolutely not be applied to the profiles from XCTD-1 probes because their acquisition circuit works in a completely different way, and the shipboard DAQ simply acts as a data receiver and does not play an active role in the measurement.

4.4 Vertical interpolation

Three interpolation methods were tested: linear (LI), RR (Reiniger and Ross, 1968) and MR-PCHIP (Barker and McDougall, 2020). The goal is to select the most conservative method, i.e., the one that provides the closest interpolated T values to the original reading. The original measurements of each XBT profile were subsampled by discarding half of them; the discarded measurements were then used as control values against the newly interpolated ones to calculate differences and root mean square differences (RMSDs) and therefore evaluate the best interpolation method for our dataset.

Original values have been interpolated with the three methods on the control depth levels, and the resulting T estimates have been compared with the measured ones. Figure 6 shows an example of an observed profile with highlighted control levels (magenta), the interpolated profile with the three considered methods and the relative differences (interpolated-original). Figure 6a presents an example of the large T differences that occur between interpolated and measured values (0.4 or −0.2 °C) along the thermocline at about 35 m. Figure 6b shows a step-like profile below 600 m depth where the differences are very small (less than 0.02 °C), but they can slightly increase and differ among the three methods where T vertical gradients occur.

Mean bias and RMSD have been computed in vertical bins (766) of 3 m thickness, and the obtained metric profiles are displayed in Fig. 7, associated with their relative vertical data distributions. These metrics have been computed for the whole dataset and for two separate time periods: from June to November (when the thermocline is well developed) and from December to May (when the water column is more homogeneous). The mean bias in Fig. 7 presents values in the range of (−0.001, +0.001) °C, and the interval halves from December to May, whereas it practically doubles and is in the range of (−0.002, +0.001) °C from June to November. The maximum RMSD when considering all profiles is about 0.04 °C, and it halves from December to May, while it is close to 0.06 °C from June to November. Except for the December–May plot, the maximum RMSD values are associated with LI and RR methods, but we note that RMSD <0.01 °C for the three methods below 100 m depth.
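The evaluation protocol (withholding control values, interpolating at their depths and computing binned statistics) can be sketched for the linear method alone. The choice of every other sample as a control value, the function names and the toy profile are assumptions made for illustration; the RR and MR-PCHIP schemes are not reproduced here.

```python
import math


def linear_interp(x, xs, ys):
    """Piecewise-linear interpolation at x on sorted nodes xs (xs[0] <= x <= xs[-1])."""
    for x0, y0, x1, y1 in zip(xs, ys, xs[1:], ys[1:]):
        if x0 <= x <= x1:
            return y0 + (y1 - y0) * (x - x0) / (x1 - x0)
    raise ValueError("x outside the interpolation range")


def holdout_errors(depths, temps):
    """Withhold every other sample as control and return (depth, interpolated - measured)."""
    kept_z, kept_t = depths[0::2], temps[0::2]
    control = zip(depths[1::2], temps[1::2])
    return [(z, linear_interp(z, kept_z, kept_t) - t)
            for z, t in control if kept_z[0] <= z <= kept_z[-1]]


def binned_rmsd(errors, bin_size=3.0):
    """RMSD of the errors grouped in vertical bins of `bin_size` metres."""
    bins = {}
    for z, err in errors:
        bins.setdefault(int(z // bin_size), []).append(err)
    return {b: math.sqrt(sum(e * e for e in errs) / len(errs))
            for b, errs in sorted(bins.items())}


# Toy profile with a change of slope between 2 and 4 m.
toy_depths = [0.0, 1.0, 2.0, 3.0, 4.0]
toy_temps = [20.0, 20.0, 20.0, 19.0, 16.0]
toy_errors = holdout_errors(toy_depths, toy_temps)
toy_rmsd = binned_rmsd(toy_errors)
print(toy_errors)
print(toy_rmsd)
```

The same routine can be run with any interpolator in place of `linear_interp`, which is how different methods could be compared on identical control levels.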

The total RMSD on the entire water column has been summarized in Table 6 for the three time periods and the surface layer above 100 m. In fact, the total bias estimated is zero for the three methods and the three time periods, and the total RMSD is 0.011 °C for LI, 0.011 °C for RR and 0.010 °C for MR-PCHIP, while in the surface layer, the values are 0.023, 0.021 and 0.019 °C, respectively. The maximum RMSD values usually occur during the stratified period (June–November) with values equal to 0.013 °C for LI, 0.012 °C for RR and 0.011 °C for MR-PCHIP that, in the surface layer, become 0.030, 0.027 and 0.023 °C, respectively.

The computed metrics in vertical bins present very small values, which are much lower than the specified T uncertainty (0.10 °C). However, the absolute differences in the surface layer when the thermocline settles can be larger than 0.2 °C, as in Fig. 6. The MR-PCHIP interpolation always presents the smallest error for the analyzed dataset (Table 6) with respect to the reference values; thus, it has been applied here.

Figure 6. Temperature profiles in the surface layer, 1–100 m (a), and in the deep layer, 600–1800 m (b). In the leftmost column, magenta dots represent the control records; in the middle column are the temperature values interpolated with LI (linear), RR (Reiniger and Ross, 1968) and MR-PCHIP (Barker and McDougall, 2020); and in the rightmost column are the differences between the interpolated and measured T values.

Table 6. Summary of the computed metrics from the three interpolation methods: linear (LI), RR and MR-PCHIP temperature RMSD [°C], computed in the entire water column and in the surface layer (0–100 m) from the whole dataset (All) and in two time periods, December–May (mixed) and June–November (stratified).


Figure 7. Profile of mean bias (a, d, g) and RMSD (b, e, h) computed from profiles interpolated on selected depths and compared to the corresponding measured values considering the three methods: linear (LI), MR-PCHIP (MR), and Reiniger and Ross (RR). Three different time spans are shown: (a–c) the whole dataset, (d–f) from December to May, and (g–i) from June to November. (c, f, i) Vertical data distribution in 3 m bins.

5 Results

The QC algorithms applied to the dataset are not capable of catching all erroneous values. According to Good et al. (2023), all automatic QC tests produce a percentage of true positives (TPs; correctly detected erroneous data) and false positives (FPs; incorrectly detected erroneous data), and the general aim would be to maximize the TP (correct flagging) rate and minimize the FP (incorrect flagging) rate.

The new automatic QC procedure has been tuned using visual checks to reach an optimal TP/FP rate. Specifically, efforts have been made to tune the vertical gradient and spike thresholds using quantile analysis to maximize the detection of erroneous data (TP) and minimize flagging of GOOD data as BAD (FP). This was particularly tricky for the vertical gradient test, which detected 121 profiles with out-of-bounds values, but 28 of them appeared to be FPs in the visual check (an FP/TP rate of 23 %, computed as the share of FPs among the flagged profiles, 28/121). In fact, the strong seasonal stratification of the Mediterranean Sea and the presence of several water masses in different water layers might cause the incorrect flagging of GOOD data as BAD (FP), as shown in Fig. 8b, d. This makes the vertical gradient test non-optimal for the Mediterranean Basin with a high FP rate; thus, a very small percentage associated with the quantiles has been selected to minimize this.

The spike test is much more effective (331 profiles with detected spikes, of which 11 are FPs), providing a low FP/TP rate (3.3 %, i.e., 11/331). Figure 9 shows example profiles with TP spikes (panel a) and FP spikes (panel b), mainly marked at the start of the thermocline.

However, some profiles present anomalous features that an automatic QC procedure could not detect. The decision was to add a flag associated with the whole profile indicating the depth range where unrecoverable problems began. The decision is based on the knowledge of the main physical characteristics of the water masses present in the analyzed region. In fact, the very small Rossby radius (∼11 km on average) and the occurrence of repeated and well-documented thermal inversions must always be considered when the quality of the T profiles is analyzed. Step-like structures (“staircases”) are also typical of the southern Tyrrhenian Sea and are usually explained in terms of the double-diffusion process (Meccia et al., 2016; Durante et al., 2021).

Sometimes, the meteorological conditions and an inaccurate knowledge of the bathymetry can make the expert validation of XBT profiles difficult, but their extreme variability can also be ascribed to multiple instrumental and operational factors. In every XBT drop, the correct unwinding of the wire from both spools and adequate and complete protection by the insulating substance along its entire length are essential to guarantee good quality of the recorded data. For example, most profiles from XBTs launched from ships traveling at low speed (i.e., v<15 knots, less than 10 % of the dataset) are generally less affected by significant electrical disturbances even in the presence of wind. Unfortunately, the ships used on the MX04 line (to which most of the REP profiles belong) have a standard speed close to 22 knots, and this makes the acquisition conditions vulnerable. The XBT profiles from containerships also have a lower quality due to the usually very high launch position (h>25 m), which makes the probe depth in the initial measurements provided by software questionable (Bringas and Goni, 2015). As mentioned in Sect. 2, the electric current that circulates in the unwinding copper wire transforms it into an antenna sensitive to all electromagnetic phenomena occurring nearby. The occurrence of atmospheric events (thunderstorms with lightning) can have a non-negligible impact on the recorded signal, as can the proximity to on-board instrumentation that produces significant electromagnetic fields and whose operation is random. The physical parameter measured by the XBT system is the electrical resistance, which has two components: one is from the copper wire and the other from the NTC thermistor which falls through the water column. Gusts of wind combined with turbulence produced by the ship hull can produce “whiplash” on the copper wire and badly influence the shape of the profiles collected under particularly unfavorable wind conditions.

Figure 8. Examples of temperature gradient flags applied to different XBT profiles: (a) true positive vertical gradient anomaly in the surface layer, (b) false positive vertical gradient anomaly in the surface layer, (c) true positive vertical gradient anomaly in the bottom layer, and (d) false positive vertical gradient anomaly in the bottom layer. The sub-plots have different axis ranges.

Figure 9. Examples of spikes detected in two different XBT profiles: (a) true positive spikes and (b) false positive spike at the start of a steep thermocline. The orange dots in the right panels of (a) and (b) indicate the estimated value of the s_k parameter having a c_k not equal to zero. The sub-plots have different axis ranges.

A difficult task has been to identify the external influences that cause high-frequency noise in the T profile, as in the examples in Fig. 10c–e, and to decide how to annotate them in the metadata. Some other thermal structures that are anomalous compared to what is expected in a certain period, region and depth layer are shown in Fig. 10a–b and f. The visual check carried out by the expert allows, in some cases, for highlighting notable deviations in the shape and/or values of a profile compared to adjacent ones. The probability that two XBT probes adjacent in time and space record the same type of anomalous structure because of a malfunction is considered negligible, so such a recurrence favors a physical origin rather than the non-optimal functioning of a specific probe. For this reason, the initial BAD attribution to anomalous structures was sometimes revised after comparison with adjacent profiles that present similar features (e.g., Fig. 10a).

5.1 Comparison with SeaDataNet data

A significant part of the XBT profiles included in this dataset has been systematically disseminated through the SDN infrastructure and can be accessed from the data access portal through the saved query URL (https://cdi.seadatanet.org/search/welcome.php?query=1866&query_code={4E510DE6-CB22-47D5-B221-7275100CAB7F}, last access: 28 November 2024). Alternatively, they can be found in the Mediterranean aggregated dataset product (Simoncelli et al., 2020b), in which they are integrated with other data types (CTDs, bottles, mechanical bathythermographs (MBTs), profiling floats). This data product has been further validated in the framework of the SeaDataCloud project (https://www.seadatanet.org/About-us/SeaDataCloud, last access: 28 November 2024) as described in Simoncelli et al. (2020a).

The SDN XBT dataset, extracted from Simoncelli et al. (2020b), is considered here a benchmark to highlight the main effects of the proposed data reprocessing. Bias and RMSD profiles have been computed from 3104 matching profiles, with a vertical data distribution shown in Fig. 11. Since SDN profiles do not have the calibration correction, we have computed separate metrics for the REP profiles with and without the correction applied. In Fig. 11, the black dots represent all matching profiles, the green dots represent the profiles without correction and the purple dots represent the profiles with the correction applied.
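The two metrics can be illustrated with a minimal sketch. It assumes that the matching REP and SDN profiles are stored as equal-length lists on the same 1 m depth grid, with NaN where a profile has no data. The function name and the toy values are illustrative and are not part of the published processing chain. The bias is taken here as the mean REP minus SDN difference, so that a positive bias corresponds to warmer REP values.

```python
import math


def bias_and_rmsd(rep_profiles, sdn_profiles):
    """Return per-depth bias, RMSD and sample count for matched profiles.

    Each argument is a list of profiles; each profile is a list of
    temperatures on a common depth grid, with NaN marking missing values.
    """
    n_levels = len(rep_profiles[0])
    bias, rmsd, count = [], [], []
    for k in range(n_levels):
        # Differences REP - SDN at depth level k, skipping missing values.
        diffs = [
            rep[k] - sdn[k]
            for rep, sdn in zip(rep_profiles, sdn_profiles)
            if not (math.isnan(rep[k]) or math.isnan(sdn[k]))
        ]
        count.append(len(diffs))
        if diffs:
            bias.append(sum(diffs) / len(diffs))
            rmsd.append(math.sqrt(sum(d * d for d in diffs) / len(diffs)))
        else:
            # No matching data at this level.
            bias.append(float("nan"))
            rmsd.append(float("nan"))
    return bias, rmsd, count


# Toy example (invented values): two matched profiles on three depth levels.
nan = float("nan")
rep = [[15.0, 14.0, 13.5], [16.0, 14.5, nan]]
sdn = [[14.0, 14.5, 13.5], [15.0, 15.0, 13.0]]
bias, rmsd, count = bias_and_rmsd(rep, sdn)
```

Returning the sample count together with the metrics makes the effect of the decreasing number of observations with depth explicit when the profiles are interpreted.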

Figure 10. Examples of profiles with critical features: (a, b, f) anomalous thermal structures and (c, d, e) profiles affected by high-frequency noise. The name of the selected profiles is shown in the legend. The sub-plots have different axis ranges.

The maximum discrepancy between the two data versions always resides within the surface layer, down to 150 m depth. The maximum bias and RMSD reach approximately 0.05 and 0.2 °C, respectively, which might imply potentially significant changes in downstream applications. The bias is larger (∼0.06 °C) when estimated from profiles without calibration correction and slightly smaller (∼0.04 °C) when estimated from calibrated profiles, while the largest RMSD derives from profiles with the correction applied, indicating that the correction slightly increases, on average, the REP temperature values and consequently the positive bias.

The REP profiles are warmer than SDN ones in the surface layer and below 900 m, while between 150 and 800 m, both metrics are small and consistent. The overall mean bias and RMSD are equal to 0.002 and 0.041 °C, respectively. Such differences are mainly due to the new interpolation technique; the lack of filtering; the application of calibration correction in the REP dataset; and, in very few cases, the use of wrong FRE coefficients or the incorrect probe type assignment in SDN, which can produce a change in the depth values. The sharp reduction in the number of observations available below about 900 m depth and the application of the tester correction affect the shape of both BIAS and RMSD profiles.

Figure 11. Comparison between the reprocessed (REP) and the corresponding SeaDataNet (SDN) profiles at each meter depth: (a) bias mean profile, (b) RMSD profile, and (c) cumulative vertical data distribution which shows the relative contribution of profiles with calibration and profiles without calibration to the total.

Figure 12 shows examples of matching REP and SDN profiles and their difference, with the focus on the surface (panel a) and bottom layer (panels b and c), where the largest differences occur. During the stratified period, the largest differences reside in the thermocline and can exceed 1.5 °C (Fig. 12a), while in the bottom layer, the calibration correction (see Fig. 12b, c) together with the abrupt decrease in the number of data explain the small positive average bias in Fig. 11a. In fact, numerous T5/20 profiles (maximum rated depth; see Table 1) were launched (∼7 % of the total) in the few campaigns in which the acquisition system showed significant negative anomalies, and this influenced both BIAS and RMSD profiles below 900 m depth. The frequent step-like shape of deep profiles (Fig. 12c), due to double-diffusion processes (Meccia et al., 2016; Durante et al., 2021), instead causes positive spikes in the difference profiles.

In the SDN dataset, the interpolation of raw profiles at each meter depth has been combined with the application of a Gaussian filter to reduce possible noise (Manzella et al., 2003, 2007). Consequently, a general smoothing of T profiles is observed, which is important to remove or reduce unrealistic high-frequency oscillations if needed, but it also affects the values of the whole profile. The main effect is that the shape of thermal structures is smoothed out, more or less evidently depending on the recorded T gradient.
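The smoothing effect described above can be illustrated with a small sketch that applies a normalized Gaussian kernel to a synthetic profile with a sharp thermocline. The kernel width (`sigma_m`), its truncation at four standard deviations, the edge handling and the synthetic profile are all assumptions made for illustration; they do not reproduce the filter settings of Manzella et al. (2003, 2007).

```python
import math


def gaussian_smooth(values, sigma_m=2.0, dz_m=1.0):
    """Smooth a regularly sampled profile with a normalized Gaussian kernel."""
    half_width = math.ceil(4 * sigma_m / dz_m)
    offsets = range(-half_width, half_width + 1)
    kernel = [math.exp(-0.5 * (j * dz_m / sigma_m) ** 2) for j in offsets]
    total = sum(kernel)
    kernel = [w / total for w in kernel]
    last = len(values) - 1
    smoothed = []
    for i in range(len(values)):
        acc = 0.0
        for j, w in zip(offsets, kernel):
            # Repeat the edge values beyond the ends of the profile.
            idx = min(max(i + j, 0), last)
            acc += w * values[idx]
        smoothed.append(acc)
    return smoothed


def max_abs_gradient(values, dz_m=1.0):
    """Largest absolute vertical gradient between consecutive levels."""
    return max(abs(b - a) / dz_m for a, b in zip(values, values[1:]))


# Synthetic stratified profile: warm surface layer, thermocline near 30 m.
depth = list(range(0, 151))
temperature = [14.0 + 10.0 / (1.0 + math.exp((z - 30) / 1.5)) for z in depth]
smoothed = gaussian_smooth(temperature)

print(max_abs_gradient(temperature), max_abs_gradient(smoothed))
```

In this toy case the maximum gradient of the smoothed profile is lower than that of the original one, which is the mechanism behind the large REP minus SDN differences found in the thermocline during the stratified period.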

Figure 12. An example of a reprocessed (REP) profile and the corresponding SeaDataNet (SDN) one on the left and their difference on the right: (a) focus on the surface layer, 0–150 m, and (b, c) focus on the bottom layer, below 800 m.

6 Data availability

The management of the REP dataset has been conceived since the beginning to be compliant with the FAIR data management principles (Wilkinson et al., 2016) and the open-science paradigm. The REP dataset (Reseghetti et al., 2024; https://doi.org/10.13127/rep_xbt_1999_2019.2) is available and accessible through the INGV (Bologna) ERDDAP server (http://oceano.bo.ingv.it/erddap/index.html, last access: 28 November 2024), which allows for machine-to-machine data access, enables downloading subsets of the dataset and gives the users the possibility of selecting among several download formats. ERDDAP is a FAIR-compliant data access service (O'Brien and Delaney, 2024) in line with the GOOS (Global Ocean Observing System) Observations Coordination Group (https://goosocean.org/who-we-are/observations-coordination-group/, last access: 28 November 2024) strategy. In fact, according to Lange et al. (2023), ERDDAP “(i) supports dozens of popular formats; (ii) provides standards-based metadata and data services and formats; (iii) supports federated access of distributed ERDDAP data services; (iv) supports both human and machine interactions; (v) supports sub-setting of large datasets; (vi) provides improved discovery of datasets through commercial search engines; and (vii) provides support for archival of datasets”. The REP dataset is machine-readable, which enables its automated transfer, through a federated ERDDAP server approach, to other repositories and marine data infrastructures, such as EMODnet Physics (https://emodnet.ec.europa.eu/en/physics, last access: 28 November 2024) (Novellino et al., 2024).
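As an illustration of machine-to-machine access, the following sketch builds a subset request URL following the general ERDDAP tabledap pattern (`server/tabledap/datasetID.fileType?variables&constraints`). It assumes that the dataset is served through tabledap. The dataset identifier `REP_XBT_EXAMPLE` is hypothetical, and the variable names used in the query are assumptions: the actual identifier and names should be looked up on the server before use. The sketch only builds the URL and does not contact the server.

```python
from urllib.parse import quote

ERDDAP_BASE = "http://oceano.bo.ingv.it/erddap"  # server named in the text
DATASET_ID = "REP_XBT_EXAMPLE"  # hypothetical: look up the real ID on the server


def build_tabledap_url(base, dataset_id, variables, constraints=(), file_type="csv"):
    """Build an ERDDAP tabledap request URL for a subset of a dataset."""
    query = ",".join(variables)
    for constraint in constraints:
        # Percent-encode operators such as ">" while keeping "=" readable.
        query += "&" + quote(constraint, safe="=")
    return f"{base}/tabledap/{dataset_id}.{file_type}?{query}"


# Variable names below are assumptions based on the names listed in Appendix C.
url = build_tabledap_url(
    ERDDAP_BASE,
    DATASET_ID,
    variables=["time", "latitude", "longitude", "DEPTH_INT", "TEMPET01_INT"],
    constraints=["time>=2015-01-01T00:00:00Z", "time<2016-01-01T00:00:00Z"],
)
print(url)
```

Changing `file_type` (for example to `nc` or `json`) is how a client would select among the download formats offered by the server.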

The raw data with calibration information, bias correction and interpolated data at standard depths after data QC are released with a complete metadata description together with all the processing information in order to facilitate data reuse. The metadata are available through the url_metadata variable (Appendix C1.6). Data and metadata of each profile can be easily associated through the profile_id and cruise_id fields.
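The association between data and metadata can be sketched as a grouping of flat table rows by the pair (cruise_id, profile_id). The row layout, the cruise_id string, the metadata URLs and the numerical values below are invented for illustration; only the field names come from the text.

```python
from collections import defaultdict


def group_by_profile(rows):
    """Group flat table rows into profiles keyed by (cruise_id, profile_id)."""
    profiles = defaultdict(
        lambda: {"url_metadata": None, "depth": [], "temperature": []}
    )
    for row in rows:
        profile = profiles[(row["cruise_id"], row["profile_id"])]
        # The metadata link is repeated on every row of the same profile.
        profile["url_metadata"] = row["url_metadata"]
        profile["depth"].append(row["DEPTH_INT"])
        profile["temperature"].append(row["TEMPET01_INT"])
    return dict(profiles)


# Hypothetical rows, as they might appear in a tabular download.
rows = [
    {"cruise_id": "PROJECT_201909", "profile_id": 1,
     "url_metadata": "https://example.org/meta/1",
     "DEPTH_INT": 1.0, "TEMPET01_INT": 24.1},
    {"cruise_id": "PROJECT_201909", "profile_id": 1,
     "url_metadata": "https://example.org/meta/1",
     "DEPTH_INT": 2.0, "TEMPET01_INT": 24.0},
    {"cruise_id": "PROJECT_201909", "profile_id": 2,
     "url_metadata": "https://example.org/meta/2",
     "DEPTH_INT": 1.0, "TEMPET01_INT": 23.8},
]
profiles = group_by_profile(rows)
```

Using both identifiers as the key avoids collisions, since profile_id refers to the sequence of the profile within a cruise and may repeat across cruises.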

7 Summary and conclusions

This work presents the reprocessing of XBT profiles in the Ligurian and Tyrrhenian seas over the time period 1999–2019. The added value of this analysis is the availability of the original raw data and all the metadata from the operational manual notes. This allowed us to create the most complete dataset possible, with metadata accompanying each individual T profile. The surface measurements have been added with a quality indication, and a correction from calibration has been applied, when available, to T values (generally in the range 0.01–0.02 °C), representing the best estimate of the thermal offset due to the operating XBT system characteristics. A new automatic QC procedure and a new vertical interpolation (Barker and McDougall, 2020) have been implemented without the application of any filter. A filter, on the one hand, removes unrealistic high-frequency oscillations and, on the other, smooths out the thermal structure of the T profiles, with the main impact being on the surface layer during stratified conditions. The adoption of a Gaussian filter in SDN data (Manzella et al., 2003, 2007) was justified by the purpose of assimilating XBT profiles into the Mediterranean Forecasting System, which in the early 2000s was characterized by a much lower resolution compared to the present numerical model capabilities. The Cheng et al. (2014) XBT bias correction scheme for both temperature and depth records has also been applied to the calibrated profiles, in agreement with the recent literature, to facilitate the REP dataset integration with other data types for climate studies. The REP dataset gives researchers the most complete information for its reuse in different applications (assimilation in ocean and climate models, process and climate studies). It can also be used to test new QC algorithms or the order in which to apply them to further improve the data quality.

The adoption of FAIR data management principles through the use of SeaDataNet standards and the dissemination strategy based on the ERDDAP server implementation are additional values of this effort, allowing for its machine-to-machine access.

XBTs are a 60-year-old piece of technology. Though the quality of their measurements might not fit the purpose of all applications and they leave debris in the ocean, “XBTs provide the simplest and most cost-efficient solution for frequently obtaining temperature profiles along fixed transects of the upper ocean” (Parks et al., 2022) using ships of opportunity. Moreover, the XBT measurements along the MX04 track were, for some periods, among the few measurements recorded in the Tyrrhenian and Ligurian seas. Despite the limitations of the XBT characteristics, they constituted the simplest way to verify the physical state of the upper layer of those basins. It is therefore very important to provide those profiles with the best-quality and usability indications. For this reason, the MX04 line has been re-established for climate monitoring on a seasonal basis in the framework of the MACMAP project after a 2-year break.

In recent years, the use of XBTs has also been criticized because all probe components fall to the seabed. Given the current MACMAP sampling strategy with 37 launches in fixed and determined positions along the MX04 line, the quantity of material abandoned at sea for each campaign can be easily estimated (about 22 kg of ZAMAK, just over 2 kg of plastic and about 11 kg of copper wire). It would be preferable that the XBT probes were made of alternative materials (e.g., iron “nose” and biodegradable plastic components); however, in our cost–benefit analysis, the environmental impact due to the REP dataset is balanced by the scientific results. Finally, the deployment of the XBT probes described here did not contribute to additional emissions of CO₂ and other atmospheric pollutants because only ships of opportunity were used, and, in the case of research vessels, the launch of the XBT probes was ancillary to the primary purpose of the scientific cruise.

Appendix A

Characteristics of test canisters

While it is easy to have steady and controlled environmental conditions for measurements in the laboratory, in the field this is only an aspiration of the operators. Furthermore, repeated operation in conditions of high temperature, humidity and salinity certainly does not facilitate the proper functioning of the electronic instrumentation. The DAQ in an XBT system should read the nominal value of a resistance (within the uncertainties in the measurements) without showing changes in its readings over time. The use of a tester with high-quality resistors is the preferred method to verify this. Between 2007 and 2010, two testers were built using very high precision resistors (model KOA Speer RN73r1jttd1002b10) combined in such a way as to achieve corresponding T values similar to the extreme ones measured in the marine regions under investigation. The resistance values of both testers were checked each year with a Wavetek Datron 1281 8.5-digit multi-meter in a laboratory of the INFN (Italian National Institute of Nuclear Physics) in Milan (room temperature always in the range of 20–24 °C during measurements). The reading remained stable (within 0.1 Ω) over the period 2008–2019 for the former and 2010–2015 for the latter.

The resistance R values shown in Table A1 are then converted to T by applying the Hoge-2 R-to-T equation (Sippican, 1991; Lockheed Martin Sippican (Sippican) Inc., 2010; Hoge, 1988; Chen, 2009; Liu et al., 2018):

$$T = \frac{1}{A + B \ln R + C (\ln R)^{2} + D (\ln R)^{3}} - 273.15\,^{\circ}\mathrm{C}, \tag{A1}$$

with the coefficients $A = 1.2901230 \times 10^{- 3}$ , $B = 2.3322529 \times 10^{- 4}$ , $C = 4.5791293 \times 10^{- 7}$ , and $D = 7.1625593 \times 10^{- 8}$ .
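A minimal sketch of Eq. (A1) with the coefficients given above is shown below. It assumes that R is expressed in ohms, which the text does not state explicitly; the function name and the sample resistances are illustrative only and are not the tester values of Table A1.

```python
import math

# Coefficients of the Hoge-2 equation as given in the text (Eq. A1).
A = 1.2901230e-3
B = 2.3322529e-4
C = 4.5791293e-7
D = 7.1625593e-8


def resistance_to_temperature(resistance_ohm):
    """Convert a resistance (assumed in ohms) to temperature in deg C via Eq. (A1)."""
    if resistance_ohm <= 0:
        raise ValueError("resistance must be positive")
    ln_r = math.log(resistance_ohm)
    # The polynomial in ln R gives the inverse of the absolute temperature (1/K).
    inverse_kelvin = A + B * ln_r + C * ln_r**2 + D * ln_r**3
    return 1.0 / inverse_kelvin - 273.15


# Illustrative resistances only.
for r in (3000.0, 5000.0, 18000.0):
    print(f"R = {r:8.1f} ohm -> T = {resistance_to_temperature(r):6.2f} degC")
```

Under this assumption, a resistance of 5000 Ω maps to about 25 °C, and T decreases as R increases, as expected for an NTC thermistor.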

To our knowledge, this equation and the coefficients remained unchanged since the 1990s for all the DAQs, namely Sippican MK12, MK21 ISA, MK21 USB, MK21 Ethernet, Turo Devil, and Turo Quoll. Sippican used the Steinhart–Hart relation for its MK9 model (Intergovernmental Oceanographic Commission, 1992), while tabulated R-to-T values were used for MK-2A and similar recorders (Sippican, 1968; Plessey Company Limited, 1975).

Table A1. The resistance values measured in the control tests with the corresponding temperature values calculated by a Hoge-2 equation for the two testers used in the XBT data acquisition campaigns since 2010.

Appendix B

Table B1. Summary of ships, instrumentation and operating conditions during the collection of XBT profiles in the REP dataset.

Appendix C

C1 Format and standards

The data format adopted to archive the REP dataset is NetCDF (Network Common Data Form). It is self-describing since it includes the metadata that describe both data and data structures. The NetCDF implementation is based on the community-supported Climate and Forecast (CF) specification (CF1.6 profile for profile data) and it adopts the SeaDataNet vocabularies (https://www.seadatanet.org/Standards/Common-Vocabularies, last access: 28 November 2024). The reference SDN parameter codes (P01 terms; https://vocab.seadatanet.org/v_bodc_vocab_v2/search.asp?lib=P01, last access: 28 November 2024) and the associated standard units (P06 terms; https://vocab.seadatanet.org/v_bodc_vocab_v2/search.asp?lib=P06, last access: 28 November 2024) are used in order to ensure the proper interpretation of values by both humans and machines and to allow for data interoperability in terms of manipulation, distribution and long-term reuse.

Each XBT NetCDF file contains

dimensions that provide information on the size of the variables (a.k.a. parameters),
coordinate variables that orient the data in time and space,
geophysical variables that contain the actual measurements,
ancillary variables that contain the quality flag (QF) values,
additional variables that include some of the variables being part of SDN extensions to CF,
global metadata fields that refer to the whole file and not just to one variable (a.k.a. global attributes).

C1.1 Dimensions

The pattern followed by SDN for the profile data type is to have an unlimited INSTANCE dimension, with a maximum number of z coordinate levels (MAXZ). We also included a string size dimension, STRING, for text arrays and added two test size dimensions referring, respectively, to the QC tests on temperature (TST_T) and depth (TST_D) values, as well as the maximum number of z coordinate levels for the data re-sampled at a 1 m interval after QC is applied (MAX_INT).

C1.2 Coordinate variables

NetCDF coordinates are a special subset of variables which orient the data in time and space. They are

LONGITUDE for x,
LATITUDE for y,
TIME for t,
DEPTH for z.

C1.3 Geophysical variables

Each file contains the following:

depth, which is depth at original vertical resolution,
TEMPET01, which is calibrated seawater temperature at original vertical resolution,
DEPTH_COR, which is the original vertical resolution depth corrected by applying Cheng et al. (2014),
TEMPET01_COR, which is the calibrated and corrected seawater temperature resulting from applying Cheng et al. (2014),
DEPTH_INT, which is the depth interpolated on standard depth levels using the Barker and McDougall (2020) method,
TEMPET01_INT, which is TEMPET01 interpolated on standard depth levels using the Barker and McDougall (2020) method,
DEPTH_COR_INT, which is DEPTH_COR interpolated on standard depth levels using the Barker and McDougall (2020) method,
TEMPET01_COR_INT, which is TEMPET01_COR interpolated on standard depth levels (each meter depth) using the Barker and McDougall (2020) method.

Calibration values are provided in a separate variable, CALIB, so that experts can trace back the raw (uncalibrated) profile if needed.

For each coordinate and geophysical variable, four mandatory parameter attributes are included, as defined in Lowry et al. (2019):

sdn_parameter_urn. This is the URN (Uniform Resource Name) for the parameter description taken from the P01 vocabulary.
sdn_parameter_name. This is the plain language label (Entryterm) for the parameter taken from the P01 vocabulary at the time of the data creation.
sdn_uom_urn. This is the URN for the parameter units of measurement taken from the P06 vocabulary.
sdn_uom_name. This is the plain language label (Entryterm) for the parameter taken from the P06 vocabulary at the time of data file creation.

Moreover, since some of the coordinate variable names could be ambiguous, particularly for the z coordinate, we adopt the standard_name attribute (P07 vocabulary; https://vocab.seadatanet.org/v_bodc_vocab_v2/search.asp?lib=P07, last access: 28 November 2024), which is not mandatory in CF but widely used and significantly enhances interoperability.

C1.4 Ancillary variables

In order to report data quality information on a point-by-point basis, every measurement is tagged with a single-byte encoded label referred to as a flag. The flag variables are mandatory for all coordinate and geophysical variables, to which they relate through the ancillary_variables attribute of the parent variable, set to the name of the ancillary variable (Lowry et al., 2019). The flags are encoded using the SDN L20 vocabulary (https://vocab.seadatanet.org/v_bodc_vocab_v2/search.asp?lib=L20, last access: 28 November 2024), and each ancillary variable carries the attributes flag_values and flag_meanings, which provide a list of possible values and their meanings.

For coordinate variables, the ancillary variables are the following:

TIME_SEADATANET_QC, which is the ancillary variable referring to the TIME parent variable;
POSITION_SEADATANET_QC, which represents longitude and latitude flag variables combined into a single flag for the position following OceanSITES (2020) practice.

For the depth coordinate, the ancillary variables are

DEPTH_TEST_QC, which contains flags coming from the application of the depth check test;
DEPTH_FLAGS_QC, which contains flags associated with each original depth value and summarizes the results of the performed depth test check mapped on SDN L20 vocabulary;
DEPTH_COR_FLAGS_QC, which contains flags associated with each corrected (Cheng et al., 2014, CH14) depth value;
DEPTH_INT_SEADATANET_QC, which contains flags associated with the interpolated profile;
DEPTH_COR_INT_SEADATANET_QC, which contains flags associated with the corrected (CH14) interpolated profile.

For the temperature geophysical variable, the ancillary variables, similarly to the depth coordinate, are the following:

TEMPET01_TEST_QC, which contains exit values coming from the application of independent temperature check tests;
TEMPET01_FLAGS_QC, which contains the QFs associated with each calibrated temperature value and summarizes the results of the performed independent temperature test checks mapped on SDN L20 vocabulary;
TEMPET01_COR_FLAGS_QC, which contains the QFs associated with each calibrated and corrected (CH14) temperature value;
TEMPET01_INT_SEADATANET_QC, which contains QFs associated with the temperature interpolated profile;
TEMPET01_COR_INT_SEADATANET_QC, which contains QFs associated with the corrected (CH14) temperature interpolated profile.
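The way flag_values and flag_meanings can be used together is sketched below. In the CF convention, flag_meanings is a blank-separated string whose entries correspond, in order, to the entries of flag_values. The integer flag values, the subset of meanings and the temperature values used here are assumptions for illustration; in practice both attributes should be read from the ancillary variable in the file, where the flags may be stored with a different encoding.

```python
def decode_flags(flags, flag_values, flag_meanings):
    """Map raw flag values to their meanings using CF-style attributes.

    flag_values is a sequence of allowed values; flag_meanings is the
    blank-separated string associated with them, in the same order.
    """
    meanings = flag_meanings.split()
    if len(meanings) != len(flag_values):
        raise ValueError("flag_values and flag_meanings differ in length")
    lookup = dict(zip(flag_values, meanings))
    return [lookup[f] for f in flags]


def keep_flagged(values, flags, accepted):
    """Return values whose flag is in accepted; the others become None."""
    return [v if f in accepted else None for v, f in zip(values, flags)]


# Illustrative attributes (assumed here, not read from a REP file).
FLAG_VALUES = [0, 1, 2, 3, 4]
FLAG_MEANINGS = (
    "no_quality_control good_value probably_good_value "
    "probably_bad_value bad_value"
)

# Invented temperatures with their point-by-point flags.
temperature = [18.2, 18.1, 25.9, 17.6]
temperature_qc = [1, 1, 4, 2]

labels = decode_flags(temperature_qc, FLAG_VALUES, FLAG_MEANINGS)
good_only = keep_flagged(temperature, temperature_qc, accepted={1, 2})
```

Keeping the flagged values as None rather than deleting them preserves the one-to-one correspondence between temperature and depth levels.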

C1.5 Additional variables

In addition to attributes, the following variables from the SDN extension have been adopted:

1. SDN_CRUISE, an array containing the name of the project which funded the cruise;
2. SDN_EDMO_CODE, an integer array containing keys identifying the organization in the European Directory of Marine Organizations (EDMO, https://www.seadatanet.org/Metadata/EDMO-Organisations, last access: 28 November 2024);
3. SDN_BOT_DEPTH, a floating-point array holding bathymetric water depth in meters where the sample was collected or measurement was made, for which we considered the local bottom depth extracted from the GEBCO Compilation Group (2021).

In order to preserve and keep track of metadata associated with each profile (url_metadata) in the dissemination through ERDDAP, the following two variables have also been adopted:

4. cruise_id, an array containing the name of the project which funded the cruise, with the year and the month of the cruise added;
5. profile_id, an array referring to the sequence of the profile during the corresponding cruise.

C1.6 Global metadata fields

The global attribute section of a NetCDF file describes its content overall. All attributes should be human-readable and contain meaningful information for data discovery and reuse. Most importantly, all available discovery metadata have been added to the SDN mandatory attributes, following the recommendations of the XBT community. Moreover, several studies (Cheng et al., 2014, 2016, 2018; Goni et al., 2019) highlighted the dependency of the biases on probe type, time (due to variations in the manufacturing process) and changes in the recording systems (Tan et al., 2021). For these reasons, the following information has been inserted in the XBT metadata description: probe type with the serial number, manufacturer, manufacturing date, FRE coefficients used to calculate the depth, launch height, DAQ model and recorder version (Cheng et al., 2016). Ship speed, wind speed and probe mass (the latter available since 2018) have been added to this metadata section when available.

The depth (depth_uncertainty) and temperature (TEMPET01_uncertainty) uncertainties, being the same for every profile within the REP dataset, have been included as global attributes.

The abovementioned information has been kept and made available through ERDDAP by an url_metadata variable in order to more efficiently manage the many metadata strings. A Jupyter notebook in Python (Fratianni and Frizzera, 2024) has been stored on a GitHub repository and published on Zenodo (https://doi.org/10.5281/zenodo.13862792) to enable access to and recombine all data and metadata in NetCDF files, one per XBT profile.

Interactive computing environment (ICE)

To facilitate data reusability, we prepared a Jupyter Notebook in Python that allows for recombining all data and metadata in NetCDF files, one per XBT profile. The notebook (Fratianni and Frizzera, 2024) is available on https://github.com/cfrat74/XBT_ERDDAP (last access: 29 November 2024) and published on Zenodo (https://doi.org/10.5281/zenodo.13862792, Fratianni and Frizzera, 2024).

The standards adopted for the dissemination of the REP dataset are described in detail in Appendix C.

The ODV collection of the REP interpolated dataset, used for the visual check, is also available on request.

Author contributions

SS conceptualized the work, FR curated the original data (collecting a significant portion of it), and CF developed the QC software under the methodology supervision of SS, FR and LC. GR prepared the correction from the calibration of DAQs. CF manages and curates the reprocessed dataset. SS, FR and CF prepared the manuscript with contributions from GR and LC.

Competing interests

At least one of the (co-)authors is a member of the editorial board of Earth System Science Data. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We thank all people, institutions and companies involved in the data taking. We would like to acknowledge the following:

the Italian shipping company GNV, which was a very special partner that has allowed the monitoring activity since September 1999, and, in particular, Marco Fasciolo, Mattia Canevari, captains, officers and all crews for their precious collaboration;
persons involved in data collection on the MX04 line – namely, Mireno Borghini, Filippo Dell'Amico, Carlo Galli and Egisto Lazzoni (CNR-ISMAR) and Massimo Morgigni and Antonio Baldi (ENEA-STE);
CNR-ISMAR-Lerici for the very long collaboration that has allowed for the acquisition of numerous XBT profiles from research vessels and, in particular, the crew and technicians of the RV Urania;
the international shipping companies Hapag Lloyd, CMA CGM and Arkas and their managers and crews for their valuable collaboration;
responsible officers ashore and on board crews and technicians of ships belonging to IIM – in particular, Maurizio Demarte and Luca Repetti;
Australian government agency CSIRO for its kind cooperation by sharing their instrumentation in the 2007–2011 data collection on containerships and notably Ann Thresher, Lisa Krummel and Rebecca Cowley;
the Federal Research Laboratory NOAA-AOML of Miami (FL) and in particular Gustavo Goni and Francis Bringas for supplying the XBT probes used during some MX04 campaigns and for the support in carrying out the operational activities;
Stefano Latorre (INFN, Milan), a key person in the development and implementation of the testers and their periodic calibration;
one of the authors (Franco Reseghetti) for having supplied his own instrumentation and XBT probes for carrying out oceanographic campaigns since 2008.

We extend very special thanks to Giuseppe M. R. Manzella, who created the SOOP program in the Mediterranean Sea and coordinated it until 2013 and was among the pioneers in the development of marine data infrastructures. He supported this paper, providing useful comments.

We acknowledge Marjahn Finlayson for reviewing the English in a previous version of the manuscript and Mario Locati (head of the INGV data management office) for his continuous support. This work has been developed in the framework of the MACMAP project coordinated by Antonio Guarnieri, whom we thank.

Financial support

This research has been supported by the Istituto Nazionale di Geofisica e Vulcanologia (Environment Department) (grant no. CUP D59C19000090005).

Review statement

This paper was edited by François G. Schmitt and reviewed by Rebecca Cowley and one anonymous referee.

References

Bailey, R., Gronell A., Phillips H., Tanner E., and Meyers, G.: Quality Control Cookbook for XBT Data, CSIRO Report 221, 84 pp., http://hdl.handle.net/102.100.100/237126?index=1 (last access: 28 November 2024), 1994.

Barker, P. M. and McDougall, T. J.: Two Interpolation Methods Using Multiply-Rotated Piecewise Cubic Hermite Interpolating Polynomials, J. Atmos. Ocean. Tech., 37, 605–619, https://doi.org/10.1175/JTECH-D-19-0211.1, 2020.

Bordone, A., Pennecchi, F., Raiteri, G., Repetti, L., and Reseghetti, F.: XBT, ARGO Float and Ship-Based CTD Profiles Intercompared under Strict Space-Time Conditions in the Mediterranean Sea: Assessment of Metrological Comparability, J. Marine Sci. Eng., 8, 313, https://doi.org/10.3390/jmse8050313, 2020.

Bringas, F. and Goni, G.: Early dynamics of Deep Blue XBT probes, J. Atmos. Ocean. Tech., 32, 2253–2263, https://doi.org/10.1175/JTECH-D-15-0048.1, 2015.

Chen, C.: Evaluation of resistance–temperature calibration equations for NTC thermistors, Measurement, 42, 1103–1111, https://doi.org/10.1016/j.measurement.2009.04.004, 2009.

Cheng, L., Zhu, J., Cowley, R., Boyer, T., and Wijffels, S.: Time, probe type, and temperature variable bias corrections to historical expendable bathythermograph observations, J. Atmos. Ocean. Tech., 31, 1793–1825, https://doi.org/10.1175/Jtech-D-13-00197.1, 2014.

Cheng, L., Abraham, J., Goni, G., Boyer, T., Wijffels, S., Cowley, R., Gouretski, V., Reseghetti, F., Kizu, S., Dong, S., Bringas, F., Goes, M., Houpert, L., Sprintall, J., and Zhu, J.: XBT science: assessment of instrumental biases and errors, B. Am. Meteorol. Soc., 97, 923–934, https://doi.org/10.1175/Bams-D-15-00031.1, 2016.

Cheng, L., Trenberth, K. E., Fasullo, J., Boyer, T., Abraham, J., and Zhu, J.: Improved estimates of ocean heat content from 1960 to 2015, Sci. Adv., 3, e1601545, https://doi.org/10.1126/sciadv.1601545, 2017.

Cheng, L., Luo, H., Boyer, T., Cowley, R., Abraham, J., Gouretski, V., Reseghetti, F., and Zhu, J.: How well can we correct systematic errors in historical XBT data?, J. Atmos. Ocean. Tech., 35, 1103–1125, https://doi.org/10.1175/jtech-d-17-0122.1, 2018.

Cheng, L., Abraham, J., Zhu, J., Trenberth, K. E., Fasullo, J., Boyer, T., Locarnini, R., Zhang, B., Wan, L., Chen, X., Song, X., Liu, Y., and Mann, M. E.: Record-Setting Ocean Warmth Continued in 2019, Adv. Atmos. Sci., 37, 137–142, https://doi.org/10.1007/s00376-020-9283-7, 2020.

Cheng, L., Abraham, J., Trenberth, K. E., Fasullo, J., Boyer, T., Locarnini, R., Zhang, B., Yu, F., Wan, L., Chen, X., Song, X., Liu, Y., Mann, M. E., Reseghetti, F., Simoncelli, S., Gouretski, V., Chen, G., Mishonov, A., Reagan, J., and Zhu, J.: Upper Ocean Temperatures Hit Record High in 2020, Adv. Atmos. Sci., 38, 523–530, https://doi.org/10.1007/s00376-021-0447-x, 2021.

Cheng, L., Abraham, J., Trenberth, K. E., Fasullo, J., Boyer, T., Mann, M. E., Zhu, J., Wang, F., Locarnini, R., Li, Y., Zhang, B., Tan, Z., Yu, F., Wan, L., Chen, X., Song, X., Liu, Y., Reseghetti, F., Simoncelli, S., Gouretski, V., Chen, G., Mishonov, A., and Reagan, J.: Another Record: Ocean Warming Continues through 2021 despite La Niña Conditions, Adv. Atmos. Sci., 39, 373–385, https://doi.org/10.1007/s00376-022-1461-3, 2022.

Cook, S. and Sy, A.: Best guide and principles manual for the Ships Of Opportunity Program (SOOP) and eXpendable BathyThermograph (XBT) operations, Geneva, Switzerland, WMO & IOC, 26 pp., https://doi.org/10.25607/OBP-1483, 2001.

Cowley, R. and Krummel, L.: Australian XBT Quality Control Cookbook Version 2.0, Report EP2022-1825 CSIRO, Australia, 1–89 pp., https://doi.org/10.25919/3tm5-zn80, 2022.

Cowley, R., Killick, R. E., Boyer, T., Gouretski, V., Reseghetti, F., Kizu, S., Palmer, M. D., Cheng, L., Storto, A., Le Menn, M., Simoncelli, S., Macdonald, A. M., and Domingues, C. M.: International Quality-Controlled Ocean Database (IQuOD) v0.1: The Temperature Uncertainty Specification, Front. Mar. Sci., 8, 689695, https://doi.org/10.3389/fmars.2021.689695, 2021.

Durante, S., Oliveri, P., Nair, R., and Sparnocchia, S.: Mixing in the Tyrrhenian Interior Due to Thermohaline Staircases, Front. Mar. Sci., 8, 672437, https://doi.org/10.3389/fmars.2021.672437, 2021.

Flierl, G. R. and Robinson, A. R.: XBT Measurements of Thermal Gradients in the MODE Eddy, J. Phys. Oceanogr., 7, 300–302, https://doi.org/10.1175/1520-0485(1977)007<0300:XMOTGI>2.0.CO;2, 1977.

Fratianni, C. and Frizzera, P.: REPROCESSED XBT 1999–2019: how to access data and metadata throught ERDDAP (v1.0.0), Zenodo [software], https://doi.org/10.5281/zenodo.13862792, 2024.

Fusco, G., Manzella, G. M. R., Cruzado, A., Gačić, M., Gasparini, G. P., Kovačević, V., Millot, C., Tziavos, C., Velasquez, Z. R., Walne, A., Zervakis, V., and Zodiatis, G.: Variability of mesoscale features in the Mediterranean Sea from XBT data analysis, Ann. Geophys., 21, 21–32, https://doi.org/10.5194/angeo-21-21-2003, 2003.

GEBCO Compilation Group: GEBCO 2021 Grid, GEBCO Compilation Group, https://doi.org/10.5285/c6612cbe-50b3-0cff-e053-6c86abc09f8f, 2021.

Goni, G., Domingues, R., Goes, M., Lopez, H., Morrow, R., Rivero, U., Rossby, T., Todd, R. E., Trinanes, J., Zilberman, N., Baringer, M., Boyer, T., Cowley, R., Domingues, C. M., Hutchinson, K., Kramp, M., Mata, M. M., Reseghetti, F., Sun, C., Bhaskar, T. V. S. U., and Volkov, D.: More than 50 years of successful continuous temperature section measurements by the global expendable bathythermograph network, its integrability, societal benefits, and future, Front. Mar. Sci., 6, 452, https://doi.org/10.3389/fmars.2019.00452, 2019.

Good, S., Mills, B., Boyer, T., Bringas, F., Castelão, G., Cowley, R., Goni, G., Gouretski, V., and Domingues, C. M.: Benchmarking of automatic quality control checks for ocean temperature profiles and recommendations for optimal sets, Front. Mar. Sci., 9, 1075510, https://doi.org/10.3389/fmars.2022.1075510, 2023.

Gronell, A. and Wijffels, S. E.: A Semiautomated Approach for Quality Controlling Large Historical Ocean Temperature Archives, J. Atmos. Ocean. Tech., 25, 990–1003, https://doi.org/10.1175/JTECHO539.1, 2008.

Haddad, S., Killick, R. E., Palmer, M. D., Webb, M. J., Prudden, R., Capponi, F., and Adams, S. V.: Improved Infilling of Missing Metadata from Expendable Bathythermographs (XBTs) Using Multiple Machine Learning Methods, J. Atmos. Ocean. Tech., 39, 1367–1385, https://doi.org/10.1175/JTECH-D-21-0117.1, 2022.

Hanawa, K., Rual, P., Bailey, R., Sy, A., and Szabados, M.: A new depth-time equation for Sippican or TSK T-7, T-6 and T-4 expendable bathythermographs (XBT), Deep-Sea Res. Pt. I, 42, 1423–1451, https://doi.org/10.1016/0967-0637(95)97154-Z, 1995.

Hoge, H.: Useful procedure in least squares, and tests of some equations for thermistors, Rev. Sci. Instrum., 59, 975–979, https://doi.org/10.1063/1.1139762, 1988.

Intergovernmental Oceanographic Commission: Manuals and guides, 4. Guide to oceanographic and marine meteorological instruments and observing practices, 1–78 pp., ISBN 92-3-101325-4, 1975.

Intergovernmental Oceanographic Commission: Ad hoc meeting of the IGOSS Task Team on quality control for automated systems, Marion, Massachusetts, USA, 3–6 June 1991, Intergovernmental Oceanographic Commission IOC/INF-888, 1–144 pp., 1992.

Intergovernmental Oceanographic Commission: First Session of the Joint IOC-WMO IGOSS Ship-of-Opportunity Programme Implementation Panel: Annex VI, Cape Town, South Africa, 16–18 April 1997, 1–46 pp., 1997.

Intergovernmental Oceanographic Commission: Ocean Data Standards Volume 3. Recommendation for a Quality Flag Scheme for the Exchange of Oceanographic and Marine Meteorological Data. Paris, France, UNESCO-IOC, 5 pp. & Annexes, Intergovernmental Oceanographic Commission Manuals and Guides, 54, IOC/2013/MG/54-3, https://doi.org/10.25607/OBP-6, 2013.

Intergovernmental Oceanographic Commission: Ocean Data Standards Volume 4: Technology for SeaDataNet Controlled Vocabularies for describing Marine and Oceanographic Datasets – A joint Proposal by SeaDataNet and ODIP projects. Oostend, Belgium, IODE/UNESCO, 31 pp., IOC Manuals and Guides, 54, vol. 4, version 1, IOC/2019/MG/54 Vol.4, https://doi.org/10.25607/OBP-566, 2019.

Kizu, S. and Hanawa, K.: Start-up transient of XBT measurement, Deep-Sea Res. Pt. I, 49, 935–940, https://doi.org/10.1016/S0967-0637(02)00003-1, 2002a.

Lange, N., Tanhua, T., Pfeil, B., Bange, H. W., Lauvset, S. K., Grégoire, M., Bakker, D. C. E., Jones, S. D., Fiedler, B., O'Brien, K. M., and Körtzinger, A.: A status assessment of selected data synthesis products for ocean biogeochemistry, Front. Mar. Sci., 10, 1078908, https://doi.org/10.3389/fmars.2023.1078908, 2023.

Leahy, T. P., Llopis, F. P., Palmer, M. D., and Robinson, N. H.: Using Neural Networks to Correct Historical Climate Observations, J. Atmos. Ocean. Tech., 35, 2053–2059, https://doi.org/10.1175/JTECH-D-18-0012.1, 2018.

Little, A. D., Inc.: Experimental evaluation of expendable bathythermographs, Dept. of the Navy Bureau of Ships Rep. ASW Sonar Technology Report No. 4071165, Dept. of the Navy – Bureau of Ships Project SN SF-101-03-21, Task 11353, November 1965, 51 pp., 1965.

Little, A. D., Inc.: Expendable bathythermograph (XBT) system evaluation for tactical sonar application. ASW Sonar Technology Report No. 4150866, Dept. of the Navy – Naval Ship Systems Command, Project SN SF-101-03-21, Task 11353, June 1966, 85 pp., 1966.

Liu, G., Guo, L., Liu, C., and Wu, Q.: Evaluation of different calibration equations for NTC thermistor applied to high-precision temperature measurement, Measurement, 120, 21–27, https://doi.org/10.1016/j.measurement.2018.02.007, 2018.

Lockheed Martin Sippican (Sippican) Inc.: MK21 USB DAQ, surface ship, bathythermograph data acquisition system, installation operation and maintenance manual, P/N 308437, Rev. E, 172 pp., 2006.

Lockheed Martin Sippican (Sippican) Inc.: WinMK21 Data Acquisition and Post Processing Software User's Manual P/N 352210, Rev. B, 134 pp., 2010.

Lockheed Martin Sippican (Sippican) Inc.: MK21 Ethernet Surface 1U DAQ – Bathythermograph data acquisition system, installation and operation manual, P/N 352186, Rev. D, 47 pp., 2014.

Lowry, R., Fichaut, M., and Bregent, S.: SeaDataNet NetCDF format definition, Version 1.21, SeaDataNet, 73 pp., https://doi.org/10.25607/OBP-408, 2019.

Magruder Jr., P. M.: Some characteristics of temperature microstructure in the ocean, M.S. thesis, Dept. of Oceanography, US Naval Postgraduate School, 1–155 pp., 1970.

Manzella, G. M. R., Scoccimarro, E., Pinardi, N., and Tonani, M.: Improved near real-time data management procedures for the Mediterranean ocean Forecasting System-Voluntary Observing Ship program, Ann. Geophys., 21, 49–62, https://doi.org/10.5194/angeo-21-49-2003, 2003.

Manzella, G. M. R., Reseghetti, F., Coppini, G., Borghini, M., Cruzado, A., Galli, C., Gertman, I., Gervais, T., Hayes, D., Millot, C., Murashkovsky, A., Özsoy, E., Tziavos, C., Velasquez, Z., and Zodiatis, G.: The improvements of the ships of opportunity program in MFS-TEP, Ocean Sci., 3, 245–258, https://doi.org/10.5194/os-3-245-2007, 2007.

Meccia, V. L., Simoncelli, S., and Sparnocchia, S.: Decadal variability of the Turner Angle in the Mediterranean Sea and its implications for double diffusion, Deep-Sea Res. Pt. I, 114, 64–77, https://doi.org/10.1016/J.DSR.2016.04.001, 2016.

Meyssignac, B., Boyer, T., Zhao, Z., Hakuba, M. Z., Landerer, F. W., Stammer, D., Köhl, A., Kato, S., L'Ecuyer, T., Ablain, M., Abraham, J. P., Blazquez, A., Cazenave, A., Church, J. A., Cowley, R., Cheng, L., Domingues, C. M., Giglio, D., Gouretski, V., Ishii, M., Johnson, G. C., Killick, R. E., Legler, D., Llovel, W., Lyman, J., Palmer, M. D., Piotrowicz, S., Purkey, S. G., Roemmich, D., Roca, R., Savita, A., von Schuckmann, K., Speich, S., Stephens, G., Wang, G., Wijffels, S. E., and Zilberman, N.: Measuring Global Ocean Heat Content to Estimate the Earth Energy Imbalance, Front. Mar. Sci., 6, 432, https://doi.org/10.3389/fmars.2019.00432, 2019.

Millot, C. and Taupier-Letage, I.: Circulation in the Mediterranean Sea, in: The Mediterranean Sea. Handbook of Environmental Chemistry, edited by: Saliot, A., vol. 5K, Springer, Berlin, Heidelberg, https://doi.org/10.1007/b107143, 2005a.

Millot, C. and Taupier-Letage, I.: Additional evidence of LIW entrainment across the Algerian Basin by mesoscale eddies and not by permanent westward-flowing vein, Prog. Oceanogr., 66, 231–250, https://doi.org/10.1016/j.pocean.2004.03.002, 2005b.

Novellino, A., Pizziol, V., Dapueto, G., Misurale, F., Scotto, B. M., Bordoni, R., Gorringe, P., Schaap, D., and Iona, A.: EMODnet Ingestion and the operational data exchange examples and hot topics, Miscellanea INGV, 80, 364–366, https://doi.org/10.13127/MISC/80/140, 2024.

O'Brien, K. and Delaney, C.: A review of ERDDAP the established best practice in sharing gridded and tabular data from the Earth Sciences community, Miscellanea INGV, 80, 231–232, https://doi.org/10.13127/MISC/80/87, 2024.

OceanSITES: OceanSITES Data Format Reference Manual NetCDF Conventions and Reference Tables, Version 1.4, 16 July 2020, Geneva, Switzerland, OceanSITES, JCOMMOPS, 36 pp., https://doi.org/10.25607/OBP-421.2, 2020.

Palmer, M. D., Boyer, T., Cowley, R., Kizu, S., Reseghetti, F., Suzuki, T., and Thresher, A.: An Algorithm for Classifying Unknown Expendable Bathythermograph (XBT) Instruments Based on Existing Metadata, J. Atmos. Ocean. Tech., 35, 429–440, https://doi.org/10.1175/JTECH-D-17-0129.1, 2018.

Parks, J., Bringas, F., Cowley, R., Hanstein, C., Krummel, L., Sprintall, J., Cheng, L., Cirano, M., Cruz, S., Goes, M., Kizu, S., and Reseghetti, F.: XBT operational best practices for quality assurance, Front. Mar. Sci., 9, 991760, https://doi.org/10.3389/fmars.2022.991760, 2022.

Pinardi, N. and Coppini, G.: Preface “Operational oceanography in the Mediterranean Sea: the second stage of development”, Ocean Sci., 6, 263–267, https://doi.org/10.5194/os-6-263-2010, 2010.

Pinardi, N., Allen, I., Demirov, E., De Mey, P., Korres, G., Lascaratos, A., Le Traon, P.-Y., Maillard, C., Manzella, G., and Tziavos, C.: The Mediterranean ocean forecasting system: first phase of implementation (1998–2001), Ann. Geophys., 21, 3–20, https://doi.org/10.5194/angeo-21-3-2003, 2003.

Pinardi, N., Zavatarelli, M., Adani, M., Coppini, G., Fratianni, C., Oddo, P., Simoncelli, S., Tonani, M., Lyubartsev, V., Dobricic, S., and Bonaduce, A.: Mediterranean Sea large-scale low-frequency ocean variability and water mass formation rates from 1987 to 2007: A retrospective analysis, in: Progress in Oceanography, vol. 132, 318–332, Elsevier BV, https://doi.org/10.1016/j.pocean.2013.11.003, 2015.

Pinardi, N., Stander, J., Legler, D. M., O'Brien, K., Boyer, T., Cuff, T., Bahurel, P., Belbeoch, M., Belov, S., Brunner, S., Burger, E., Carval, T., Chang-Seng, D., Charpentier, E., Ciliberti, S., Coppini, G., Fischer, A., Freeman, E., Gallage, C., Garcia, H., Gates, L., Gong, Z., Hermes, J., Heslop, E., Grimes, S., Hill, K., Horsburgh, K., Iona, A., Mancini, S., Moodie, N., Ouellet, M., Pissierssens, P., Poli, P., Proctor, R., Smith, N., Sun, C., Swail, V., Turton, J., and Xinyang, Y.: The Joint IOC (of UNESCO) and WMO Collaborative Effort for Met-Ocean Services, Front. Mar. Sci., 6, 410, https://doi.org/10.3389/fmars.2019.00410, 2019.

Plessey Company Limited: Plessey-Sippican expendable bathythermograph system, Tech. Rep. MP0400, issue 0401, 51 pp., 1975.

Reid Jr., W. L.: Expendable Bathythermograph Evaluation, Oceanographic Instrumentation Center, US Naval Oceanographic Office, DTIC AD A045064, 78 pp., 1964.

Reiniger, R. F. and Ross, C. K.: A method of interpolation with application to oceanographic data, Deep-Sea Res. Oceanogr. Abstr., 15, 185–193, https://doi.org/10.1016/0011-7471(68)90040-5, 1968.

Reseghetti, F., Cheng, L., Borghini, M., Yashayaev, I. M., Raiteri, G., and Zhu, J.: Assessment of Quality and Reliability of Measurements with XBT Sippican T5 and T5/20, J. Atmos. Ocean. Tech., 35, 1935–1960, https://doi.org/10.1175/JTECH-D-18-0043.1, 2018.

Reseghetti, F., Fratianni, C., and Simoncelli, S.: Reprocessed XBT dataset in the Ligurian and Tyrrhenian seas (1999–2019) (Version 2), Istituto Nazionale di Geofisica e Vulcanologia (INGV) [data set], https://doi.org/10.13127/REP_XBT_1999_2019.2, 2024.

Roemmich, D. and Cornuelle, B.: Digitization and calibration of the expendable bathythermograph, Deep-Sea Res. Pt. A, 34, 299–307, 1987.

Ryabinin, V., Barbière, J., Haugan, P., Kullenberg, G., Smith, N., McLean, C., Troisi, A., Fischer, A., Aricò, S., Aarup, T., Pissierssens, P., Visbeck, M., Enevoldsen, H. O., and Rigaud, J.: The UN Decade of Ocean Science for Sustainable Development, Front. Mar. Sci., 6, 470, https://doi.org/10.3389/fmars.2019.00470, 2019.

Schlitzer, R.: Ocean Data View, https://odv.awi.de/ (last access: 29 November 2024), 2023.

Simoncelli, S., Fratianni, C., Pinardi, N., Grandi, A., Drudi, M., Oddo, P., and Dobricic, S.: Mediterranean Sea Physical Reanalysis (CMEMS MED-Physics) (Version 1), Copernicus Monitoring Environment Marine Service (CMEMS) [data set], https://doi.org/10.25423/MEDSEA_REANALYSIS_PHYS_006_004, 2014.

Simoncelli, S., Oliveri, P., Mattia, G., and Myroshnychenko, V.: SeaDataCloud Temperature and Salinity Historical Data Collection for the Mediterranean Sea (Version 2), Product Information Document (PIDoc), https://doi.org/10.13155/77059, 2020a.

Simoncelli, S., Schaap, D., and Schlitzer, R.: Mediterranean Sea – Temperature and salinity Historical Data Collection SeaDataCloud V2, Sextant [data set], https://doi.org/10.12770/2a2aa0c5-4054-4a62-a18b-3835b304fe64, 2020b.

Simoncelli, S., Manzella, G. M. R., Storto, A., Pisano, A., Lipizer, M., Barth, A., Myroshnychenko, V., Boyer, T., Troupin, C., Coatanoan, C., Pititto, A., Schlitzer, R., Dick, M., Schaap, A., and Diggs, S.: Chapter Four – A collaborative framework among data producers, managers, and users, edited by: Manzella, G. and Novellino, A., Ocean Science Data, Elsevier, 197–280, ISBN 9780128234273, https://doi.org/10.1016/B978-0-12-823427-3.00001-3, 2022.

Sippican: Instruction manual for the expendable bathythermograph system, R-603G – 1971, The Sippican Corporation Ocean Systems Division, 208 pp., 1980.

Sippican Corp.: Instructions for installation, operation and maintenance of Sippican expendable bathythermograph system – M300, R-467B, 100 pp., 1968.

Sippican, Inc.: Sippican MK12 oceanographic data acquisition system user's manual, Sippican, Inc., User's Manual R-2626/B P/N 306130-1, 166 pp., 1991.

Sippican Ocean Systems, Inc.: XCTD Phase I Progress Report (13 July 1983), Contract N00014-82-C-0579, R-1259 – 1983, 66 pp., 1983.

Sy, A.: XBT Measurements, in: WOCE Operations Manual, Part 3.1.3 WHP Operations and Methods, WHP Office Report, WHPO 91-1, 19 pp., 1991.

Sy, A. and Wright, D.: XBT/XCTD standard test procedures for reliability and performance test of expendable probes at sea, Revised draft, Geneva, Switzerland, WMO, TC SOT JCOMM Ship Observations Team, 8 pp., https://doi.org/10.25607/OBP-1487, 2001.

Tan, Z., Reseghetti, F., Abraham, J., Cowley, R., Chen, K., Zhu, J., Zhang, B., and Cheng, L.: Examining the Influence of Recording System on the Pure Temperature Error in XBT Data, J. Atmos. Ocean. Tech., 38, 759–776, https://doi.org/10.1175/JTECH-D-20-0136.1, 2021.

Tan, Z., Cheng, L., Gouretski, V., Zhang, B., Wang, Y., Li, F., Liu, Z., and Zhu, J.: A new automatic quality control system for ocean profile observations and impact on ocean warming estimate, Deep-Sea Res. Pt. I, 194, 103961, https://doi.org/10.1016/j.dsr.2022.103961, 2023.

Tanhua, T., Pouliquen, S., Hausman, J., O'Brien, K., Bricher, P., de Bruin, T., Buck, J. J. H., Burger, E. F., Carval, T., Casey, K. S., Diggs, S., Giorgetti, A., Glaves, H., Harscoat, V., Kinkade, D., Muelbert, J. H., Novellino, A., Pfeil, B., Pulsifer, P. L., Van de Putte, A., Robinson, E., Schaap, D., Smirnov, A., Smith, N., Snowden, D., Spears, T., Stall, S., Tacoma, M., Thijsse, P., Tronstad, S., Vandenberghe, T., Wengren, M., Wyborn, L., and Zhao, Z.: Ocean FAIR Data Services, Front. Mar. Sci., 6, 440, https://doi.org/10.3389/fmars.2019.00440, 2019.

Vignudelli, S., Cipollini, P., Reseghetti, F., Fusco, G., Gasparini, G. P., and Manzella, G. M. R.: Comparison between XBT data and TOPEX/Poseidon satellite altimetry in the Ligurian-Tyrrhenian area, Ann. Geophys., 21, 123–135, https://doi.org/10.5194/angeo-21-123-2003, 2003.

von Schuckmann, K., Le Traon, P.-Y., Alvarez-Fanjul, E., Axell, L., Balmaseda, M., Breivik, L.-A., Brewin, R. J. W., Bricaud, C., Drevillon, M., Drillet, Y., Dubois, C., Embury, O., Etienne, H., García Sotillo, M., Garric, G., Gasparin, F., Gutknecht, E., Guinehut, S., Hernandez, F., Juza, M., Karlson, B., Korres, G., Legeais, J.-F., Levier, B., Lien, V. S., Morrow, R., Notarstefano, G., Parent, L., Pascual, Á., Pérez-Gómez, B., Perruche, C., Pinardi, N., Pisano, A., Poulain, P.-M., Pujol, I. M., Raj, R. P., Raudsepp, U., Roquet, H., Samuelsen, A., Sathyendranath, S., She, J., Simoncelli, S., Solidoro, C., Tinker, J., Tintoré, J., Viktorsson, L., Ablain, M., Almroth-Rosell, E., Bonaduce, A., Clementi, E., Cossarini, G., Dagneaux, Q., Desportes, C., Dye, S., Fratianni, C., Good, S., Greiner, E., Gourrion, J., Hamon, M., Holt, J., Hyder, P., Kennedy, J., Manzano-Muñoz, F., Melet, A., Meyssignac, B., Mulet, S., Buongiorno Nardelli, B., O'Dea, E., Olason, E., Paulmier, A., Pérez-González, I., Reid, R., Racault, M.-F., Raitsos, D. E., Ramos, A., Sykes, P., Szekely, T., and Verbrugge, N.: The Copernicus Marine Environment Monitoring Service Ocean State Report, J. Oper. Oceanogr., 9, s235–s320, https://doi.org/10.1080/1755876X.2016.1273446, 2016.

Wannamaker, B.: XBT measurements near the sea surface: Considerations for satellite IR comparisons and data bases. Saclant ASW Research Centre Memo. SM-132, 13 pp., 1980.

Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J.-W., Bonino da Silva Santos, L., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., Gonzalez-Beltran, A., Gray, A. J. G., Groth, P., Goble, C., Grethe, J. S., Heringa, J., ’t Hoen, P. A. C., Hooft, R., Kuhn, T., Kok, R., Kok, J., Lusher, S. J., Martone, M. E., Mons, A., Packer, A. L., Persson, B., Rocca-Serra, P., Roos, M., van Schaik, R., Sansone, S.-A., Schultes, E., Sengstag, T., Slater, T., Strawn, G., Swertz, M. A., Thompson, M., van der Lei, J., van Mulligen, E., Velterop, J., Waagmeester, A., Wittenburg, P., Wolstencroft, K., Zhao, J., and Mons, B.: The FAIR Guiding Principles for scientific data management and stewardship, Sci. Data, 3, 160018, https://doi.org/10.1038/sdata.2016.18, 2016.

Zodiatis, G., Drakopoulos, P., Brenner, S., and Groom, S.: Variability of the Cyprus warm core Eddy during the CYCLOPS project, Deep-Sea Res. Pt. II, 52, 2897–2910, https://doi.org/10.1016/j.dsr2.2005.08.020, 2005.

Short summary

This data review describes the reprocessing of historical eXpendable BathyThermograph (XBT) profiles from the Ligurian and Tyrrhenian seas over the time period 1999–2019. A new automated quality control analysis has been performed starting from the original raw data and operational log sheets. The data have been formatted and standardized according to the latest community best practices, and all available metadata have been inserted, including calibration information and uncertainty specification.