Articles | Volume 13, issue 11
Data description paper
11 Nov 2021

The global and multi-annual MUSICA IASI {H2O, δD} pair dataset

Christopher J. Diekmann, Matthias Schneider, Benjamin Ertl, Frank Hase, Omaira García, Farahnaz Khosrawi, Eliezer Sepúlveda, Peter Knippertz, and Peter Braesicke

We present a global and multi-annual space-borne dataset of tropospheric {H2O, δD} pairs that is based on radiance measurements from the nadir thermal infrared sensor IASI (Infrared Atmospheric Sounding Interferometer) on board the Metop satellites of EUMETSAT (European Organisation for the Exploitation of Meteorological Satellites). This dataset is an a posteriori processed extension of the MUSICA (MUlti-platform remote Sensing of Isotopologues for investigating the Cycle of Atmospheric water) IASI full product dataset as presented in Schneider et al. (2021b). From the independently retrieved H2O and δD proxy states, their a priori settings and constraints, and their error covariances provided by the IASI full product dataset, we generate an optimal estimation product for pairs of H2O and δD. Here, this standard MUSICA method for deriving {H2O, δD} pairs is extended using an a posteriori reduction of the constraints for improving the retrieval sensitivity at dry conditions.

Applying this improved water isotopologue post-processing to all cloud-free MUSICA IASI retrievals yields a {H2O, δD} pair dataset for the whole period from October 2014 to December 2020 with global coverage twice per day (local morning and evening overpass times). In total, the dataset covers more than 1500 million individually processed observations. The retrievals are most sensitive to variations in {H2O, δD} pairs within the free troposphere, with up to 30 % of all retrievals containing vertical profile information in the {H2O, δD} pair product. After applying appropriate quality filters, the largest number of reliable pair data arises for tropical and subtropical summer regions, but higher latitudes also show a considerable amount of reliable data. Exemplary time series over the tropical Atlantic and West Africa are chosen to illustrate the potential of the MUSICA IASI {H2O, δD} pair data for atmospheric moisture pathway studies. Furthermore, in order to facilitate the application of this rather comprehensive MUSICA IASI {H2O, δD} pair dataset (referred to as Level-2), we additionally provide the data in a re-gridded and simplified format (Level-3) with focus on the quality-filtered {H2O, δD} pairs in the free troposphere. Technical documentation for guiding the use of both datasets is attached as the Supplement. Finally, the Level-2 dataset is referenced with the DOI (Diekmann et al., 2021a) and the Level-3 dataset with DOI (Diekmann et al., 2021b).

1 Introduction

Concomitant observations of moisture content and stable water isotopologues allow fundamental insights into the transport and phase transitions of water in the atmosphere. Differences in the molecular masses lead to characteristic responses of each isotopologue to phase changes. Consequently, the ratio of light and heavy water isotopologues inside an air parcel reveals information about moisture processes that have occurred during its pathway through the atmosphere and can hence support the investigation of the atmospheric branch of the hydrological cycle (an extensive overview is given in Galewsky et al., 2016).

For describing distributions of water isotopologues, the δ-notation given in per mille is commonly used, for instance between H2O and its heavier isotopologue HDO, with both given as volume mixing ratios:

(1) $\delta\mathrm{D} = \left( \frac{\mathrm{HDO}/\mathrm{H_2O}}{R_\mathrm{vsmow}} - 1 \right) \times 1000$.

R_vsmow is the isotopic ratio of Vienna Standard Mean Ocean Water as defined by the International Atomic Energy Agency (Craig, 1961). Several studies have proposed the combined analysis of H2O and δD distributions (here denoted as {H2O, δD} pairs) and demonstrated its value for analysing moisture processes and transport. For instance, signatures in {H2O, δD} pair distributions from model simulations and measurements were interpreted in terms of the relative contributions of kinetic and equilibrium fractionation, such as Rayleigh condensation, rain evaporation and air mass mixing (e.g. Worden et al., 2007; Noone, 2012; Dyroff et al., 2015; González et al., 2016; Schneider et al., 2017; Eckstein et al., 2018; Lacour et al., 2018; Dahinden et al., 2021; Diekmann et al., 2021c).
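As a minimal illustration of Eq. (1), the following Python sketch converts H2O and HDO volume mixing ratios into δD. The numerical value used for the VSMOW ratio (twice the D/H ratio of 155.76 × 10⁻⁶) and the function name `delta_d` are assumptions made for this example and are not taken from the dataset documentation.

```python
import numpy as np

# Assumed value: HDO/H2O ratio of Vienna Standard Mean Ocean Water,
# taken here as 2 x 155.76e-6 (two hydrogen positions per molecule).
R_VSMOW = 3.1152e-4


def delta_d(h2o, hdo, r_vsmow=R_VSMOW):
    """Return deltaD in per mille from H2O and HDO volume mixing ratios."""
    h2o = np.asarray(h2o, dtype=float)
    hdo = np.asarray(hdo, dtype=float)
    return ((hdo / h2o) / r_vsmow - 1.0) * 1000.0


# Air with 5000 ppmv H2O whose HDO/H2O ratio is 20 % below VSMOW.
h2o_ppmv = 5000.0
hdo_ppmv = 0.8 * R_VSMOW * h2o_ppmv
example_delta_d = delta_d(h2o_ppmv, hdo_ppmv)
```

In this example the ratio is depleted by 20 % relative to VSMOW, which corresponds to δD = −200 ‰.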

During the last decades the space-based remote sensing of tropospheric water isotopologues has progressed considerably in terms of retrieval development, quality and application. On the one hand, cloud-free land observations from short-wave infrared sensors were used to generate total columns of the ratio HDO/H2O (e.g. Frankenberg et al., 2009; Boesch et al., 2013; Frankenberg et al., 2013; Schneider et al., 2020), while on the other hand thermal infrared sensors allowed for retrieval of HDO/H2O ratios with weak vertical profile information for land as well as ocean observations (e.g. Worden et al., 2006, 2019; Lacour et al., 2012; Schneider and Hase, 2011; Schneider et al., 2016). To ensure coherence in the vertical sensitivities of remotely sensed H2O and δD, which is necessary for a combined interpretation, a further post-processing that creates optimal {H2O, δD} pair information was proposed by Schneider et al. (2012).

However, most of the aforementioned satellite-based water vapour isotopologue retrievals were performed for case studies that were limited in space and time. Up to now, only a few global and long-term referenced space-borne datasets of tropospheric HDO/H2O are available, e.g. total column data from the sensor SCIAMACHY (Scanning Imaging Absorption Spectrometer for Atmospheric Chartography; Schneider et al., 2018) between 2003 and 2012, profile data from TES (Tropospheric Emission Spectrometer; Worden et al., 2012) between 2004 and 2012 (with few available data between 2017 and 2018), and from AIRS (Atmospheric Infrared Sounder; Worden et al., 2019) between 2002 and early 2020. The maximum data availability of these datasets is on the order of 1000 to 30 000 observations per day.

In this paper, we present a new global and multi-annual dataset of tropospheric {H2O, δD} pairs using measurements from the sensor IASI (Infrared Atmospheric Sounding Interferometer). IASI is part of the EUMETSAT Polar System programme (EPS) that comprises the current polar-orbiting satellites Metop-A, Metop-B and Metop-C. The mission started in 2006 and will be continued from 2022 onwards by its successor EPS-SG (EPS Second Generation), committed to operating the next-generation sensors IASI-NG (IASI Next Generation) on board three new Metop satellites for another 20 years. Based on the full swath width of the IASI sensors, this mission is able to provide a global scan of the atmosphere multiple times per day, with about 350 000 cloud-free observations per sensor and per day. The overpasses are designed such that the orbits cross the Equator at approximately 09:30 and 21:30 LT (Clerbaux et al., 2009).

To process the enormous number of IASI measurements, we have set up a quasi-operational processing chain that efficiently runs on high-performance computing clusters (Schneider et al., 2021b). It comprises an extended version of the MUSICA IASI retrieval (Schneider and Hase, 2011). In this context we present the most recent updates regarding the optimal estimation {H2O, δD} pair product from Schneider et al. (2012), including an a posteriori enhancement of the sensitivity for dry conditions. We discuss and apply a method for achieving an a posteriori reduction of the retrieval constraints. According to the local overpass times, the final results are sorted into morning and evening observations for each day and stored in global output files. The chosen output format is NetCDF4, and the metadata are in agreement with the CF conventions for metadata (version 1.7; last access: 9 November 2021). Additional diagnostic flag variables reflecting the data quality of the retrieved {H2O, δD} pairs support an intuitive and user-friendly data selection. By post-processing all cloud-free MUSICA IASI retrieval results of Schneider et al. (2021b), we have generated a multi-annual and global dataset of tropospheric {H2O, δD} pairs for the whole period from October 2014 to December 2020. Accordingly, this dataset is referred to as the post-processed Level-2 dataset, and, depending on available resources, it is constantly being updated with current IASI measurements.

Since this Level-2 dataset comprises the {H2O, δD} pair results together with several retrieval metrics (e.g. averaging kernels, uncertainty metrics) for all globally available IASI observations in the given time period, this dataset is rather comprehensive and storage-intensive. For this reason, we additionally generate a simplified and user-oriented dataset of mid-tropospheric {H2O, δD} pairs, which only contains the quality-filtered {H2O, δD} pair data on selected heights of interest and is re-gridded on a regular 1° × 1° grid. This so-called Level-3 dataset has a significantly reduced computational resource requirement and is thus well-suited for larger-scale applications.

The paper is organized as follows: Sect. 2 provides a brief overview of the MUSICA IASI processor and describes details of the improved a posteriori generation of {H2O, δD} pairs (Level-2). This includes information about the corresponding error treatment and data filtering as well as the generation of a re-gridded Level-3 product. The output volume of the full {H2O, δD} pair datasets is documented in Sect. 3. In Sect. 4, we document the data availability of the full dataset in terms of spatial and temporal coverage. Section 5 shows examples of {H2O, δD} pair data over the tropical Atlantic and West Africa for the entire data period. Finally, information about the data access is given in the data availability section. A technical user guide for supporting the use of the Level-2 and Level-3 datasets is attached as the Supplement.

2 MUSICA IASI {H2O, δD} pair post-processing

As part of the MUSICA project (MUlti-platform remote Sensing of Isotopologues for investigating the Cycle of Atmospheric water, 2011 to 2016; Schneider et al., 2016) a retrieval processor was developed and validated for creating {H2O, δD} pair information together with N2O, CH4 and HNO3 from IASI spectra. In MUSICA follow-up projects, this processing has been further improved and now runs efficiently on high-performance computing clusters. Figure 1 provides an overview of the full processing chain. The main retrieval processing consists of the pre-processing stage, the PROFFIT-nadir retrieval and the output generation and is extensively described in Schneider et al. (2021b). The supply of the full product dataset (Schneider et al., 2021b) offers very good possibilities for data re-usage. Examples are an a posteriori synergetic use with products from other sensors (Schneider et al., 2021a) and the a posteriori generation of an optimal estimation {H2O, δD} pair product as presented in this paper.

Figure 1. Overview of the full MUSICA IASI processing chain and its output products. The red frame indicates the part that is documented in the underlying paper. Further information about this processing chain can be found in Schneider et al. (2021a).


Here, we briefly recall the relevant information of the MUSICA IASI retrieval and subsequently present the improved post-processing for creating and evaluating the {H2O, δD} pairs.

2.1 Main characteristics of IASI

IASI is a Fourier transform spectrometer measuring the thermal infrared upwelling radiation that is affected by atmospheric processes like absorption and scattering. Its spectral resolution is 0.5 cm−1. The polar sun-synchronous orbits are designed such that the satellites overpass the Equator at 09:30 and 21:30 LT. With around 14–15 orbits per satellite per day and a full swath width of 2200 km, each IASI sensor achieves a global scan of the atmosphere twice daily. The launches of Metop-A, Metop-B and Metop-C took place in 2006, 2012 and 2018, respectively. The expected lifetime of each satellite is 5 years. Currently, all three Metop satellites are successfully operating in orbit. Further sensor details are listed in Clerbaux et al. (2009).

2.2 MUSICA IASI retrieval

The MUSICA IASI retrieval represents an optimal estimation algorithm for retrieving vertical profiles of mixing ratios of water vapour isotopologues and the trace gases CH4, N2O and HNO3 as well as atmospheric and surface temperatures. It uses the nadir version of the radiative transfer code PROFFIT (Hase et al., 2004) for the spectral window of 1190–1400 cm−1 and an iterative Gauss–Newton method for the inversion calculations (Rodgers, 2000; Schneider and Hase, 2011). As proposed in Schneider et al. (2006) and Worden et al. (2006), the retrieval handles the trace gas variations on a logarithmic scale. For water vapour, this enables the use of the retrieval state vectors (ln[H2O]+ln[HDO])/2 and (ln[HDO]−ln[H2O]) that constitute reliable proxies for variations in H2O and δD (Schneider et al., 2012). A concluding post-processing step performs a data compression for large output matrices and creates an output dataset compliant with the CF metadata conventions. For a full technical documentation of the most recent MUSICA IASI retrieval, please refer to Schneider et al. (2021b).
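The relation between the two logarithmic proxy states and the physical quantities H2O and δD can be sketched as follows. This is an illustrative round trip and not part of the MUSICA processor; the function names and the value of the VSMOW ratio are assumptions made for this example.

```python
import numpy as np

R_VSMOW = 3.1152e-4  # assumed HDO/H2O ratio of VSMOW


def to_proxy(h2o, hdo):
    """Map mixing ratios to the proxy states
    (ln[H2O] + ln[HDO]) / 2 and ln[HDO] - ln[H2O]."""
    ln_h2o, ln_hdo = np.log(h2o), np.log(hdo)
    return 0.5 * (ln_h2o + ln_hdo), ln_hdo - ln_h2o


def from_proxy(p_h2o, p_dd):
    """Recover H2O and deltaD (per mille) from the two proxy states."""
    h2o = np.exp(p_h2o - 0.5 * p_dd)  # ln[H2O] = p_h2o - p_dd / 2
    delta_d = (np.exp(p_dd) / R_VSMOW - 1.0) * 1000.0  # exp(p_dd) = HDO/H2O
    return h2o, delta_d


# Round trip for two made-up air masses.
h2o_in = np.array([8000.0, 500.0])      # ppmv
dd_in = np.array([-100.0, -300.0])      # per mille
hdo_in = h2o_in * R_VSMOW * (1.0 + dd_in / 1000.0)
h2o_out, dd_out = from_proxy(*to_proxy(h2o_in, hdo_in))
```

The second proxy state depends only on the ratio HDO/H2O, which is why it serves as a proxy for δD, whereas the first varies mainly with the total water content.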

2.3 Post-processing for {H2O, δD} pairs

By considering the full MUSICA IASI retrieval product, we apply a post-processing for optimizing the water isotopologue states and generate an optimal estimation {H2O, δD} pair product. The following section provides details about the corresponding processing, including information about the error treatment and data selection according to data quality.

2.3.1 Vertical representativeness of a remotely sensed observation

In general, a retrieved height-dependent state vector x̂ represents a smoothed image of the true atmospheric state x_atm and is defined according to the averaging kernel matrix A and the a priori state vector x_a:

(2) $\hat{x} = A \, (x_\mathrm{atm} - x_\mathrm{a}) + x_\mathrm{a}$.

Following the definition by Rodgers (2000), an averaging kernel (rows of A) depicts the fraction of the retrieved result coming from the retrieval itself and not from the a priori assumption. In the case of a perfect retrieval, the kernel matrix would equal the identity matrix, expressing total independence of the retrieval results from the chosen a priori state. Thus, the degree of deviation from unity quantifies the vertical information content of a remotely sensed observation.

For instance, a common metric for describing the vertical information content is the degrees of freedom for signal (DOFS). It is defined as the trace of the averaging kernel matrix. The value of DOFS indicates the number of vertical structures that can be independently determined from an observation (Rodgers, 2000).

Further, the sum of the values along an individual averaging kernel is called measurement response (Eriksson, 2000; Baron et al., 2002). A measurement response of 1 implies that the retrieved state is a smoothed but unbiased image of the true atmospheric profile, whereas values deviating from unity are induced if the retrieval constraint deviates from pure smoothing (von Clarmann et al., 2020).
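The quantities introduced so far (Eq. 2, DOFS and measurement response) can be illustrated with a small numerical sketch. The 3 × 3 kernel matrix and the profiles below are made-up toy values, and the function names are chosen for this example only.

```python
import numpy as np


def smooth(x_atm, x_a, A):
    """Eq. (2): retrieved state as a smoothed image of the true state."""
    return A @ (x_atm - x_a) + x_a


def dofs(A):
    """Degrees of freedom for signal: trace of the averaging kernel matrix."""
    return np.trace(A)


def measurement_response(A):
    """Sum along each row kernel of A."""
    return A.sum(axis=1)


# Toy kernel matrix on three altitude levels (made-up numbers).
A = np.array([[0.5, 0.3, 0.0],
              [0.2, 0.5, 0.2],
              [0.0, 0.3, 0.4]])
x_a = np.array([1.0, 1.0, 1.0])    # a priori profile
x_atm = np.array([2.0, 2.0, 2.0])  # "true" profile, uniformly offset by +1

x_hat = smooth(x_atm, x_a, A)
```

For a true profile that differs from the a priori by a uniform offset, the retrieved offset at each level equals the measurement response of that level. This makes explicit why a response below 1 means the result is pulled towards the a priori.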

To examine the vertical resolution of a retrieved profile (i.e. the capability to detect vertical structures), metrics that characterize the vertical sensitivity, i.e. the shape of the averaging kernels, can be valuable. First, the relative position of the sensitivity-weighted altitude compared to its nominal altitude is termed information displacement. For this, we use the centroid offset as defined by Backus and Gilbert (1970) in a discretized form (Keppens et al., 2015). Second, to describe the vertical smoothing of the retrieved state, the MUSICA IASI retrievals provide two different diagnostics. The definition of Backus and Gilbert (1970) is used to create a kernel-weighted spread around the kernel centroid, while the data density reciprocal of Purser and Huang (1993) serves to indicate the layer width that covers a DOFS of 1. Discussions of these two metrics can be found in Keppens et al. (2015) and von Clarmann et al. (2020), and in the context of the MUSICA IASI full retrieval product in Schneider et al. (2021b).

2.3.2 Generation of an optimal estimation {H2O, δD} pair product

Due to its high variability in the troposphere, H2O can be detected very well in contrast to δD, which varies only weakly. As the MUSICA IASI retrieval produces an individual optimal estimation for the different target states, this generally results in different averaging kernels for H2O and δD. Specifically, the vertical sensitivity of H2O is much higher than for δD (Schneider et al., 2021b). However, an optimal analysis of {H2O, δD} pairs would require similar characteristics in the vertical sensitivities to ensure that the retrieved H2O and δD refer to the same vertical structures. Therefore, Schneider et al. (2012) proposed a post-processing for harmonizing the vertical information contents of individually retrieved H2O and δD. This is achieved by reducing the strength of the averaging kernels of H2O with respect to those from δD. The added value has been proven for IASI results against long-term datasets from ground-based remote sensing stations and in situ aircraft measurements (Wiegele et al., 2014; Schneider et al., 2015).

As a first step towards harmonizing the sensitivities of H2O and δD, we need to transform the water vapour state vector x̂_wv and the corresponding averaging kernel block matrix A_wv

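(3) $\hat{x}_\mathrm{wv} = \begin{pmatrix} \hat{x}_\mathrm{wv,1} \\ \hat{x}_\mathrm{wv,2} \end{pmatrix}, \quad A_\mathrm{wv} = \begin{pmatrix} A_\mathrm{wv,11} & A_\mathrm{wv,12} \\ A_\mathrm{wv,21} & A_\mathrm{wv,22} \end{pmatrix}$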

from the {ln [H2O],ln [HDO]} basis system to the water vapour proxy base {(ln[H2O]+ln[HDO])/2,(ln[HDO]-ln[H2O])}.
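The change of basis named here can be sketched numerically. The block form P = [[I/2, I/2], [−I, I]] and the relations x̂′_wv = P x̂_wv and A′_wv = P A_wv P⁻¹ used below are inferred from the two bases stated above and should be read as an assumption, not as a verbatim reproduction of the paper's equations. The function names and toy values are hypothetical.

```python
import numpy as np


def proxy_operator(n):
    """Block operator mapping {ln[H2O], ln[HDO]} (n levels each)
    to {(ln[H2O] + ln[HDO]) / 2, ln[HDO] - ln[H2O]}."""
    I = np.eye(n)
    return np.block([[0.5 * I, 0.5 * I],
                     [-I, I]])


def to_proxy_basis(x_wv, A_wv):
    """Transform a stacked state vector and its kernel block matrix."""
    n = x_wv.size // 2
    P = proxy_operator(n)
    return P @ x_wv, P @ A_wv @ np.linalg.inv(P)


# Toy state on two levels (made-up mixing ratios in ppmv).
ln_h2o = np.log(np.array([8000.0, 2000.0]))
ln_hdo = np.log(np.array([2.2, 0.5]))
x_wv = np.concatenate([ln_h2o, ln_hdo])
A_wv = np.eye(4)  # idealized kernel, for illustration only

x_proxy, A_proxy = to_proxy_basis(x_wv, A_wv)
```

The first half of `x_proxy` holds the H2O proxy and the second half the δD proxy. An identity kernel stays an identity kernel, since the transformation is a similarity transform.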


In the following, primed variables consistently refer to the water vapour proxy state base. Detailed information about the transformation operator P can be found in Schneider et al. (2012), Wiegele et al. (2014) and Barthlott et al. (2017). x̂′_wv is in fact the state that is optimally estimated by the MUSICA IASI retrieval, and it represents proxies for H2O and δD.

The following step harmonizes the differing sensitivities of the water vapour proxy states by reducing the sensitivity of the H2O proxy to the sensitivity of the δD proxy. The a posteriori correction operator C

(7) $C = \begin{pmatrix} A'_\mathrm{wv,22} & 0 \\ -A'_\mathrm{wv,21} & I \end{pmatrix}$

serves to create the harmonized product


which is called the Type 2 product in Schneider et al. (2012). The main property of x^wv is that it provides profiles for H2O and δD having practically the same averaging kernels. This allows meaningful analyses of paired {H2O, δD} distributions.
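The effect of the correction operator C from Eq. (7) on the proxy-state kernels can be sketched as below. Applying C to the kernels as a simple matrix product C A′_wv is an assumption made for illustration, and the toy kernel blocks are made-up values.

```python
import numpy as np


def harmonize_kernels(A_proxy):
    """Build C from the proxy-base kernel blocks (Eq. 7) and apply it.

    A_proxy is a (2n, 2n) block matrix [[A11, A12], [A21, A22]].
    Returns C and the product C @ A_proxy (assumed harmonized kernels).
    """
    n = A_proxy.shape[0] // 2
    A21 = A_proxy[n:, :n]
    A22 = A_proxy[n:, n:]
    C = np.block([[A22, np.zeros((n, n))],
                  [-A21, np.eye(n)]])
    return C, C @ A_proxy


# Toy case: H2O proxy perfectly resolved, deltaD proxy only weakly resolved.
A11 = np.eye(2)
A22 = np.array([[0.6, 0.2],
                [0.2, 0.5]])
Z = np.zeros((2, 2))
A_proxy = np.block([[A11, Z],
                    [Z, A22]])

C, A_harm = harmonize_kernels(A_proxy)
```

In this idealized case the H2O proxy block of the product becomes identical to the δD proxy block, which illustrates the statement that both components end up with practically the same averaging kernels.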

Figure 2a illustrates the row kernels from Awv for two exemplary {H2O, δD} pair results above a polar and a tropical site. The main sensitivity lies in the free troposphere, with peaks at about 4 km a.s.l. (above sea level). The tropical observation shows additional sensitivity at 6–7 km a.s.l. The kernels for the polar observation show values of measurement response and DOFS falling below 1 by up to 30 % (see values in the parentheses in the legend), meaning there is low vertical sensitivity. In contrast, the {H2O, δD} pair kernels for the tropical observation exhibit a DOFS of 1.68, though its measurement response still deviates from 1 by about 10 %–14 %. In the next section we introduce a method that increases the DOFS and the measurement response a posteriori (the corresponding kernels are shown in Fig. 2b but will be discussed in Sect. 2.3.3).

Figure 2. Row averaging kernels for the {H2O, δD} pair product with diagonal constraint (a) and without diagonal constraint (b). The kernels are shown for a polar observation (83.5° N, 147.0° W, left panels in a and b) and for a tropical observation (4.8° N, 45.8° W, right panels in a and b), both above oceans and measured on 1 July 2017. The measurement response values for the kernels at 3.0, 4.2 and 6.4 km a.s.l. are given in the respective parentheses in the legends.


2.3.3 Reduction of retrieval regularization

By inspecting the row kernels in Awv for a full orbit, we observe that there is a general and non-negligible deficit in the measurement response of the {H2O, δD} pairs. The blue dots in Fig. 3 show that, for instance, at 1.8, 4.2 and 6.4 km a.s.l. and for moisture contents below 10^4 ppmv, a large amount of data contain measurement response values far below 1.

Figure 3. Measurement response for the MUSICA IASI {H2O, δD} pair product along Metop-A orbit 55524 on 1 July 2017, for the original product (blue scatter) and the optimized one (orange scatter). The former includes the diagonal retrieval constraint, while it is removed for the latter. Results are shown for 1.8, 4.2 and 6.4 km a.s.l.


As emphasized in Sect. 2.3.1, the measurement response is a metric for the influence of the a priori assumptions on the retrieval result. Thus, a too low measurement response can be an indicator for a too strong retrieval constraint that excessively pulls the retrieved states towards the a priori profiles (Rodgers, 2000; von Clarmann et al., 2020). Therefore, to reduce the observed lack of sensitivity in the {H2O, δD} pairs, we apply a method for a posteriori modifying and reducing the underlying retrieval constraint. For this purpose, we adapt a linear optimal estimation method from Rodgers and Connor (2003) that creates a best estimate of a given retrieval result with regard to a new constraint:

(10) $M = R_\mathrm{d}^{-1} A^T \left( A R_\mathrm{d}^{-1} A^T + S_{\hat{x},\mathrm{noise}} \right)^{-1}$.

The purpose of the operator M is that it allows a modification of the retrieval solution x̂ and its kernels A according to a weaker constraint. R_d is the regularization matrix chosen according to the desired constraint.

In general, a regularization restricts the variability of a retrieved state vector in order to keep the retrieval solution within the range of physically realistic profiles (Phillips, 1962; Tikhonov, 1963; Rodgers, 2000). By reducing its strength, we increase the allowed variability for the retrieved states. As a consequence, the information content increases. On the downside, a weaker constraint causes larger noise and can produce information that is not provided by the measurement (Rodgers, 2000). Therefore, the remainder of this section discusses the optimal definition of the input matrices R_d, A and S_x̂,noise as well as the correct usage of M in order to enhance the sensitivity of the {H2O, δD} pairs.

First of all, to achieve the full benefit from the matrix operations in Eq. (10), we consider the full MUSICA IASI state for the kernel matrix A as this is also used during the original retrieval processing. This means that we need to take into account the kernel A′_wv of the non-harmonized water vapour proxy state from Eq. (6) and the interfering effects of the other retrieval state vectors. Since the retrieval output from Schneider et al. (2021b) provides only the dominant averaging kernels and cross-correlations, we build A as follows:

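(11) $A = \begin{pmatrix} A'_\mathrm{wv,11} & A'_\mathrm{wv,12} & 0 & 0 & 0 & A_\mathrm{t,wv,1} \\ A'_\mathrm{wv,21} & A'_\mathrm{wv,22} & 0 & 0 & 0 & A_\mathrm{t,wv,2} \\ 0 & 0 & A_\mathrm{ghg,11} & A_\mathrm{ghg,12} & 0 & A_\mathrm{t,ghg,1} \\ 0 & 0 & A_\mathrm{ghg,21} & A_\mathrm{ghg,22} & 0 & A_\mathrm{t,ghg,2} \\ 0 & 0 & 0 & 0 & A_\mathrm{hno3} & A_\mathrm{t,hno3} \\ 0 & 0 & 0 & 0 & 0 & A_\mathrm{t} \end{pmatrix}$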

The kernels of N2O and CH4 are denoted as A_ghg,11 and A_ghg,22, respectively. The indices 21 and 12 indicate the respective cross-dependencies. The cross-dependency of the temperature retrieval to the water vapour proxy states is marked with A_t,wv,1 and A_t,wv,2. Entries for which the corresponding kernel matrices are not provided are filled with null matrices 0.

Further, we calculate S_x̂,noise, the retrieval noise error covariance, i.e. the variability in the measured radiances that was not explained during the retrieval processing. It can be calculated from A and the regularization matrix R that was originally applied during the retrieval (Rodgers, 2000; Schneider et al., 2021b):

(12) $S_{\hat{x},\mathrm{noise}} = A \, (I - A) \, R^{-1}$.

Now the question arises about the choice of a new meaningful regularization matrix R_d. For this purpose, we first recapitulate the original MUSICA approach for setting up the retrieval constraint. For each target species an individual covariance matrix S_a is given that describes the potential departure of the retrieval solution from the a priori state. This depends on the choice of the height-dependent correlation length, the a priori assumed size of vertical structures that may be resolvable (Schneider et al., 2021b). By inverting S_a, we obtain the corresponding regularization matrix R. During the MUSICA IASI retrieval, the inversion of S_a is realized by a decomposition into its diagonal and derivative values (Hase et al., 2004; Schneider et al., 2021b):

(13) $R = (\alpha_0 L_0)^T \alpha_0 L_0 + (\alpha_1 L_1)^T \alpha_1 L_1 + (\alpha_2 L_2)^T \alpha_2 L_2$,

with α_i as the strength of the individual constraining terms and L_i as the constraint operators. The diagonal matrices α_i are derived from S_a and are provided for each target state as output variables from the MUSICA IASI retrieval (Schneider et al., 2021b). L_0 represents the diagonal constraint operator and equals the identity matrix I. Its effect is to shift the retrieved profile towards the a priori profile. L_1 and L_2 are the first- and second-order derivative operators and constrain the retrieved profile according to the shape of the a priori covariance, thereby representing smoothing constraints. For the retrieval of atmospheric trace gases with weak spectroscopic signatures, smoothing constraints can be advantageous over L_0, because a diagonal constraint tightens the retrieval by means of the absolute a priori values, potentially inducing a bias in the retrieval (e.g. Steck, 2002). Therefore, we infer that the consideration of the diagonal constraint in Eq. (13) causes the observed lack of sensitivity in the {H2O, δD} pair data for dry conditions. Following this hypothesis, we remove the diagonal constraint operators for the water vapour states and create the new weaker regularization matrix R_wv,d:

(14) $R_\mathrm{wv,d} = (\alpha_1 L_1)^T \alpha_1 L_1 + (\alpha_2 L_2)^T \alpha_2 L_2$.
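The structure of Eqs. (13) and (14) can be sketched with plain finite-difference operators. The operators used by the actual retrieval may be defined differently (e.g. regarding grid spacing or boundary treatment), and the α values below are made up. Both are assumptions for illustration.

```python
import numpy as np


def difference_operator(n, order):
    """Finite-difference operator of the given order, shape (n - order, n).

    order 0 returns the identity, i.e. the diagonal constraint operator L0.
    """
    return np.diff(np.eye(n), n=order, axis=0)


def regularization(alphas, n, orders=(0, 1, 2)):
    """Sum of (alpha_i L_i)^T (alpha_i L_i) over the requested orders.

    alphas maps each order to a 1-D array holding the diagonal of alpha_i
    (length n - order).
    """
    R = np.zeros((n, n))
    for order in orders:
        aL = np.diag(alphas[order]) @ difference_operator(n, order)
        R += aL.T @ aL
    return R


n = 5
alphas = {0: np.full(n, 0.5),       # made-up constraint strengths
          1: np.full(n - 1, 2.0),
          2: np.full(n - 2, 1.0)}

R = regularization(alphas, n)                     # Eq. (13)
R_wv_d = regularization(alphas, n, orders=(1, 2)) # Eq. (14)
```

In this toy setup a constant profile offset is not penalized at all by R_wv_d, which is the intended effect of dropping the diagonal term: the retrieved profile is no longer pulled towards the absolute a priori values. It also means that the toy R_wv_d on its own is singular.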

Keeping the regularization matrices of the other target states unchanged, we can then build the new regularization matrix Rd for the full MUSICA state. With that, Eq. (10) is fully determined, and we now can use M to adjust the kernel matrix A according to the new constraint Rd:

(15) $A_\mathrm{m} = M A$.
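The interplay of Eqs. (10), (12) and (15) can be checked on a small linear toy problem. The Jacobian K, the noise level and the two constraint matrices below are made-up values, and both constraints are chosen invertible for simplicity. How the operational processing treats the inversion of the reduced constraint is not specified here.

```python
import numpy as np


def noise_covariance(A, R):
    """Eq. (12): S_noise = A (I - A) R^-1."""
    I = np.eye(A.shape[0])
    return A @ (I - A) @ np.linalg.inv(R)


def constraint_modifier(A, S_noise, R_d):
    """Eq. (10): M = R_d^-1 A^T (A R_d^-1 A^T + S_noise)^-1."""
    Rd_inv = np.linalg.inv(R_d)
    return Rd_inv @ A.T @ np.linalg.inv(A @ Rd_inv @ A.T + S_noise)


rng = np.random.default_rng(0)
n = 4
K = rng.normal(size=(6, n))   # made-up Jacobian
H = K.T @ K / 0.1**2          # K^T S_e^-1 K for a noise std of 0.1
R = 50.0 * np.eye(n)          # original (stronger) constraint
R_d = 5.0 * np.eye(n)         # reduced constraint

A = np.linalg.solve(H + R, H)             # kernel of the original retrieval
S_noise = noise_covariance(A, R)
M = constraint_modifier(A, S_noise, R_d)
A_m = M @ A                               # Eq. (15)

# Kernel that a retrieval run directly with R_d would have produced.
A_direct = np.linalg.solve(H + R_d, H)
```

In this linear setting, `A_m` agrees with `A_direct` to numerical precision, so the a posteriori modification reproduces the kernel of a retrieval performed with the weaker constraint, and its trace (the DOFS) is larger than that of the original kernel.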

Based on the optimized kernel matrix Am, we can now create the optimal {H2O, δD} pair information for the constraint reduced state. By extracting Awv,m as the first 2 × 2 block from Am, we calculate the new a posteriori operator Cm analogous to Eq. (7) and generate the constraint reduced pair product:


This product Awv,m with reduced constraint shows a clear increase in the measurement response (see lower panels in Fig. 3). While the improvements are rather small for 6.4 km a.s.l., the results at 1.8 and 4.2 km a.s.l. have a much better measurement response for moisture contents above 700 ppmv.

Figure 4. Time series of different retrieval metrics of observations along the Metop-A orbit 55524. Panel (a) shows the location of the observations along the orbit and colour-coded with the chronological observation numbering for this orbit (observation IDs). The grey scatter illustrates all cloud-free Metop-A and Metop-B observations for 1 July 2017 that were considered for the {H2O, δD} pair post-processing. Panel (b) indicates the time series of the measurement response for the {H2O, δD} pairs (upper panel) and the total δD error (lower panel), both for 4.2 km a.s.l. Results for the original product and the product after reducing the retrieval constraint are shown in blue and red, respectively. The solid lines are running means over 200 observations.

The time series of the measurement response along the orbit used in Fig. 3 is shown in Fig. 4b (upper panel). It is found that the constraint reduction leads to a general decrease in the deviation from 1. Over the Pacific and Atlantic oceans (observation IDs of 2500–7500 and 15 000–20 000) there is a shift of the slightly over-estimated measurement response towards 1. In contrast, for higher latitudes its values are originally below 1, but they increase significantly due to the constraint reduction. This is in particular pronounced for observations above Australia (observation IDs of 7500–10 000), where an averaged increase in the measurement response of up to 0.5 is apparent. Also, for polar observations of the Northern Hemisphere (observation IDs of 0–2500 and 20 000–25 000), the measurement response strongly improves.

Analogous improvements become apparent for the individual row kernels in Fig. 2 (compare Fig. 2a and b). The measurement response increases for the dry polar data at 3.0 and 4.2 km a.s.l. by 56 % and 18 %, respectively. Also, for the tropical site, the measurement response values approach unity. These improvements are not at the expense of vertical resolution; instead they go along with respective improvements in the maximum amplitudes of the individual kernels as well as in the DOFS. For instance, the DOFS over the tropical site increases from 1.68 to 2.01, indicating that now information at two different altitude layers can be estimated independently.

2.3.4 Error treatment

Several studies have intensively discussed the error treatment for satellite observations in general (Rodgers, 2000; von Clarmann et al., 2020) and with a focus on MUSICA IASI retrieval data (Schneider and Hase, 2011; Borger et al., 2018). Schneider et al. (2021b) provided an overview of the errors that result for the most recent MUSICA IASI retrieval. Along with the kernel modifications for reducing the diagonal constraint for water vapour (see Sect. 2.3.3), a respective processing is required for the dominant MUSICA IASI error covariances.

Given the error covariance S_x̂ in the proxy state base, we use the optimized a posteriori operator C_m to transform it according to the reduced constraint:

(18) $S_{\hat{x},\mathrm{m}} = C_\mathrm{m} M S_{\hat{x}} M^T C_\mathrm{m}^T$.

We perform this processing for the retrieval noise error covariance S_x̂,noise from Eq. (12) and for the temperature cross-covariance S_x̂,temp.:

(19) $S_{\hat{x},\mathrm{temp.}} = A_\mathrm{t,wv} \, S_\mathrm{a,temp.} \, A_\mathrm{t,wv}^T$.

This strongly depends on the choice of the assumed a priori uncertainty covariance S_a,temp. (Schneider et al., 2021b).

As these two are the dominant errors for the MUSICA IASI δD product (Schneider et al., 2021b), we use their sum as an estimate of the total error covariance for the optimized H2O and δD states:

(20) $S_{\hat{x},\mathrm{tot,m}} = S_{\hat{x},\mathrm{noise,m}} + S_{\hat{x},\mathrm{temp.,m}}$.
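Equations (18) and (20) can be sketched as a short propagation routine. Since M acts on the full MUSICA state whereas C_m acts on the water vapour block only, the sketch restricts the modified covariance to the water vapour block before applying C_m. This restriction, the function names and the toy matrices are assumptions made for this example.

```python
import numpy as np


def propagate_error(S, M, C_m, wv):
    """Eq. (18): C_m M S M^T C_m^T, restricted to the water vapour block.

    S and M are defined on the full state; wv is a slice selecting the
    water vapour proxy states; C_m acts on that block only.
    """
    S_mod = M @ S @ M.T
    S_wv = S_mod[wv, wv]
    return C_m @ S_wv @ C_m.T


def total_error(S_noise, S_temp, M, C_m, wv):
    """Eq. (20): sum of the propagated noise and temperature covariances."""
    return (propagate_error(S_noise, M, C_m, wv)
            + propagate_error(S_temp, M, C_m, wv))


# Toy setup: full state of size 6, first 4 entries are water vapour states.
n_full, n_wv = 6, 4
wv = slice(0, n_wv)
S_noise = 0.01 * np.eye(n_full)  # made-up covariances
S_temp = 0.04 * np.eye(n_full)
M = np.eye(n_full)               # no constraint modification
C_m = 0.5 * np.eye(n_wv)         # made-up correction operator

S_tot = total_error(S_noise, S_temp, M, C_m, wv)
```

With these toy inputs every diagonal element of `S_tot` equals 0.5² × (0.01 + 0.04) = 0.0125.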

The bottom panel in Fig. 4b illustrates how the total δD error changes due to the a posteriori constraint reduction. In general, with relaxing the regularization strength, the retrieval noise will increase (e.g. Rodgers, 2000). Following this behaviour, the δD error exhibits a strong increase for areas where the impact of the regularization optimization is large and the measurement response increases. For instance, the strong improvements of the measurement response over the dry Australian desert are at the expense of increasing the averaged δD error by 20 ‰ with single peaks up to 50 ‰. An increase in the noise is also observed for high latitudes in the Northern Hemisphere, whereas for observations above the Pacific and Atlantic oceans the noise is only slightly affected (compare with discussion in Sect. 2.3.3).

2.3.5 Data filtering

Supplementary to the raw IASI L1C measurements, EUMETSAT distributes auxiliary L2 diagnostics, such as cloud cover and surface type. Utilizing these diagnostics, Schneider et al. (2021b) provide the MUSICA IASI retrieval results for (almost) cloud-free conditions over land, oceans and sea ice, with small cloud contaminations being possible. They supply an additional diagnostic flag variable containing information about the quality of the MUSICA IASI spectral fit. Observations where the simulated spectrum did not converge to the measured spectrum are sorted out from the outset.

As part of the {H2O, δD} pair post-processing, we recommend an additional data selection according to the quality of the retrieved {H2O, δD} pair results. For this purpose, we provide further height-dependent flags that allow for intuitive and user-friendly data handling.

First, we introduce the flag variable musica_wvp_kernel_flag for filtering the {H2O, δD} pairs according to their vertical sensitivity, i.e. to obtain retrieval results that are actually sensitive to the true atmospheric state rather than to the a priori state (see Sect. 2.3.1). We therefore define this flag based on the sensitivity metrics of the kernel matrix $\mathbf{A}_{\mathrm{wv},m}$. For the measurement response, we require values between 0.8 and 1.2. To limit the information displacement at an altitude level $z(i)$, we define the following criterion:

$$\frac{|c(i) - z(i)|}{z_{\mathrm{cl}}(i)} \le 0.5, \tag{21}$$

with $c(i)$ being the centroid of the corresponding averaging kernel (Keppens et al., 2015) and $z_{\mathrm{cl}}(i)$ the a priori correlation length at the respective altitude level (Schneider et al., 2021b). This criterion ensures that the deviation of the centroid from the nominal height is less than half of the a priori correlation length. As a filter condition for the vertical resolution, we propose

$$\frac{r_{\mathrm{LW}}(i)}{z_{\mathrm{cl}}(i)} \le 4. \tag{22}$$

Here, $r_{\mathrm{LW}}(i)$ is the layer width per one DOFS from Purser and Huang (1993) (see Sect. 2.3.1), which serves as a proxy for the vertical resolution of an averaging kernel. Because the kernel properties are considered relative to the correlation length, kernels with larger metric values can still pass the aforementioned filters if a correspondingly larger correlation length is assumed.

Second, we provide the error flag musica_deltad_error_flag that identifies data points with too high uncertainties in the δD retrieval results, namely errors due to measurement noise and atmospheric temperature uncertainties. The corresponding height-dependent flag marks retrieval results with a total δD error below 40 ‰.
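The two flag definitions can be sketched as boolean masks per altitude level. This is a minimal illustration under stated assumptions: the function names and the toy numbers are hypothetical, the bounds of the measurement response are treated as inclusive, and the inequality signs follow the criteria of Eqs. (21) and (22) as described in the text.

```python
import numpy as np

def kernel_flag(response, centroid, altitude, corr_length, layer_width):
    """Return 1 where all three kernel criteria hold, else 0 (per altitude level)."""
    response_ok = (response >= 0.8) & (response <= 1.2)                 # measurement response
    displacement_ok = np.abs(centroid - altitude) / corr_length <= 0.5  # Eq. (21)
    resolution_ok = layer_width / corr_length <= 4.0                    # Eq. (22)
    return (response_ok & displacement_ok & resolution_ok).astype(np.int8)

def error_flag(total_dd_error, threshold=40.0):
    """Return 1 where the total deltaD error (permil) is below the threshold."""
    return (total_dd_error < threshold).astype(np.int8)

# Toy values for three altitude levels (made up for illustration, all lengths in km).
altitude = np.array([2.9, 4.2, 6.4])
corr_length = np.array([2.0, 2.5, 3.0])
response = np.array([0.95, 1.05, 0.60])
centroid = np.array([3.5, 4.0, 7.0])
layer_width = np.array([6.0, 12.0, 8.0])
total_dd_error = np.array([25.0, 35.0, 55.0])

k_flag = kernel_flag(response, centroid, altitude, corr_length, layer_width)
e_flag = error_flag(total_dd_error)
reliable = (k_flag == 1) & (e_flag == 1)  # simultaneous application of both flags
```

In this toy case only the lowest level passes both flags: the middle level fails the resolution criterion and the upper level fails the measurement response and the error threshold.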

Figure 5. Effects of individual filter criteria on the corresponding metrics, along the Metop-A orbit 55524, as also shown in Fig. 4. The grey scatter shows all observations along the specific orbit, the cyan scatter shows the available data for each variable according to its individual filter criterion and the dark blue scatter shows the available data when simultaneously filtering for all four metrics. The metrics (a–c) are used to define the flag musica_wvp_kernel_flag, and the noise in (d) is used for the flag musica_deltad_error_flag (compare with Table 1).


Table 1. Diagnostic flag variables and their recommended values for selecting MUSICA IASI {H2O, δD} pair data with high quality. The flags indicating the vertical sensitivity (musica_wvp_kernel_flag) and uncertainty (musica_deltad_error_flag) of the {H2O, δD} pair product are individually set for each altitude level, while the flags for the cloud cover (eumetsat_cloud_summary_flag) and the retrieval fit quality (musica_fit_quality_flag) are not height-dependent (Schneider et al., 2021a). The vertical sensitivity flag depends on the measurement response, information displacement and vertical resolution of the {H2O, δD} pair kernels, and the total error flag depends on the sum of the temperature and noise error of δD.


The aforementioned filter conditions are visualized in Fig. 5 and the respective flags and their recommended values are summarized in Table 1. The flag variables musica_wvp_kernel_flag and musica_deltad_error_flag are binary; i.e. they only consist of the values 1 (for indicating high quality) and 0 (for low quality). Even though the recommended filter conditions are chosen somewhat arbitrarily, they efficiently remove recognizable outliers in terms of kernel properties (see Fig. 5a–c) and data uncertainties (see Fig. 5d) of the retrieved {H2O, δD} pairs. Therefore, the simultaneous application of the corresponding quality flags musica_wvp_kernel_flag and musica_deltad_error_flag serves for a convenient and meaningful selection of reliable {H2O, δD} pair data. However, to enable a flexible adjustment of the individual filter conditions for individual purposes, the output datasets contain the filtered and unfiltered {H2O, δD} pair data together with the flag and filter variables.

2.3.6 Matrix compression

Analogous to Schneider et al. (2021b), the averaging kernel matrices for the {H2O, δD} pairs are stored in a decomposed and compressed format in order to reduce the required storage volume. For this purpose, we apply a singular value decomposition for the matrices $\mathbf{A}_{\mathrm{wv},m}$ and $\mathbf{A}_{t,\mathrm{wv},m}$ into the matrices $\mathbf{U}$, $\mathbf{D}$ and $\mathbf{V}$ that decompose the kernel matrix through

$$\mathbf{A} = \mathbf{U}\mathbf{D}\mathbf{V}^{T}. \tag{23}$$

The length of the singular value vector (diagonal entries in D) is called rank. The actual compression is achieved by cutting off the lowest singular values in D and thereby reducing the rank. Consequently, the number of singular vectors (columns of U and V) is also reduced. The optimal limit of the singular values for balancing the compression error against the effective storage reduction is discussed in Weber (2019). Based on that, we neglect singular values that are less than 0.1 % of the maximum singular value in D.
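The truncation described above can be sketched with NumPy's singular value decomposition. The function names and the synthetic test matrix are assumptions for illustration; the sketch only demonstrates the 0.1 % cut-off rule and does not reproduce the storage layout of the output files.

```python
import numpy as np

def compress_kernel(a, rel_threshold=1e-3):
    """Decompose a = U D V^T and drop singular values below 0.1 % of the largest one."""
    u, d, vt = np.linalg.svd(a, full_matrices=False)
    keep = d >= rel_threshold * d[0]  # singular values are returned in descending order
    return u[:, keep], d[keep], vt[keep, :]

def reconstruct_kernel(u, d, vt):
    """Rebuild the (approximate) kernel matrix from the truncated factors."""
    return (u * d) @ vt

# Synthetic 6x6 matrix with known singular values (made up for illustration).
rng = np.random.default_rng(0)
q1, _ = np.linalg.qr(rng.standard_normal((6, 6)))
q2, _ = np.linalg.qr(rng.standard_normal((6, 6)))
singular_values = np.array([1.0, 0.5, 0.1, 1e-4, 1e-5, 1e-6])
a = (q1 * singular_values) @ q2.T

u_c, d_c, vt_c = compress_kernel(a)
a_rec = reconstruct_kernel(u_c, d_c, vt_c)
compression_error = np.linalg.norm(a - a_rec, 2)  # equals the largest dropped singular value
```

In this example three of six singular values are kept, so only three columns of U and three rows of V^T need to be stored, and the reconstruction error in the spectral norm is of the order of the largest neglected singular value.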

2.4 The final MUSICA IASI {H2O, δD} pair product

After performing the aforementioned post-processing and filtering, we obtain the final {H2O, δD} pair product, as shown in Fig. 6 for a full Metop orbit. By performing the sensitivity optimization, we observe a substantial increase in variability in the {H2O, δD} pairs at 4.2 km a.s.l. for dry regions. For instance, over polar areas the weaker constraints allow larger deviations from the corresponding a priori values, such that lower values in H2O and δD can be observed. This is analogous to the increase in the measurement response that is most pronounced for dry conditions (see Figs. 2 and 4). As the measurement response is considered during the quality filtering for reliable {H2O, δD} pairs (see Table 1), its increase yields a higher number of observations passing the recommended data filter (see data amount in Fig. 6).

Figure 6. Quality-filtered {H2O, δD} pairs (according to Table 1) for the original (with diagonal constraint) and the improved (without diagonal constraint) products at 4.2 km a.s.l. along the Metop-A orbit 55524 during boreal summer (orbit also shown in Figs. 4 and 5). The upper (lower) row shows the scatter for data of the Northern (Southern) Hemisphere, colour-coded with the corresponding latitude values. The grey scatter shows the a priori values of the individual observations at the nominal altitude. The value of N indicates the respective number of plotted data points.


In summary, the MUSICA IASI {H2O, δD} pair post-processing provides an optimal estimation {H2O, δD} pair product in the troposphere with a substantial increase in sensitivity for dry conditions. Together with the recommended quality flags indicating observations with meaningful averaging kernels and low errors for δD, this is the main Level-2 product provided freely to the scientific community.

2.5 Generation of a re-gridded {H2O, δD} pair product (Level-3 product)

For reasons of traceability and data re-usage, the output files produced by the MUSICA IASI {H2O, δD} pair post-processing include arrays for reconstructing different retrieval metrics, such as the averaging kernels and uncertainty covariances. Consequently, the corresponding files have high computational requirements with respect to storing and processing. Therefore, we generated an additional Level-3 dataset, the purpose of which is to allow for a simplified and less computationally intensive application of the MUSICA IASI {H2O, δD} pairs.

As the MUSICA {H2O, δD} pair product typically has the highest sensitivity in the mid-troposphere (between 2.9 and 6.4 km a.s.l.), the Level-3 dataset comprises all quality-filtered {H2O, δD} pairs (according to Table 1) for the fixed altitude levels at 2.9, 4.2 and 6.4 km (analogous to Fig. 2), re-gridded on a regular 1° × 1° grid. The re-gridding is achieved by linearly averaging all data of H2O and HDO (derived from δD) within the individual grid boxes and by calculating a posteriori an averaged δD from the averaged H2O and HDO data (according to Eq. 1). In the case of the total errors of H2O and δD, the averaging of their distributions on the regular grid requires taking into account the nature of the errors, i.e. the relative contributions of systematic and random error components. This is crucial, because averaging over systematic error components will balance around a constant systematic bias, whereas the random errors will get smaller the more data points are used for averaging. Here, we follow the simple assumption that errors due to measurement noise and temperature consist of 50 % systematic and 50 % random error components. We accordingly convolute all measurement noise and temperature error values within the individual grid boxes and afterwards form the total H2O and δD errors for each grid box. Furthermore, we provide a metric indicating the representativeness of the averaged H2O and δD. It is the rms of the differences of the individual H2O and δD values within the grid boxes to their respective averages. For H2O, the respective calculations are made on the logarithmic scale. As this metric is a measure for how scattered the individual data are within a single grid box, it indicates the data range for which the averaged value of a single grid box is representative.
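The averaging of H2O and HDO within one grid box, followed by the a posteriori calculation of δD, can be sketched as follows. This is a simplified illustration under several assumptions: the common definition δD = 1000 ‰ × ((HDO/H2O)/R_VSMOW − 1) with R_VSMOW = 3.1152 × 10⁻⁴ is used in place of Eq. (1), the function names are hypothetical, the logarithmic rms for H2O is taken relative to the logarithm of the linear mean, and the treatment of systematic and random error components is not covered.

```python
import numpy as np

R_VSMOW = 3.1152e-4  # assumed HDO/H2O standard ratio (Vienna Standard Mean Ocean Water)

def hdo_from_delta_d(h2o, delta_d):
    """Convert H2O and deltaD (permil) into HDO in the units of H2O."""
    return h2o * R_VSMOW * (1.0 + delta_d / 1000.0)

def delta_d_from_ratio(h2o, hdo):
    """Convert H2O and HDO back into deltaD (permil)."""
    return 1000.0 * (hdo / h2o / R_VSMOW - 1.0)

def grid_box_average(h2o, delta_d):
    """Average all pairs of one grid box and return means plus rms representativeness."""
    hdo = hdo_from_delta_d(h2o, delta_d)
    h2o_mean = h2o.mean()                                # linear average of H2O
    dd_mean = delta_d_from_ratio(h2o_mean, hdo.mean())   # deltaD from averaged H2O and HDO
    rms_ln_h2o = np.sqrt(np.mean((np.log(h2o) - np.log(h2o_mean)) ** 2))
    rms_dd = np.sqrt(np.mean((delta_d - dd_mean) ** 2))
    return h2o_mean, dd_mean, rms_ln_h2o, rms_dd

# Two made-up observations inside one grid box (H2O in ppmv, deltaD in permil).
h2o = np.array([1000.0, 3000.0])
delta_d = np.array([-300.0, -100.0])
h2o_mean, dd_mean, rms_ln_h2o, rms_dd = grid_box_average(h2o, delta_d)
```

For these two toy observations the averaged δD is −150 ‰, whereas the arithmetic mean of the individual δD values would be −200 ‰. The difference arises because averaging HDO and H2O separately weights each δD value by its H2O content.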

3 Output volume of the full MUSICA IASI {H2O, δD} pair datasets

By using the retrieval output from Schneider et al. (2021b), we performed the proposed water isotopologue post-processing for all MUSICA IASI results between October 2014 and December 2020. With on average 350 000 cloud-free observations per sensor (Metop-A and Metop-B) and per day, this results in around 1500 million observations processed for mid-tropospheric {H2O, δD} pair information.
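The order of magnitude of the processed data volume can be checked with a short calculation. The exact start and end dates (1 October 2014 and 31 December 2020) and the assumption that both sensors deliver the average number of observations on every day are simplifications made here for illustration.

```python
from datetime import date

# Assumed processing period: 1 October 2014 to 31 December 2020, both days included.
n_days = (date(2020, 12, 31) - date(2014, 10, 1)).days + 1

obs_per_sensor_per_day = 350_000  # average number of cloud-free observations (from the text)
n_sensors = 2                     # Metop-A and Metop-B

total_obs = n_days * n_sensors * obs_per_sensor_per_day
total_obs_millions = total_obs / 1e6
```

Under these assumptions the estimate is roughly 1600 million observations, which agrees in magnitude with the stated figure of around 1500 million; days with missing data for either sensor would lower the estimate.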

According to the local overpass times of the Metop satellites, we split the orbits into morning (∼ 09:30 LT) and evening (∼ 21:30 LT) overpasses and concatenate the respective observations for all overpasses within a single day into an individual global NetCDF4 file. That is, two files emerge per day, each with a size of around 4–5 GB, resulting in about 1.7 TB yr−1. The full output volume is approximately 10.5 TB. This represents the Level-2 dataset of the MUSICA IASI {H2O, δD} pairs. The Level-3 product described in Sect. 2.5 is generated for all files of the Level-2 dataset, with an approximate individual file size of 4 MB and a full output volume of 22 GB. The metadata of the Level-2 and Level-3 output NetCDF4 files are in agreement with the CF metadata conventions. Information about how to access the full dataset is given in the data availability section.

Additionally, we provide a technical documentation for the output files of the Level-2 and Level-3 datasets, which guides the use and application of the MUSICA IASI {H2O, δD} pair data. This document is provided as the Supplement to this paper.

4 Data coverage and quality

The following section gives an impression of the spatial and temporal representativeness of the optimal estimation {H2O, δD} pair data. If not otherwise specified, the Level-2 dataset of the MUSICA IASI {H2O, δD} pairs is used for the analyses and the generation of the figures throughout the following section.

4.1 Degree of freedom for signal

Figure 7 shows the DOFS values of δD as monthly means for February and August 2018, for morning and evening observations, respectively. To consider the full quality range of the {H2O, δD} pair results, we here investigate the δD distributions filtered only for cloud-free scenes and acceptable retrieval fit quality, but not for the {H2O, δD} pair quality (only filtered for musica_fit_quality_flag 2 and eumetsat_cloud_summary_flag 2).

Figure 7. Monthly averages for February and August 2018 for the DOFS of the {H2O, δD} pair product without {H2O, δD} pair quality filtering, evaluated on a 1° × 1° grid.

Maximum values are around 2 and are found over the tropics (persistently throughout the entire year) and the sub-tropics (during summer), indicating the capability of independently resolving signals in the lower and middle free troposphere (as indicated by the averaging kernels in Fig. 2). The DOFS minimum is located over the polar regions during winter, as these regions are typically very dry and cold.

Over oceans, the DOFS distribution roughly reflects the dominant sea-surface temperature patterns. For instance, the warm surface currents in the west Atlantic and west Pacific correlate with an increased sensitivity of the {H2O, δD} pair retrievals.

While the large-scale DOFS patterns show a strong inter-annual variability for all regions except the tropics, their diurnal variations are rather small. Instead, the latter becomes more pronounced for small-scale regional structures. In particular for land observations, thermal effects lead to a sensitivity maximum for morning times (Clerbaux et al., 2009), e.g. for Australia during February and for Europe and North America during August. Conversely, for the Sahara we observe an inverted effect, i.e. an increase in DOFS from morning to evening. As a next step, we will consider data that have been additionally filtered for high sensitivity and low uncertainty in the {H2O, δD} pair product (see Table 1).

4.2 Vertical distribution of data coverage

As discussed in Sect. 2.3, the MUSICA IASI water vapour retrieval is mainly sensitive to water vapour in the free troposphere. Figure 8 shows that this is clearly reflected in the vertical distribution of available {H2O, δD} pairs after applying the full recommended filters according to Table 1. Here, the number of globally available observations per day and per morning and evening maps is averaged for February and August 2018 and is shown for each retrieval grid level between the surface and 9 km. The best data availability arises between 2 and 7 km a.s.l. On average, remarkably more observations are available during boreal summer (at maximum over 400 000 data pairs per day) than during austral summer (maximum 300 000 data pairs). In contrast, the diurnal variations are again rather small on the global scale. Only for altitudes below 3.5 km a.s.l. do we observe a slight decrease in data availability during evening. This might be due to thermal heating that develops during the day and leads to an upward transport of low-level moisture, resulting in an upward shift of the retrieval sensitivity. Such effects are stronger over land masses and during summer and probably lead to a larger morning-to-evening difference during boreal summer, as there are more land masses in the Northern Hemisphere than in the Southern Hemisphere.

Figure 8. Averaged number of quality-filtered {H2O, δD} pairs for the tropospheric retrieval grid altitudes during February and August 2018. The black line indicates the global means that are further divided into the means for morning (violet) and evening (pink) overpasses.


4.3 Horizontal distribution of data coverage

In this section we discuss the horizontal data coverage of {H2O, δD} pairs for different altitude regions after applying the respective quality filtering according to Table 1. To identify those retrieval results that provide vertical profile information, we check for observations that fulfil the respective filter criteria simultaneously for two distinct altitudes.

Figure 9. Monthly statistics for the horizontal availability of MUSICA IASI {H2O, δD} pair data for February 2018 (first and second rows) and for August 2018 (third and fourth rows). Data are filtered according to Table 1. The first row for each month shows the averaged number of available observations per 1° × 1° grid box and per day, and the second row gives the frequency of days with at least one reliable observation inside a single 1° × 1° grid box. Shown are the results for observations at 4.2 km a.s.l. (above sea level) in the first column and for observations fulfilling the quality filter conditions simultaneously at 2.9 and 6.4 km a.s.l. (second column) and simultaneously at 1–1.5 and 4–5 km a.g.l. (above ground level; third column). For the latter, if more than one grid level falls inside the given altitude range, then the lower one is chosen.

Table 2. Averaged fractions of available MUSICA IASI {H2O, δD} pair data after applying the quality filter according to Table 1, compared to the full (i.e. unfiltered) cloud-free IASI observations. The results are shown for 4.2 km a.s.l. (above sea level), for observations where the filter conditions are fulfilled simultaneously at 2.9 and 6.4 km a.s.l. and at 1–1.5 and 4–5 km a.g.l. (above ground level), respectively. For the latter, if more than one grid level falls inside the given altitude range, then the lower one is chosen.

Figure 9 shows the mean horizontal coverage of quality-filtered {H2O, δD} pairs for different altitude regions during February and August 2018; the corresponding IASI observations are evaluated on a 1° × 1° grid. The averaged number of daily available {H2O, δD} pairs and the fraction of days with at least one measurement are illustrated for each grid box. Additionally, Table 2 provides the total fractions of available {H2O, δD} pairs for each altitude region compared to all cloud-free IASI observations.
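The two coverage statistics per grid box (averaged number of observations per day and fraction of days with at least one observation) can be computed from quality-filtered observation coordinates as sketched below. The function name, the input layout and the four toy observations are assumptions for illustration and do not reflect the structure of the output files.

```python
import numpy as np

def coverage_statistics(lat, lon, day_index, n_days):
    """Return the mean daily count and the day frequency on a 1 deg x 1 deg grid."""
    lat_edges = np.linspace(-90.0, 90.0, 181)
    lon_edges = np.linspace(-180.0, 180.0, 361)
    counts = np.zeros((n_days, 180, 360))
    for d in range(n_days):
        sel = day_index == d
        # Count the observations of day d that fall into each grid box.
        counts[d], _, _ = np.histogram2d(lat[sel], lon[sel],
                                         bins=[lat_edges, lon_edges])
    mean_obs_per_day = counts.mean(axis=0)     # averaged number of observations per day
    day_frequency = (counts > 0).mean(axis=0)  # fraction of days with at least one observation
    return mean_obs_per_day, day_frequency

# Four made-up, already quality-filtered observations on two days.
lat = np.array([10.5, 10.5, 10.5, -45.2])
lon = np.array([20.5, 20.5, 20.5, 100.3])
day_index = np.array([0, 0, 1, 1])

mean_obs, freq = coverage_statistics(lat, lon, day_index, n_days=2)
```

The grid box containing 10.5° N, 20.5° E holds observations on both days (frequency 1.0, on average 1.5 observations per day), whereas the box containing 45.2° S, 100.3° E is covered on one of the two days only.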

At 4.2 km a.s.l., up to 59 % of all cloud-free IASI observations provide reliable {H2O, δD} pair data, with the best horizontal coverage over tropical and subtropical summer regions. Here, up to 35 observations are available per day and grid box, and over wide areas there is a 100 % frequency of 1° × 1° grid boxes with at least one reliable observation, especially in the tropics and the summertime sub-tropics. But also for high northern latitudes, where typically cold and dry conditions prevail, a satisfactory data availability is apparent. Furthermore, for about 22 %–30 % of the cloud-free observations the quality filter conditions are simultaneously fulfilled at 2.9 and 6.4 km a.s.l. We observe similar spatial patterns with lower values and less temporal coverage, when compared to 4.2 km a.s.l. Even though the data coverage decreases significantly for areas with profile information at even lower altitudes (the quality filter conditions are simultaneously fulfilled at 1–1.5 and 4–5 km a.g.l. only for about 10 %–17 % of the cloud-free observations), interesting features emerge. The maximum availability of about 10 observations per grid box and per day shifts towards higher latitudes, such that over the tropics there are almost no data.

In this analysis we jointly investigated the morning and evening observations. As can be deduced from Figs. 7 and 8, the morning and evening distributions differ only a little. For instance, Table 2 includes the fractions of available data after filtering according to Table 1 for the altitude regions from Fig. 9. The values do not differ significantly for the mid-troposphere during morning and evening times, but they decrease for lower altitudes during the evening overpasses (analogous to Fig. 8).

Figure 10. Horizontal distributions of H2O and δD from the optimal estimation pair product, after filtering according to Table 1. Data are shown for 4.2 km a.s.l. and for 1 February and 1 August 2018 (including both morning and evening observations). The range of the colour bars is adjusted to Fig. 15 of Schneider et al. (2021b).

To convey an impression of the actual horizontal data distribution of the {H2O, δD} pair product, Fig. 10 depicts all data of H2O and δD at 4.2 km a.s.l. for two days (1 February and 1 August 2018). The horizontal patterns of available data are in agreement with Fig. 9. Both H2O and δD show the highest values over tropical regions and decrease towards the polar areas. However, differences in their zonal distribution become apparent. For instance, while H2O and δD show consistently high values over northern Africa, large discrepancies appear at similar latitudes over the Pacific (high H2O combined with decreased δD). Section 5 will give further insights into such relations between H2O and δD.

4.4 Horizontal distribution of data uncertainty

Figure 11 shows the horizontal distributions of the total errors of H2O and δD at 4.2 km a.s.l., exemplarily for 1 February and 1 August 2018. Overall, an anti-correlation to the DOFS distributions in Fig. 7 may be identified. The lowest errors are found for warm and moist tropical and sub-tropical sites during summer, where the DOFS is maximum. Here, the minimum error values lie around 5 % and 10 ‰ for H2O and δD, respectively. With decreased sensitivity during winter and for higher latitudes, we observe an increase in the total errors, in particular over land areas. The errors can reach values up to ∼ 12 % and ∼ 30 ‰ for H2O and δD but are still in the range of uncertainty from other comparable remotely sensed products (Worden et al., 2006, 2019). In contrast to these Level-2 data errors, the averaged H2O and δD errors of the Level-3 data product will be overall lower (not shown), which is a result of the averaging over the assumed random error components (see Sect. 2.5).

Figure 11. Distributions of the total errors of the filtered H2O (a) and δD (b) product at 4.2 km a.s.l., shown for the morning and evening data of 1 February and 1 August 2018. The filtering is performed according to Table 1.

Figure 12. Monthly averages of the rms difference of all data points within the individual 1° × 1° grid boxes to their averaged value, evaluated at 4.2 km a.s.l. for the quality-filtered H2O data on a logarithmic scale (a) and the quality-filtered δD data (b). The filtering is performed according to Table 1.

Furthermore, Fig. 12 provides an overview of the representativeness of the H2O and δD data being averaged for the regular 1° × 1° grid, as done for the Level-3 dataset of the MUSICA IASI {H2O, δD} pairs (see Sect. 2.5). For this purpose, monthly means of the representativeness metrics (rms values; see Sect. 2.5) are shown for February and August 2018. The lowest rms values appear for both H2O and δD over oceans, meaning that these regions exhibit rather homogeneous and compact distributions in H2O and δD within the individual grid boxes. In contrast, the highest rms values arise for coastal regions in the subtropics, where due to the land–sea contrast the largest spread of H2O and δD values within the individual grid boxes develops.

5 Data example: tropical Atlantic and Sahel

To convey an impression of the amount and scientific potential of the optimal estimation MUSICA IASI {H2O, δD} pair product, we discuss results for two illustrative regions of interest, namely the tropical Atlantic (13–17° N, 46–30° W) and the Sahel in West Africa (13–17° N, 8° W–8° E).

Figure 13. MUSICA IASI {H2O, δD} pair data for 4.2 km a.s.l. over the tropical Atlantic (13–17° N, 46–30° W) and the Sahel in West Africa (13–17° N, 8° W–8° E), for the full MUSICA IASI period. Panels (a) and (c) show the time series of H2O and δD. Panels (b) and (d) show the respective probability density functions of the two-dimensional {H2O, δD} distributions, indicating the location of the main 10 % and 90 % scatter points. These contours are drawn for the data of February, May, August and November, summarized for all respective years.


Figure 13 shows the time series of the respective MUSICA IASI data for H2O and δD at 4.2 km a.s.l. that have passed the full recommended filtering (according to Table 1) for the period from October 2014 to December 2020. As discussed in Sect. 2.3.2, the harmonized retrieval results for H2O and δD offer almost the same averaging kernels, thereby allowing for a meaningful interpretation of paired {H2O, δD} distributions. Based on that, Fig. 13 also summarizes the mean monthly evolution (represented by February, May, August and November) of the {H2O, δD} pair distribution over the tropical Atlantic and the Sahel. The data are illustrated with normalized two-dimensional histogram contours comprising the main 10 % and 90 % of the scatter points (the calculation is described in the appendix of Eckstein et al., 2018).
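One generic way to obtain histogram levels whose contours enclose a given fraction of the scatter points is sketched below. It is not necessarily the procedure of Eckstein et al. (2018); the function name, the bin number and the synthetic input data are assumptions for illustration.

```python
import numpy as np

def contour_levels(x, y, fractions=(0.1, 0.9), bins=50):
    """Return a normalized 2-D histogram and the levels enclosing the given fractions."""
    hist, x_edges, y_edges = np.histogram2d(x, y, bins=bins)
    hist = hist / hist.sum()                     # normalize to a total of 1
    sorted_vals = np.sort(hist.ravel())[::-1]    # bin weights, densest bins first
    cumulative = np.cumsum(sorted_vals)
    levels = []
    for f in fractions:
        # First position at which the densest bins together hold at least a fraction f.
        idx = min(np.searchsorted(cumulative, f), sorted_vals.size - 1)
        levels.append(sorted_vals[idx])
    return hist, x_edges, y_edges, levels

# Synthetic, correlated stand-ins for ln(H2O) and deltaD (not MUSICA IASI data).
rng = np.random.default_rng(1)
x = rng.normal(size=5000)
y = 0.5 * x + rng.normal(scale=0.5, size=5000)

hist, x_edges, y_edges, levels = contour_levels(x, y)
enclosed = [hist[hist >= level].sum() for level in levels]
```

The returned levels can be passed to a contour plotting routine together with the histogram. The level for the 10 % contour is higher than the one for the 90 % contour, because the former only encloses the densest part of the distribution.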

Over the tropical Atlantic, both H2O and δD exhibit a similar annual cycle, even though it is weaker for δD. This can also be observed in the corresponding {H2O, δD} pairs, where the August data are on average moister and more enriched in δD than the February data. Despite some shifting and tilting, the overall contour shape does not change to first order from February to August.

In contrast, over the Sahel signs of an annual anti-correlation between H2O and δD appear. Again, during February there is a minimum of H2O and δD, even though it is slightly moister than over the tropical Atlantic. During boreal summer, the variability of H2O and δD decreases significantly, while the respective contours shift to higher H2O. However, this moistening is associated with a strong decrease in the maximum values in δD, leading to a remarkable tilting of the August contour, when compared to the February contour.

These regional differences highlight the benefit of adding information about δD when studying atmospheric moisture, because different moisture processes leave a different impact on the shape and position of {H2O, δD} pair distributions. In the example of Fig. 13, we observe that the tropical Atlantic and the Sahel reveal significantly different structures in δD, while their H2O distributions show clear and similar annual cycles. Therefore, this feature makes clear that the tropospheric moisture over the two tropical regions is governed by structurally different processes. As δD is mainly affected during phase changes of water vapour, we infer that its observed anti-correlation to H2O may be an effect of tropical convection that is exceptionally strong over the Sahel during the summertime monsoon period. Related dynamical changes in the contributing wind regimes might pose further contributing factors for changes in the {H2O, δD} phase space.

However, in order to robustly attribute such {H2O, δD} pair signals to underlying moisture processes, supplementary measurements and model analyses are required. As previous studies stated (e.g. Worden et al., 2007; Noone, 2012; Dyroff et al., 2015; González et al., 2016; Schneider et al., 2017; Christner et al., 2018; Eckstein et al., 2018; Lacour et al., 2018; Dahinden et al., 2021; Diekmann et al., 2021c), such an analysis is then capable of providing a deeper understanding of atmospheric moisture pathways and will therefore be part of future MUSICA IASI studies.

6 Data availability

The Level-2 dataset of the MUSICA IASI {H2O, δD} pair product is referenced with the DOI (Diekmann et al., 2021a). In its description, this DOI refers to the data available from October 2014 to June 2019, because only this period was available at the time of the DOI assignment. However, this dataset could be extended to additionally include all data until December 2020. The Level-3 dataset of the MUSICA IASI {H2O, δD} pairs is referenced with the DOI (Diekmann et al., 2021b). The full Level-2 and Level-3 datasets are freely available via the web portal (last access: 9 November 2021).

7 Summary

We present an extension of the MUSICA IASI retrieval that aims at creating an optimized water isotopologue pair product for the free troposphere. The retrieval processor from Schneider et al. (2021b) is an update of the version that was developed and validated against reference measurements during the MUSICA project (Schneider et al., 2016). The presented post-processing step exploits their retrieval results and generates an optimal estimation {H2O, δD} pair product by harmonizing the averaging kernels of H2O and δD, as proposed by Schneider et al. (2012). We introduce a further optimization step by a posteriori reducing the strength of the underlying regularization. This increases the sensitivity of the {H2O, δD} pair retrieval product, especially for dry conditions, and enhances the vertical profile information between the boundary layer and the free troposphere. As a trade-off, the retrieval noise increases, but it remains within a reasonable range (∼ 12 % for H2O and ∼ 30 ‰ for δD). For user-friendly data handling, we derive supplementary filter flags that perform a height-dependent data selection based on the quality of the {H2O, δD} pair results. An additional technical user guide attached as the Supplement aims to support and facilitate working with the {H2O, δD} pair data.

We applied this post-processing to the MUSICA IASI full retrieval product and created a novel space-borne dataset of tropospheric {H2O, δD} pair data. It consists of two global maps per day for all cloud-free IASI observations between October 2014 and December 2020. On a global average, the main vertical sensitivity lies between 2 and 7 km. It features the best horizontal representativeness in terms of data quality and coverage for tropical and summertime sub-tropical regions. Despite a negative Equator-to-pole gradient in the horizontal representativeness, there is still a satisfactory amount of reliable {H2O, δD} pair data in higher latitudes, ranging up to polar regions during summer. In addition to this comprehensive Level-2 dataset of MUSICA IASI {H2O, δD} pairs, a reduced Level-3 dataset is also provided, which consists of mid-tropospheric {H2O, δD} pairs re-gridded on a regular 1° × 1° grid.

Due to the unprecedented combination of high coverage and resolution in space and time, the MUSICA IASI {H2O, δD} pair datasets are highly promising for studying atmospheric moisture pathways. They enable analyses across different scales, from annually to daily and from globally to locally, and are therefore appealing to a wide range of scientific applications. For further encouraging the use of these data, the full Level-2 dataset is made freely available to the scientific community under the DOI (Diekmann et al., 2021a) and the Level-3 dataset under the DOI (Diekmann et al., 2021b).


The supplement related to this article is available online at:

Author contributions

FH developed the radiative transfer model PROFFIT-NADIR. BE and MS optimized the MUSICA IASI retrieval. BE, MS, ES and OG performed the retrieval calculations. MS, CJD and FK developed the water isotopologue post-processing. CJD performed the water isotopologue post-processing and created the data statistics. CJD wrote major parts of the manuscript. PB and PK supervised the PhD of CJD. All authors contributed to the discussion of the paper.

Competing interests

The authors declare that they have no conflict of interest.


Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


Acknowledgements

The dataset published here was generated using the supercomputer ForHLR, which is funded by the Ministry of Science, Research and the Arts Baden-Württemberg and by the German Federal Ministry of Education and Research. Further, the authors wish to acknowledge the contribution of the Teide High-Performance Computing facilities, which were used to perform parts of the MUSICA IASI retrievals. Teide-HPC facilities are provided by the Instituto Tecnológico y de Energías Renovables (ITER, S.A.).

Financial support

This research has been supported by the Deutsche Forschungsgemeinschaft (grant no. 290612604, project MOTIV and grant no. 416767181, project TEDDY), the European Research Council, FP7 Ideas: European Research Council (MUSICA, grant no. 256961), the Bundesministerium für Bildung und Forschung (ForHLR supercomputer), the Ministerium für Wissenschaft, Forschung und Kunst Baden-Württemberg (ForHLR supercomputer), and the Ministerio de Economía y Competitividad (grant no. CGL2016-80688-P, project INMENSE).

Review statement

This paper was edited by Nellie Elguindi and reviewed by Camille Risi and one anonymous referee.


Backus, G. and Gilbert, F.: Uniqueness in the inversion of inaccurate gross Earth data, Philos. T. R. Soc. S.-A, 266, 123–192, 1970.

Baron, P., Ricaud, P., De la Noë, J., Eriksson, J. E., Merino, F., Ridal, M., and Murtagh, D. P.: Studies for the Odin sub-millimetre radiometer. II. Retrieval methodology, Can. J. Phys., 80, 341–356, 2002.

Barthlott, S., Schneider, M., Hase, F., Blumenstock, T., Kiel, M., Dubravica, D., García, O. E., Sepúlveda, E., Mengistu Tsidu, G., Takele Kenea, S., Grutter, M., Plaza-Medina, E. F., Stremme, W., Strong, K., Weaver, D., Palm, M., Warneke, T., Notholt, J., Mahieu, E., Servais, C., Jones, N., Griffith, D. W. T., Smale, D., and Robinson, J.: Tropospheric water vapour isotopologue data (H216O, H218O, and HD16O) as obtained from NDACC/FTIR solar absorption spectra, Earth Syst. Sci. Data, 9, 15–29, 2017.

Boesch, H., Deutscher, N. M., Warneke, T., Byckling, K., Cogan, A. J., Griffith, D. W. T., Notholt, J., Parker, R. J., and Wang, Z.: HDO/H2O ratio retrievals from GOSAT, Atmos. Meas. Tech., 6, 599–612, 2013.

Borger, C., Schneider, M., Ertl, B., Hase, F., García, O. E., Sommer, M., Höpfner, M., Tjemkes, S. A., and Calbet, X.: Evaluation of MUSICA IASI tropospheric water vapour profiles using theoretical error assessments and comparisons to GRUAN Vaisala RS92 measurements, Atmos. Meas. Tech., 11, 4981–5006, 2018.

Christner, E., Aemisegger, F., Pfahl, S., Werner, M., Cauquoin, A., Schneider, M., Hase, F., Barthlott, S., and Schädler, G.: The Climatological Impacts of Continental Surface Evaporation, Rainout, and Subcloud Processes on δD of Water Vapor and Precipitation in Europe, J. Geophys. Res.-Atmos., 123, 4390–4409, 2018.

Clerbaux, C., Boynard, A., Clarisse, L., George, M., Hadji-Lazaro, J., Herbin, H., Hurtmans, D., Pommier, M., Razavi, A., Turquety, S., Wespes, C., and Coheur, P.-F.: Monitoring of atmospheric composition using the thermal infrared IASI/MetOp sounder, Atmos. Chem. Phys., 9, 6041–6054, 2009.

Craig, H.: Standard for reporting concentrations of deuterium and oxygen-18 in natural waters, Science, 133, 1833–1834, 1961.

Dahinden, F., Aemisegger, F., Wernli, H., Schneider, M., Diekmann, C. J., Ertl, B., Knippertz, P., Werner, M., and Pfahl, S.: Disentangling different moisture transport pathways over the eastern subtropical North Atlantic using multi-platform isotope observations and high-resolution numerical modelling, Atmos. Chem. Phys., 21, 16319–16347, 2021.

Diekmann, C. J., Schneider, M., and Ertl, B.: MUSICA IASI water isotopologue pair product (a posteriori processing version 2), Institute of Meteorology and Climate Research, Atmospheric Trace Gases and Remote Sensing (IMK-ASF), Karlsruhe Institute of Technology (KIT) [data set], 2021a.

Diekmann, C. J., Schneider, M., and Ertl, B.: Regular 1° × 1° re-gridded MUSICA IASI water isotopologue pair dataset (a posteriori processing version 2), Institute of Meteorology and Climate Research, Atmospheric Trace Gases and Remote Sensing (IMK-ASF), Karlsruhe Institute of Technology (KIT) [data set], 2021b.

Diekmann, C. J., Schneider, M., Knippertz, P., de Vries, A. J., Pfahl, S., Aemisegger, F., Dahinden, F., Ertl, B., Khosrawi, F., Wernli, H., and Braesicke, P.: A Lagrangian perspective on stable water isotopes during the West African Monsoon, J. Geophys. Res.-Atmos., 126, e2021JD034895, 2021c.

Dyroff, C., Sanati, S., Christner, E., Zahn, A., Balzer, M., Bouquet, H., McManus, J. B., González-Ramos, Y., and Schneider, M.: Airborne in situ vertical profiling of HDO / H216O in the subtropical troposphere during the MUSICA remote sensing validation campaign, Atmos. Meas. Tech., 8, 2037–2049, 2015.

Eckstein, J., Ruhnke, R., Pfahl, S., Christner, E., Diekmann, C., Dyroff, C., Reinert, D., Rieger, D., Schneider, M., Schröter, J., Zahn, A., and Braesicke, P.: From climatological to small-scale applications: simulating water isotopologues with ICON-ART-Iso (version 2.3), Geosci. Model Dev., 11, 5113–5133, 2018.

Eriksson, P.: Analysis and comparison of two linear regularization methods for passive atmospheric observations, J. Geophys. Res., 105, 18157–18167, 2000.

Frankenberg, C., Yoshimura, K., Warneke, T., Aben, I., Butz, A., Deutscher, N., Griffith, D., Hase, F., Notholt, J., Schneider, M., Schrijver, H., and Röckmann, T.: Dynamic processes governing lower-tropospheric HDO/H2O Ratios as Observed from Space and Ground, Science, 325, 1374–1377, 2009.

Frankenberg, C., Wunch, D., Toon, G., Risi, C., Scheepmaker, R., Lee, J.-E., Wennberg, P., and Worden, J.: Water vapor isotopologue retrievals from high-resolution GOSAT shortwave infrared spectra, Atmos. Meas. Tech., 6, 263–274, 2013.

Galewsky, J., Steen-Larsen, H. C., Field, R. D., Worden, J., Risi, C., and Schneider, M.: Stable isotopes in atmospheric water vapor and applications to the hydrologic cycle, Rev. Geophys., 54, 809–865, 2016.

González, Y., Schneider, M., Dyroff, C., Rodríguez, S., Christner, E., García, O. E., Cuevas, E., Bustos, J. J., Ramos, R., Guirado-Fuentes, C., Barthlott, S., Wiegele, A., and Sepúlveda, E.: Detecting moisture transport pathways to the subtropical North Atlantic free troposphere using paired H2O-δD in situ measurements, Atmos. Chem. Phys., 16, 4251–4269, 2016.

Hase, F., Hannigan, J. W., Coffey, M. T., Goldman, A., Höpfner, M., Jones, N. B., Rinsland, C. P., and Wood, S. W.: Intercomparison of retrieval codes used for the analysis of high-resolution, ground-based FTIR measurements, J. Quant. Spectros. Ra., 87, 25–52, 2004.

Keppens, A., Lambert, J.-C., Granville, J., Miles, G., Siddans, R., van Peet, J. C. A., van der A, R. J., Hubert, D., Verhoelst, T., Delcloo, A., Godin-Beekmann, S., Kivi, R., Stübi, R., and Zehner, C.: Round-robin evaluation of nadir ozone profile retrievals: methodology and application to MetOp-A GOME-2, Atmos. Meas. Tech., 8, 2093–2120, 2015.

Lacour, J.-L., Risi, C., Clarisse, L., Bony, S., Hurtmans, D., Clerbaux, C., and Coheur, P.-F.: Mid-tropospheric δD observations from IASI/MetOp at high spatial and temporal resolution, Atmos. Chem. Phys., 12, 10817–10832, 2012.

Lacour, J. L., Risi, C., Worden, J., Clerbaux, C., and Coheur, P. F.: Importance of depth and intensity of convection on the isotopic composition of water vapor as seen from IASI and TES δD observations, Earth Planet. Sc. Lett., 481, 387–394, 2018.

Noone, D.: Pairing measurements of the water vapor isotope ratio with humidity to deduce atmospheric moistening and dehydration in the tropical midtroposphere, J. Climate, 25, 4476–4494, 2012.

Phillips, D. L.: A Technique for the Numerical Solution of Certain Integral Equations of the First Kind, J. ACM, 9, 84–97, 1962.

Purser, R. J. and Huang, H. L.: Estimating effective data density in a satellite retrieval or an objective analysis, J. Appl. Meteorol., 32, 1092–1107, 1993.

Rodgers, C. D.: Inverse Methods for Atmospheric Sounding: Theory and Practice, World Scientific Publishing Co. Pte. Ltd., Singapore, 2, 238, 2000.

Rodgers, C. D. and Connor, B. J.: Intercomparison of remote sounding instruments, J. Geophys. Res.-Atmos., 108, 4116, 2003.

Schneider, A., Borsdorff, T., aan de Brugh, J., Hu, H., and Landgraf, J.: A full-mission data set of H2O and HDO columns from SCIAMACHY 2.3 µm reflectance measurements, Atmos. Meas. Tech., 11, 3339–3350, 2018.

Schneider, A., Borsdorff, T., aan de Brugh, J., Aemisegger, F., Feist, D. G., Kivi, R., Hase, F., Schneider, M., and Landgraf, J.: First data set of H2O/HDO columns from the Tropospheric Monitoring Instrument (TROPOMI), Atmos. Meas. Tech., 13, 85–100, 2020.

Schneider, M. and Hase, F.: Optimal estimation of tropospheric H2O and δD with IASI/METOP, Atmos. Chem. Phys., 11, 11207–11220, 2011.

Schneider, M., Hase, F., and Blumenstock, T.: Ground-based remote sensing of HDO/H2O ratio profiles: introduction and validation of an innovative retrieval approach, Atmos. Chem. Phys., 6, 4705–4722, 2006.

Schneider, M., Barthlott, S., Hase, F., González, Y., Yoshimura, K., García, O. E., Sepúlveda, E., Gomez-Pelaez, A., Gisi, M., Kohlhepp, R., Dohe, S., Blumenstock, T., Wiegele, A., Christner, E., Strong, K., Weaver, D., Palm, M., Deutscher, N. M., Warneke, T., Notholt, J., Lejeune, B., Demoulin, P., Jones, N., Griffith, D. W. T., Smale, D., and Robinson, J.: Ground-based remote sensing of tropospheric water vapour isotopologues within the project MUSICA, Atmos. Meas. Tech., 5, 3007–3027, 2012.

Schneider, M., González, Y., Dyroff, C., Christner, E., Wiegele, A., Barthlott, S., García, O. E., Sepúlveda, E., Hase, F., Andrey, J., Blumenstock, T., Guirado, C., Ramos, R., and Rodríguez, S.: Empirical validation and proof of added value of MUSICA's tropospheric δD remote sensing products, Atmos. Meas. Tech., 8, 483–503, 2015.

Schneider, M., Wiegele, A., Barthlott, S., González, Y., Christner, E., Dyroff, C., García, O. E., Hase, F., Blumenstock, T., Sepúlveda, E., Mengistu Tsidu, G., Takele Kenea, S., Rodríguez, S., and Andrey, J.: Accomplishments of the MUSICA project to provide accurate, long-term, global and high-resolution observations of tropospheric H2O, δD pairs – a review, Atmos. Meas. Tech., 9, 2845–2875, 2016.

Schneider, M., Borger, C., Wiegele, A., Hase, F., García, O. E., Sepúlveda, E., and Werner, M.: MUSICA MetOp/IASI H2O, δD pair retrieval simulations for validating tropospheric moisture pathways in atmospheric models, Atmos. Meas. Tech., 10, 507–525, 2017.

Schneider, M., Ertl, B., Diekmann, C. J., Khosrawi, F., Röhling, A. N., Hase, F., Dubravica, D., García, O. E., Sepúlveda, E., Borsdorff, T., Landgraf, J., Lorente, A., Chen, H., Kivi, R., Laemmel, T., Ramonet, M., Crevoisier, C., Pernin, J., Steinbacher, M., Meinhardt, F., Deutscher, N. M., Griffith, D. W. T., Velazco, V. A., and Pollard, D. F.: Synergetic use of IASI and TROPOMI space borne sensors for generating a tropospheric methane profile product, Atmos. Meas. Tech. Discuss. [preprint], in review, 2021a.

Schneider, M., Ertl, B., Diekmann, C. J., Khosrawi, F., Weber, A., Hase, F., Höpfner, M., García, O. E., Sepúlveda, E., and Kinnison, D.: Design and description of the MUSICA IASI full retrieval product, Earth Syst. Sci. Data Discuss. [preprint], in review, 2021b.

Steck, T.: Methods for determining regularization for atmospheric retrieval problems, Appl. Optics, 41, 1788, 2002.

Tikhonov, A.: On the solution of improperly posed problems and the method of regularization, Dokl. Akad. Nauk SSSR+, 151, 501, 1963.

von Clarmann, T., Degenstein, D. A., Livesey, N. J., Bender, S., Braverman, A., Butz, A., Compernolle, S., Damadeo, R., Dueck, S., Eriksson, P., Funke, B., Johnson, M. C., Kasai, Y., Keppens, A., Kleinert, A., Kramarova, N. A., Laeng, A., Langerock, B., Payne, V. H., Rozanov, A., Sato, T. O., Schneider, M., Sheese, P., Sofieva, V., Stiller, G. P., von Savigny, C., and Zawada, D.: Overview: Estimating and reporting uncertainties in remotely sensed atmospheric composition and temperature, Atmos. Meas. Tech., 13, 4393–4436, 2020.

Weber, A.: Storage-Efficient Analysis of Spatio-Temporal Data with Application to Climate Research, Zenodo, 2019.

Wiegele, A., Schneider, M., Hase, F., Barthlott, S., García, O. E., Sepúlveda, E., González, Y., Blumenstock, T., Raffalski, U., Gisi, M., and Kohlhepp, R.: The MUSICA MetOp/IASI H2O and δD products: characterisation and long-term comparison to NDACC/FTIR data, Atmos. Meas. Tech., 7, 2719–2732, 2014.

Worden, J., Bowman, K., Noone, D., Beer, R., Clough, S., Eldering, A., Fisher, B., Goldman, A., Gunson, M., Herman, R., Kulawik, S. S., Lampel, M., Luo, M., Osterman, G., Rinsland, C., Rodgers, C., Sander, S., Shephard, M., and Worden, H.: Tropospheric Emission Spectrometer observations of the tropospheric HDO/H2O ratio: Estimation approach and characterization, J. Geophys. Res., 111, D16309, 2006.

Worden, J., Noone, D., Bowman, K., Beer, R., Eldering, A., Fisher, B., Gunson, M., Goldman, A., Herman, R., Kulawik, S. S., Lampel, M., Osterman, G., Rinsland, C., Rodgers, C., Sander, S., Shephard, M., Webster, C. R., and Worden, H.: Importance of rain evaporation and continental convection in the tropical water cycle, Nature, 445, 528–532, 2007.

Worden, J., Kulawik, S., Frankenberg, C., Payne, V., Bowman, K., Cady-Pereira, K., Wecht, K., Lee, J.-E., and Noone, D.: Profiles of CH4, HDO, H2O, and N2O with improved lower tropospheric vertical resolution from Aura TES radiances, Atmos. Meas. Tech., 5, 397–411, 2012.

Worden, J. R., Kulawik, S. S., Fu, D., Payne, V. H., Lipton, A. E., Polonsky, I., He, Y., Cady-Pereira, K., Moncet, J.-L., Herman, R. L., Irion, F. W., and Bowman, K. W.: Characterization and evaluation of AIRS-based estimates of the deuterium content of water vapor, Atmos. Meas. Tech., 12, 2331–2339, 2019.

Short summary
The joint analysis of different stable water isotopes in water vapour is a powerful tool for investigating atmospheric moisture pathways. This paper presents a novel global and multi-annual dataset of H2O and HDO in mid-tropospheric water vapour by using data from the satellite sensor Metop/IASI. Due to its unique combination of coverage and resolution in space and time, this dataset is highly promising for studying the hydrological cycle and its representation in weather and climate models.