An updated version of a gap-free monthly mean zonal mean ozone database
- 1Bodeker Scientific, 42 Russell Street, Alexandra, 9320, New Zealand
- 2Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany
- 3Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, CO, USA
- 4NOAA Earth System Research Laboratory, Boulder, CO, USA
- 5School of Earth and Environment, University of Leeds, Leeds, UK
- 6National Centre for Earth Observation, University of Leeds, Leeds, UK
Correspondence: Birgit Hassler (email@example.com)
An updated and improved version of a global, vertically resolved, monthly mean zonal mean ozone database has been calculated – hereafter referred to as the BSVertOzone (Bodeker Scientific Vertical Ozone) database. Like its predecessor, it combines measurements from several satellite-based instruments and ozone profile measurements from the global ozonesonde network. Monthly mean zonal mean ozone concentrations in mixing ratio and number density are provided in 5∘ latitude bins, spanning 70 altitude levels (1 to 70 km), or 70 pressure levels that are approximately 1 km apart (878.4 to 0.046 hPa). Different data sets or “tiers” are provided: Tier 0 is based only on the available measurements and therefore does not completely cover the whole globe or the full vertical range uniformly; the Tier 0.5 monthly mean zonal means are calculated as a filled version of the Tier 0 database where missing monthly mean zonal mean values are estimated from correlations against a total column ozone (TCO) database. The Tier 0.5 data set includes the full range of measurement variability and is created as an intermediate step for the calculation of the Tier 1 data where a least squares regression model is used to attribute variability to various known forcing factors for ozone. Regression model fit coefficients are expanded in Fourier series and Legendre polynomials (to account for seasonality and latitudinal structure, respectively). Four different combinations of contributions from selected regression model basis functions result in four different Tier 1 data sets that can be used for comparisons with chemistry–climate model (CCM) simulations that do not exhibit the same unforced variability as reality (unless they are nudged towards reanalyses). Compared to previous versions of the database, this update includes additional satellite data sources and ozonesonde measurements to extend the database period to 2016. Additional improvements over the previous version of the database include the following: (i) adjustments of measurements to account for biases and drifts between different data sources (using a chemistry-transport model, CTM, simulation as a transfer standard), (ii) a more objective way to determine the optimum number of Fourier and Legendre expansions for the basis function fit coefficients, and (iii) the derivation of methodological and measurement uncertainties on each database value are traced through all data modification steps. Comparisons with the ozone database from SWOOSH (Stratospheric Water and OzOne Satellite Homogenized data set) show good agreement in many regions of the globe. Minor differences are caused by different bias adjustment procedures for the two databases. However, compared to SWOOSH, BSVertOzone additionally covers the troposphere. Version 1.0 of BSVertOzone is publicly available at https://doi.org/http://doi.org/10.5281/zenodo.1217184.
Ozone is a greenhouse gas, and changes in stratospheric ozone concentrations have an effect on surface climate. Ozone changes can result in direct radiative forcing changes (e.g., Forster, 1999), changes in surface UV radiation (e.g., McKenzie et al., 2011), and can alter the natural modes of tropospheric climate variability (e.g., Thompson et al., 2011). An accurate representation of stratospheric ozone in climate models is therefore essential if they are to be used to project realistic future climate behavior (e.g., Nowack et al., 2015). While many climate models calculate stratospheric ozone distributions with a fully coupled stratospheric chemistry scheme, Eyring et al. (2016) reported that not all models participating in phase 6 of the Coupled Model Intercomparison Project (CMIP6) will have that ability and therefore need to prescribe atmospheric ozone concentrations. This requires a long-term, latitudinally and vertically resolved ozone database.
Vertically resolved ozone databases are not only useful for prescribing ozone concentrations in climate models but can also be used for climate model evaluation and development. When chemistry–climate models (CCMs) are run with specified dynamics (e.g., wind and temperature fields from reanalyses), the resulting ozone distributions are as close to the real atmospheric distribution of ozone as CCMs can be expected to simulate them. Comparisons with observation-based, vertically resolved ozone databases can reveal possible model deficiencies in simulating chemical and dynamical processes. Diagnosing the problems occurring in the specified dynamics model simulations can inform a process-oriented validation of a free-running CCM and thereby improves the quality of projections.
Randel and Wu (2007) pioneered the idea of combining ozone measurements from different observation platforms, using statistical methods for gap filling to create a long-term, gap-free global database of monthly mean zonal mean stratospheric ozone values that can be used to evaluate CCM simulations and investigate ozone variability and trends. An updated and extended version of this database was created in support of the CMIP5 simulations (Cionni et al., 2011). The updated database, built by combining CCM simulations and observations, covered not just the historical period (from 1850), but also extended into the future (to 2100). This database was used by CMIP5 climate models that did not simulate their own stratospheric ozone, and it also covered the troposphere since tropospheric ozone also has a radiative effect on the climate system. A first database that covered the troposphere and the stratosphere, based solely on measurements and using statistical methods for data gap filling, was introduced by Bodeker et al. (2013) (Binary Database of Profiles, hereafter BDBP v18.104.22.168) and is the predecessor of the database described here.
In preparation for the World Meteorological Organization/United Nations Environment Programme (WMO/UNEP) Scientific Assessment of Ozone Depletion 2014, the communities involved in making satellite-based and ground-based ozone measurements decided to intensify the preparation of ozone databases that consist of multiple data sources from different platforms for stratospheric ozone variability investigations and trend detection. Several ozone databases were created that combine (merge) measurements from (i) the same type of instruments that were flown on different satellites (Frith et al., 2017; Wild and Long, 2018), (ii) two different satellite instruments (Kyrölä et al., 2013; Bourassa et al., 2014), or (iii) several different satellite instruments (Froidevaux et al., 2015; Davis et al., 2016). All of these databases cover the stratosphere, but only parts of the troposphere, if at all. Systematic comparisons between the databases showed that the applied merging technique plays a role in the representation of realistic long-term stratospheric ozone trends and variability (Tummon et al., 2015), in addition to the choice of data sources. Recognizing these sensitivities, a second generation of most of these databases has already been developed, with reprocessed data sources and improved merging techniques (e.g., Bourassa et al., 2018; Sofieva et al., 2017; Zawada et al., 2018). However, these updated databases still have limited coverage in the troposphere.
When combining measurements from different data sources, providing realistic uncertainty estimates on every value of the final data product (either calculated from the different data sources or estimated using statistical methods) becomes more and more complex. However, realistic estimates of uncertainties on every datum are necessary to be able to estimate resultant uncertainties on ozone trends calculated from those data. This is particularly important when seeking to detect the small but expected signal of ozone recovery due to the reductions in ozone-depleting substances (e.g., Harris et al., 2015). Measurements from different satellite instruments can exhibit offsets and drifts between coinciding measurements or when compared with ground-based measurements (see Hubert et al., 2016). When measurements are adjusted to account for any offsets and drifts, it introduces an additional source of uncertainty which must also be accounted for. Diligent propagation of uncertainties from the individual measurements to the final data product, e.g., a monthly mean zonal mean value, is therefore essential.
Here, BSVertOzone v1.0 (Bodeker Scientific Vertical Ozone, hereafter referred to as BSVertOzone) is described, which is an update and further developed version of the BDBP (Binary Database of Profiles) v22.214.171.124 that is described in Bodeker et al. (2013) and, as in that earlier version, consists of monthly mean zonal mean ozone values between 1 and 70 km (878.4 to 0.046 hPa). The database was assigned a new name and not just a new version number due to the many improvements compared to the earlier version and the fact that the data sources for the database are not available in binary format anymore. The following major improvements over BDBP v126.96.36.199 have been made: (1) where updated versions of the ozone data sources were available, these were used (Sect. 2); (2) data from the Microwave Limb Sounder (MLS) and recent ozonesonde measurements were used as additional data sources (Sect. 2); (3) drifts and biases between the data sources are now quantified and corrected for using a chemistry-transport model (CTM) as a transfer standard (Sect. 3); and (4) measurement uncertainties and uncertainties from other sources (e.g., applied offset and bias corrections) are propagated through to the final monthly mean zonal mean values (Sects. 2.2 and 3). In addition, a method was applied to estimate missing monthly mean zonal mean values before the database was subjected to regression modeling to generate data sets that include the effects of subsets of ozone forcing factors. The pre-filling of the database is described in Sect. 4. This filled data set is our best estimate of the true vertically and latitudinally resolved monthly mean zonal mean ozone field. However, it is not suitable for direct comparison with ozone fields simulated by CCMs since it contains significant unforced variability. Therefore, in much the same way as described in Bodeker et al. (2013), a regression model was applied but, rather than using it to both fill the database and conduct an attribution of the various factors affecting ozone, it is now used only to conduct the attribution (Sect. 4). The fact that the regression model is applied to a pre-filled data set makes the regression model fits more robust than it was the case for the BDBP v188.8.131.52 database. A detailed description and validation of these global gap-free data sets is also presented (Sects. 5 and 6), followed by a conclusion (Sect. 1).
2.1 Observation-based data sources
The original BDBP v184.108.40.206 (Bodeker et al., 2013) was built from measurements obtained from several satellite instruments and from ozone profile measurements made at sites in the global ozonesonde network (Hassler et al., 2008). The satellite instruments were selected first according to their vertical resolution (<3 km), and then for their temporal and spatial coverage. Exceptions to these criteria were only accepted if the measurements provide information in a measurement-sparse region or time period, e.g., the troposphere, the polar regions, or the early 1980s. BDBP v220.127.116.11 therefore included measurements from the Stratospheric Aerosol and Gas Experiment I and II (SAGE I and II), the Halogen Occultation Experiment (HALOE), the Improved Limb Atmospheric Spectrometer (ILAS and ILAS-II), and the Polar Ozone and Aerosol Measurement II and III (POAM II and III). Measurements from these satellite instruments are also included in BSVertOzone. While measurements from the Limb Infrared Monitor of the Stratosphere (LIMS) were included in BDBP v18.104.22.168, they were excluded from BSVertOzone since it was assessed that the vertical resolution of the LIMS measurements did not warrant its inclusion in the new database. The temporal and vertical coverage of the instruments providing data for incorporation into the BSVertOzone database are illustrated in Fig. 1. A more detailed description of these instruments, their measurement techniques, their temporal and spatial coverage, and their measurement uncertainties can be found in Hassler et al. (2014). Except for SAGE II, the data quality screening for other instruments was performed in the same way as described in Hassler et al. (2008). In addition, all SAGE I ozone measurements with uncertainties greater than 300 % were excluded. The SAGE II screening criteria were updated and are described in more detail next.
In late 2012, an improved and updated version of the SAGE II data set was released (version 7.0; Damadeo et al., 2013). This replaced the version 6.2 data set that was used in BDBP v22.214.171.124. Due to the change in data version, the quality screening for SAGE II had to be adjusted. The screening of SAGE II ozone measurements was performed following the suggestions of Wang et al. (1996) together with the modifications outlined in the SAGE II version 7.0 release notes. In addition, all ozone values below 23 km between 1 July 1991 and 1 October 1993 were excluded as those measurements were affected by the Mt. Pinatubo eruption (as it was done in Hassler et al., 2008).
As a final quality check, ozone values from all data sources were used to calculate monthly mean zonal mean climatologies at each level and latitude bin. If individual values in these latitude bins exceed the respective mean by 3σ, they were excluded.
BDBP v126.96.36.199 covered the period 1979 to 2007 since the main satellite data sources for that database, i.e., SAGE II and HALOE, ended in 2005. After 2005, only ozonesonde profiles were included in BDBP v188.8.131.52. To extend BSVertOzone to 2016, it was necessary to add new satellite measurements and additional ozonesonde profiles to the database. To ensure sufficient overlap with SAGE II and HALOE measurements, preference was given to instruments that provide measurements starting in 2005 or earlier and extend to the end of 2016. Although having a somewhat broader vertical resolution than the other data sources used in the BDBP v184.108.40.206 database, the large data quantity and high data quality of the Microwave Limb Sounder makes it an attractive target data source for incorporation into the BSVertOzone database. The usefulness of MLS ozone data has already been shown through its use in several other combined ozone databases, e.g., GOZCARDS (Global OZone Chemistry And Related trace gas Data records for the Stratosphere; Froidevaux et al., 2008) and SWOOSH (Stratospheric Water and OzOne Satellite Homogenized data set; Davis et al., 2016). Given this pedigree, MLS measurements were included in the creation of the BSVertOzone database.
The MLS instrument sits on NASA's Aura satellite, which was launched in mid-July 2004 and remains operational to date (see Fig. 1). MLS measures microwave emission from the limb of the Earth's atmosphere to provide measurements of a multitude of atmospheric trace gases including ozone and temperature. MLS-retrieved ozone profiles cover the region from the upper troposphere to the mesosphere, i.e., from 261 hPa to about 0.02 hPa. The vertical resolution of the ozone profiles ranges from 2.5 to 3 km in the stratosphere but increases to up to 8 km in the mesosphere. MLS provides about 3500 ozone profiles every day, with a near-global coverage from 82∘ S to 82∘ N (Waters et al., 2006). Version 4.2 MLS data were incorporated into the monthly mean zonal means in BSVertOzone. More information about the MLS instrument can be found in Waters et al. (2006), and more information about the MLS ozone validation can be found in Froidevaux et al. (2008) and Hubert et al. (2016).
The screening of the MLS ozone measurements is based on the official MLS v4.2 data description document provided by JPL (http://mls.jpl.nasa.gov/data/v4-2_data_quality_document.pdf, last access: 10 August 2018) and is similar to the ozone screening described in Froidevaux et al. (2008). The uncertainties on each ozone measurement, at each pressure level, which are more detailed than the uncertainty information included in the MLS v4.2 data description document, were provided by the MLS team (Lucien Froidevaux, personal communication, April 2017).
One of the main foci of BSVertOzone is tracing all sources of uncertainty from the individual measurements through to the final monthly mean zonal mean ozone values. The uncertainty estimates that are provided with the individual measurements obtained from each satellite instrument shown in Fig. 1 are used as a starting point for the uncertainty treatment throughout the creation process of the different BSVertOzone data sets (see following sections). These uncertainties normally include, amongst others, calibration errors, spectroscopy uncertainties, and uncertainties introduced by the use of an a priori profile.
Ozonesondes remain the only source of ozone measurements throughout the troposphere (see Fig. 1) and were obtained from four different data archives, viz. the World Ozone and Ultraviolet Radiation Data Centre (WOUDC), the Network for the Detection of Atmospheric Composition Change (NDACC), the Southern Hemisphere Additional Ozonesondes network (SHADOZ), and the National Oceanic and Atmospheric Administration (NOAA). While the goal is that each ozonesonde site provides profiles along with uncertainty estimates (Ozonesonde Data Quality Assessment, O3S-DQA) (Tarasick et al., 2016; Deshler et al., 2017; Witte et al., 2017), most data files obtained for use in the generation of BSVertOzone still did not include uncertainty estimates. We therefore continue to use the uncertainty estimates as described in Hassler et al. (2008). To extend BSVertOzone through to the end of 2016, and to ensure that the most recently available ozonesonde data are used, an updated data set of ozonesonde data was obtained from the mentioned data archives.
2.2 Database grid
The monthly mean zonal means comprising the BSVertOzone database are provided in 5∘ latitude bands on a pressure grid and on a geopotential height grid. Ozone concentrations on these grids are provided both in number density and mixing ratio. To provide the input to those four databases, all individual measurements are converted from their original vertical grid and concentration unit to the vertical grids and concentration units prescribed in the BSVertOzone database. For these conversions, the temperature and pressure at each measurement is required. These ancillary data were available in the data files of the individual data sources. The BSVertOzone levels of the vertical altitude grid range from 1 km to 70 km with a spacing of 1 km. The BSVertOzone levels of the vertical pressure grid range from about 878.4 to 0.046 hPa and levels are approximately 1 km apart (see Hassler et al., 2008, for more detail). Only a few satellite instruments provide data on such a grid and therefore, in most cases, the measurements were interpolated onto the two predefined vertical grids. The uncertainties provided with each measurement are also linearly interpolated onto the predefined vertical grids using log pressure when interpolating between pressure levels. We acknowledge this approach to interpolating the uncertainties underestimates the uncertainties on the interpolated values since we expect uncertainties to maximize between measurements. A rigorous statistical approach to interpolating the uncertainties is by means of Kriging and the use of a one-dimensional variogram. However, this approach is computationally demanding and, given that the variogram changes in time and space, and that the interpolation is generally over less than 1 km, we assessed that the small underestimation in the uncertainties on the interpolated values has little effect on the uncertainties in the calculated monthly mean zonal means, which are largely dominated by spatial and temporal variables in the measurements.
For simplicity, any reference to either a specific geopotential height or pressure level is hereafter referred to as “level”.
Quantifying offsets and drift between different measurement systems can be made far more robust by using an independent data source, especially when temporal and spatial coincidences between the two measurement systems are sparse. If the independent data source has high spatial and temporal sampling and covers the combined range of the two measurement systems to be homogenized, it can be used as a transfer standard. The independent data source does not need to be quantitatively exact but does need to capture the spatial and temporal morphology of the underlying measurements. Output from either a chemistry–climate model that has been nudged towards observed meteorology or output from a chemistry-transport model can meet these requirements. The bias and drift correction applied here is based on a homogenization approach that uses a regression model together with global vertically resolved ozone concentrations as simulated by the chemistry-transport model TOMCAT/SLIMCAT for the period 1980 to 2016 (see Fig. 2).
3.1 Chemistry-transport model (CTM) data
TOMCAT/SLIMCAT (hereafter referred to as SLIMCAT) is an offline three-dimensional (3D) chemistry-transport model (CTM) (Chipperfield, 2006) that includes an interactive stratospheric chemistry scheme and is driven by meteorological fields obtained from ERA-Interim reanalyses (Dee et al., 2011) provided by the European Centre for Medium-Range Weather Forecasts (ECMWF). We use 12-hourly output from a SLIMCAT simulation at a horizontal resolution of 2.8∘ by 2.8∘ and 32 levels extending from the surface to about 60 km (corresponding to a vertical resolution of about 1.5–2 km in the stratosphere). Observed surface mixing ratios of source gases were used as boundary conditions for the SLIMCAT simulation. The known unphysical temporal discontinuities in ERA-Interim upper stratospheric temperatures in August 1998 that arose from changes in the satellite radiance data used in the assimilation (Dhomse et al., 2011) introduced a bias in the ozone concentrations simulated by SLIMCAT in 1998. This bias only occurs at the top five model levels. Therefore, before the CTM data are used in this study, we remove the bias at those levels by first fitting a regression model that includes an offset term (whole time period) and step function (Heaviside Function) after 1998 to the ozone data (Dhomse et al., 2011). We then subtract the contribution of the step basis function from the model data, obtaining the corrected CTM data where the discontinuities have been removed. For more details about the SLIMCAT model see Chipperfield (2006) and Chipperfield et al. (2015, 2017).
Using CTM output as an evaluation and adjustment tool for coarsely distributed global ozone measurements is not a novel idea. In Sofieva et al. (2014) the gap-free ozone fields of a highly temporally and spatially resolved CTM run were used to characterize sampling biases for coarse satellite samplers when their measurements were used for the calculation of monthly mean zonal mean ozone values. Hegglin et al. (2014) used a CCM that was nudged to ERA-Interim reanalysis to correct for offsets and drifts between stratospheric water vapor measurements from multiple satellite instruments. This is very similar to the approach used in our study. While Hegglin et al. (2014) used the CCM output to adjust monthly mean values of the different instruments, here the SLIMCAT output is used to adjust individual measurements with a correction that is based on zonal mean comparisons (see Sect. 3.2).
As shown in several recent studies (Hubert et al., 2016; Tegtmeier et al., 2013), satellite instruments can experience drifts over their lifetime, and coincident measurements of the same constituent from different satellite-based and ground-based instruments can differ. It is necessary to account for such potential differences when combining measurements from different measurement platforms into a single product such as a monthly mean zonal mean. Two such approaches have been used recently within the community to combine measurements from different platforms. In both cases a standard is selected to which the measurements from other sources are adjusted. Preferably, this initial standard is sufficient in its global and temporal coverage to allow robust estimates of biases and drifts of all other measurement sets. Here we follow the quality assessment from Hubert et al. (2016) and the suggested ozone standard from Davis et al. (2016) and, for levels above 15 km (about 118 hPa), chose measurements from SAGE II as the standard. Recognizing the higher data quality of the ozonesonde measurements below 15 km, as each ozonesonde is individually calibrated, ozonesonde data are used at 15 km and below as the standard. Once the standard has been selected, it is then necessary to either (i) find coincident measurements from the standard data set and from the measurement system to be adjusted, as done, for example, in Davis et al. (2016), or (ii) use an independent and gap-free data set that can be used as a transfer standard for adjusting measurements to the standard. The first approach is problematic if a measurement from the chosen standard is not available for every measurement from the data set to be adjusted. For the second approach, reanalysis data, as shown, for example, by Foelsche et al. (2011), or temporally and spatially highly resolved CTM output, as shown, for example, by Toohey et al. (2013) and Sofieva et al. (2014), can be used as a transfer standard, capturing small-scale ozone variability. Here we chose the second approach for generating a homogeneous data set, since measurements from some data sources are limited to a very small geographical region and time period, such that coincidences with the SAGE II measurements, the chosen standard, are sparse. A CTM (SLIMCAT, see Sect. 3.1) simulation was used as the transfer standard.
The homogenization of the satellite-based measurements that contribute to BSVertOzone is a sequential process where each measurement from a selected satellite instrument is adjusted with respect to the standard, hereafter referred to as the inter-satellite bias correction, ISBC (see Fig. 2). After the measurements have been adjusted to the standard, they are incorporated as part of the standard, and the new set of standard measurements is used to determine the adjustment for the next set of target measurements. This process is repeated until all satellite-based and ozonesonde measurements have been homogenized. The order in which the satellites are chosen for ISBC is determined by the satellite that has the largest temporal overlap with the standard. In our case, above 15 km where SAGE II is the standard, measurements from HALOE are first adjusted and merged with the standard as HALOE provides the largest temporal overlap with SAGE II measurements. Below and at 15 km, measurements from SAGE II are first adjusted and merged with ozonesondes. To adjust measurements from a different source to the standard, the following steps are taken at each level individually:
Calculate differences between individual ozone measurements from the standard and the CTM-simulated ozone values.
Calculate an error-weighted, latitude-weighted (based on 1∘ latitude bands), monthly mean zonal mean of those differences. Calculate an error and area weighted monthly mean zonal mean of those differences at 1∘ zonal resolution. The weight of each measurement is defined as cos (latitude)∕sigma2.
Fit a linear regression model to the calculated monthly mean zonal mean differences (hereafter referred to as modeled differences) to obtain an analytical representation of the difference field that can be evaluated at any latitude and time; ( at latitude ϕ, level z, and time t). The regression model comprises an offset and trend term, where the fit coefficients are expanded in Fourier and Legendre polynomials to account for seasonal and latitudinal structure in the monthly mean zonal mean differences, respectively. While the known discontinuities in ozone at the upper levels of the CTM data were corrected to the extent possible, recognizing that some small inconsistencies may remain, it was decided to exclude the trend term from the regression model when modeling the difference fields for the levels above 47 km (where the CTM data were corrected).
Repeat steps 1 to 3 using the measurements from the target new data source that requires bias correction to obtain an analytical representation for its difference field: .
Calculate an adjusted measurement at the location and time of a given satellite measurement using
where θ is the longitude, is the homogenized/adjusted measurement at a location, is the original measurement at that location, and is the applied adjustment.
Incorporate the adjusted measurements into the standard measurements, to create a new homogenized standard that now includes more measurements, and likely a larger spatial and temporal coverage, for the next iteration.
Repeat steps 1 to 6 for all data sources to be included in the BSVertOzone database.
The number of Fourier and Legendre polynomial expansions used to model the monthly mean zonal mean differences depend on the individual differences provided as input to the regression model, as each satellite instrument provides measurements with a different spatial and temporal coverage. The number of Fourier and Legendre polynomial expansions used to model the difference field is made adaptive to avoid overfitting of the model. Four Fourier pairs and eight Legendre polynomials are the default expansions for the offset term while four Legendre polynomials and no Fourier pairs are the default values for the trend term. If the chosen default expansions result in overfitting of the difference fields, which is determined by testing whether the maximum and minimum values of the modeled field do not exceed 150 % of the maximum and minimum value of the original data, a new candidate model is generated by decreasing the degree of one of the expansion (e.g., one scenario would have three instead of four Fourier pairs for the offset term, with the rest of the offset and trend expansions remaining unchanged). The Akaike information criterion (AIC; deLeeuw, 1992; Bozdogan, 1987) was used to provide a relative assessment of the quality of the candidate models. The candidate model which minimizes AIC is selected for use, as it represents the model which minimizes the amount of information lost. The process is repeated until a set of expansions is found that does not result in overfitting of the model.
In the generation of a homogenized data set, in addition to adjusting measurements from different measurement systems to account for bias and drifts, the uncertainties on those measurements also need to be revised since the application of these adjustments introduces additional uncertainty. Following error propagation rules, the uncertainty on the adjusted measurements is given by
where σraw is the uncertainty on the original, unadjusted measurement and (σISBC) reflects the uncertainty on the regression modeled fields (standard-CTM)modeled and (satellite-CTM)modeled, i.e.,
While this re-evaluation of the measurement uncertainties does not include the effects of uncertainties in the CTM output, the effects of CTM uncertainties on the adjustment of the satellite data are minor, since the CTM data are only used as a transfer standard and, as can be seen from Eq. (4), cancel out if the CTM bias is consistent at both measurement locations.
Fit the regression model (as described above) to the monthly mean zonal means of the differences between the measurements of the standard and the CTM data.
Subtract the regression model fit from the difference field to obtain the residuals.
To each residual data point, add a randomly sampled value from a normal distribution with a mean of zero and a standard deviation of the uncertainty on the monthly mean zonal mean differences associated with the selected data point. This step represents the influence of measurement uncertainty on the residuals.
For each monthly mean zonal mean bin, randomly select one modified residual value and add it to the monthly mean zonal mean of the differences. Do this for all bins to generate a new difference data set which, while having the same underlying structure as the original signal, now has different random noise. Then fit the regression model again, resulting in a new modeled difference field.
Repeat steps 2–4 many times (e.g., 200) to generate 200 estimates of the modeled difference field. Calculate the standard deviation of those 200 modeled difference fields to obtain the estimated uncertainty on the modeled difference field .
Such a bootstrap approach encapsulates two sources of uncertainty which are present in the modeled field: the uncertainty from the fact that the chosen model is imperfect and the uncertainty in the measurements. The standard deviation of the difference fields, derived from the ensemble of fields, is the uncertainty on the modeled difference field.
3.3 Calculating the individual monthly mean zonal mean values
To create a homogeneous database, each measurement and its uncertainty, on a specific level, and in a specific latitude band, is adjusted using the ISBC method described above. The uncertainties on the corrections applied are included in the total uncertainty for each individual data point. For the final calculation of monthly mean zonal means values, additional data filtering was applied, similar to the filtering described in Bodeker et al. (2013); e.g., SAGE II data were not used below 10 km (below 242.8 hPa for the pressure grid), and SAGE I data were not used below 18 km (77.44 hPa).
Monthly mean zonal mean ozone values obtained from different data sources pre- and post-homogenization, for example, level and latitude band, are shown in Fig. 3. The ozonesonde measurements are the chosen standard at this level. Different data sources complement one another and the homogenization process reduces the spread between the different data sources, so that they are more consistent with each other. The average adjustment to the standard (e.g., ozonesondes at this level) for each measurement from the different data sources ranges from −14.4 % for MLS to 37.7 % for HALOE at 182 hPa between 25 and 30∘ N (the chosen level and latitude band in Fig. 3); i.e., MLS measurements have a positive bias compared to the standard, while HALOE measurements have a negative bias.
The time series of ozone in Fig. 3 are shown for illustrative purposes only since the method to calculate the monthly mean zonal means does not depend on precalculated monthly mean zonal means for each individual data source, but rather calculates the monthly mean zonal mean values from all available individual measurements at once. All available measurements and their uncertainties σi for each latitude band and level are then used to calculate the error-weighted monthly mean:
where σi_new represents not only the measurement uncertainty, but also the confidence we have that each measurement can be seen as an estimator of the monthly mean, and σi_new is given by
with the expectation value xi_exp being the unweighted monthly mean zonal mean. The uncertainty on the monthly mean zonal mean value is then calculated using
with N being the number of measurements available for the chosen latitude band and level. The uncertainty on the monthly mean zonal mean calculated using Eq. (7) is sensitive to the magnitude of the uncertainties on each measurement but also to the variance in the measurements, which is not the case when using equations for calculating the uncertainty on monthly means as provided in the current literature. As the uncertainties on the monthly mean zonal means take into account how many measurements were available to calculate the mean value, the uncertainty will be larger for mean values where fewer measurements were available compared to mean values based on more measurements. However, having only one measurement per latitude band available is not sufficient to calculate a monthly mean and its uncertainty. As a result, the requirement here is that there are at least six measurements per latitude band and level available to calculate a monthly mean zonal mean. If fewer measurements are available, no monthly mean zonal mean will be calculated.
The calculated monthly mean zonal mean ozone time series, combining all measurements from different data sources, are shown in Fig. 4 as solid orange lines (unadjusted) and blue lines (adjusted) with their corresponding uncertainties. Despite the data gaps, the annual cycle and the interannual variability within the monthly mean zonal mean time series at the level and latitude band are apparent as shown in Figs. 3 and 4. Without a homogenization process and the resulting measurement adjustment towards the standard, the monthly mean zonal mean time series would represent mostly the mean of the data sources that are available at a high spatial and temporal measurement density in a given month and latitude band, introducing a bias into the monthly mean zonal mean time series. The uncertainties on the monthly mean zonal mean ozone values are significantly smaller once MLS measurements are added to the merged data product, as there are thousands of MLS measurements available per day.
The homogeneous database of monthly means zonal means constitutes Tier 0 of BSVertOzone.
To generate a global, gap-free monthly mean zonal mean ozone data set, all values from the Tier 0 database are used, and the missing monthly mean zonal mean values are estimated from correlations against a total column ozone (TCO) database. This Tier 0.5 data set includes the full range of measurement variability and is created as an intermediate step for the calculation of the Tier 1 data where a least squares regression model is used to attribute variability to various known forcing factors for ozone (Sect. 5).
The first step in creating the Tier 0.5 data set is to regress the monthly mean zonal mean ozone at 20 km/58.2 hPa against monthly mean zonal mean total column ozone, i.e.,
where m is the month; ϕ is the latitude; α and β are the two regression model fit coefficients, each expanded in Fourier and Legendre series; and R is the residuals (see Bodeker et al., 2013, for details).
The TCO database used here is described in detail in Bodeker et al. (2018) and covers the period from 31 October 1978 to 31 December 2016 by combining measurements from multiple satellite-based instruments to a single global daily time series of ozone fields. Comparisons of TCO from a subset of satellite-based instruments against TCO measurements from ground-based Dobson and Brewer spectrophotometer networks are used to remove offsets and drifts between satellite-based and ground-based measurements. For more details about the database and correction method see Bodeker et al. (2018). Daily total column ozone fields at 1.25∘ longitude by 1∘ latitude are then used to calculate monthly mean zonal means at 5∘ latitude bands before Eq. (8) is applied to pre-fill the vertically resolved ozone data set.
The regression model fit coefficients in Eq. (8) are expanded in Fourier series to account for seasonality in the correlation of ozone at 20 km/58.2 hPa against total column ozone and in Legendre expansions to account for the latitudinal structure. Uncertainties on the monthly mean zonal mean ozone values at 20 km are passed to the regression model when the fitting is performed. The optimal number of Fourier and Legendre expansions is determined by finding the minimum in a Bayesian information criterion (BIC; Liddle, 2007) for a range of possible Fourier (0..5) and Legendre (0..11) expansion indices. For example, at 20 km this test identifies optimal values of four Fourier expansions and four Legendre expansions. The regression model can then be used to estimate missing ozone values, along with their uncertainties as derived from the uncertainties on the regression model fit coefficients, whenever a measurement is not available. In this way, a filled monthly mean zonal mean ozone field at 20 km/58.2 hPa is created. This field then becomes the predictor (rather than TCO) for the field at 21 km. Once filled fields at 20 and 21 km are available, both are used as predictors for ozone at 22 km; i.e., the regression model now includes two basis functions:
where n is the level number. Equation (9) is then applied from level 22 upwards to level 70, and then down from level 19 (now using levels 20 and 21 as the predictors for ozone at level 19) to level 1.
The result is a pre-filled ozone data set, where filled values include some indication of the true month-to-month variability as suggested by the TCO month-to-month variability. Monthly mean zonal mean ozone values at 20 km after the homogenization of data sources has been applied (Fig. 5a) and the pre-filled data sets are shown in Fig. 5.
This Tier 0.5 data set was then used as input to a least squares regression model to generate the Tier 1.1 to Tier 1.4 data sets described in the next section. It describes the full natural variability and is therefore particularly useful for CCM evaluation studies when the model runs with prescribed dynamics.
The methodology to generate the Tier 1.1 to Tier 1.4 data sets is much the same approach as the one described for the previous version of the database (BDBP v220.127.116.11) in Bodeker et al. (2013). The same regression model is applied, but, rather than using this regression model to both fill the data gaps and conduct the attribution, it is now only used to conduct the attribution to various factors affecting ozone. This is a way to provide different data sets for very specific purposes with regards to CCM evaluation. For an in-depth description of the approach see Sect. 4 of Bodeker et al. (2013). The most important features of the approach used here are briefly outlined below.
The least squares regression model that was applied to the Tier 0.5 data consists of eight basis functions, viz.
a constant offset that is expanded in a Fourier series to represent the mean annual cycle;
an EESC (equivalent effective stratospheric chlorine) term that differs with age of air;
a linear trend term;
a quasi-biennial oscillation (QBO) basis function that was specified as the monthly mean 50 hPa Singapore zonal wind;
a second QBO basis function, that is mathematically orthogonalized to the first, to account for QBO lag variations with latitude and level;
an El Niño–Southern Oscillation (ENSO) term;
a solar cycle term; and
a Mt. Pinatubo term that accounts for the enhancement of stratospheric aerosols after the Mt. Pinatubo eruption in 1991.
The regression model was applied to each level separately, where all basis functions are included for all levels, except the EESC basis function which is excluded at levels below the mean tropopause. A two-term autocorrelation model was used to account for the effects of autocorrelation within the monthly mean time series. The regression was applied to monthly mean zonal mean values at all latitude bands simultaneously. This can be done by expanding the regression model fit coefficients not only in a Fourier series to account for temporal variability, but also in Legendre polynomials to account for latitudinal variability. The number of Fourier pairs and Legendre terms for each level differ, and this is determined by the expected latitude and time distribution of ozone for that level.
Tier 1.1 (anthropogenic): This data set is calculated by summing up the contributions from the offset, EESC, and linear trend basis functions.
Tier 1.2 (natural): This data set is calculated by summing up the contributions from the offset, QBO, ENSO, and solar cycle basis functions.
Tier 1.3 (natural and volcanoes): This data set is calculated by summing up the contributions from the offset, QBO, ENSO, solar cycle, and Mt. Pinatubo volcanic eruption basis functions.
Tier 1.4 (all): This data set is calculated by summing up the contributions from all basis functions.
As the Tier 1.x data sets are output from a regression model, they do not capture real-world year-to-year variability, only the variability for which basis functions are included in the regression model. These data sets are optimized for the use in comparisons with CCM simulations that do not exhibit the same unforced variability as reality. They can be used for different purposes, e.g., to compare ozone radiative forcing with and without the effects of changes in EESC and greenhouse gases on ozone.
Two examples of the Tier 0 database, and Tier 0.5, together with the differences between Tier 1.x and Tier 0.5 data sets for the latitude bands 30 to 35∘ S and 70 to 75∘ N are shown in Figs. 6 and 7. As there were no ozonesonde profiles available for the latitude band 30 to 35∘ S (Fig. 6), there are no measurements below about 250 hPa included in Tier 0. The stratosphere and lower mesosphere are covered well with satellite measurements from around 1985, but the effect of the Mt. Pinatubo eruption on the ozone measurements can clearly be seen in around 1991 to 1993 in the lower stratosphere. More ozonesonde measurements and less satellite observations are available for the latitude band 70 to 75∘ N (Fig. 7), and the inclusion of MLS measurements from 2005 onward significantly increases the coverage of the upper stratosphere and lower mesosphere.
The Tier 0 and Tier 0.5 data sets both show considerably more variability than the Tier 1.4 data set as the regression model is not capable of tracking all of the variability in Tier 0 and Tier 0.5 (non-systematic differences shown in the lower panels in Figs. 6 and 7), rather only the variability that can be described by the basis functions used in the regression model. For example, in 2011 unprecedented depletion of ozone happened in the Arctic winter (Manney et al., 2011), which is detectable as less ozone compared to earlier years for pressure levels between 100 and 10 hPa in Tier 0.5, but which is not apparent in Tier 1.4 (Fig. 7) since no basis functions are included in the regression model to track such variability. Careful comparison of the Tier 1.2 and Tier 1.4 data sets indicates the effects of natural variability on ozone, while the comparison of the Tier 1.1 with Tier 1.4 data set illustrates the effects of EESC on ozone.
Due to the implemented improvements in the construction of the BSVertOzone database over the previous version BDBP v18.104.22.168, differences between both databases are to be expected. Ozone concentrations from 1979 to 2016 as extracted from Tier 1.4 of both databases at three different pressure levels and three different latitude bands are shown in Fig. 8. Ozone concentrations in the upper stratosphere (10 hPa) over the northern midlatitudes are almost identical (upper panel). Differences become apparent for ozone at around 140 hPa in the tropics (middle panel): the ozone concentrations differ in magnitude and the BSVertOzone values (blue line) are consistently higher, especially after 1985, compared to the BDBP (red line). Furthermore, the annual cycle and interannual variability differ between BSVertOzone and BDBP, which could be a result of using a different number of Fourier and Legendre expansions used in the regression model generating the Tier 1.x data sets. While the expansions were predefined for the calculation of BDBP v22.214.171.124, the optimal number of expansions were determined by applying the BIC method (see Sect. 3.2) when calculating BSVertOzone. At this pressure level, satellite-based measurements were adjusted to ozonesonde measurements (see Sect. 3.2), measuring higher ozone concentrations between 5 and 10∘ N than satellite-based instruments. The BDBP includes the raw satellite-based and ozonesonde measurements, without any adjustment or correction being applied between the measurements from different instruments. Ozone concentrations at about 60 hPa in the southernmost latitude band, 85 to 90∘ S, as extracted from Tier 1.4 of BSVertOzone and BDBP are shown in the lower panel of Fig. 8. Both data sets track the evolution of the Antarctic ozone hole in each southern spring. However, there are small differences in the magnitude of the ozone reduction in the latitude band, especially at the beginning of the time series (1979 to 1985). The BSVertOzone (blue line) ozone concentrations are lower – e.g., there is more ozone depletion – than the ozone concentrations extracted from the BDBP v126.96.36.199 (red line) until about 1985. This is most likely a result of the homogenization approach applied to the different data sources in BSVertOzone but not in the BDBP. During the early 1980s, satellite-based measurements are sparse for most parts of the globe, and ozonesonde soundings were not as frequent as during the 1990s or 2000s.
Most measurements available for the latitude band and level described here (85 to 90∘ S, 58 hPa) are ozone soundings (South Pole station). There are only sparse satellite measurements available for this latitude band, and mostly later in the time period. The homogenization adjustment for the South Pole ozone soundings therefore is only based on an extrapolated difference field between the standard and CTM (see Sect. 3.2). Problems related to this extrapolation could cause the slightly increased ozone depletion in the early part of the BSVertOzone Tier 1.4 time series compared to BDBP.
Davis et al. (2016) developed one of the more recent vertically resolved ozone data sets: SWOOSH. This data set has already been used in a number of studies for a wide variety of analyses (Harris et al., 2015; Steinbrecht et al., 2017; Ball et al., 2017). Therefore, SWOOSH is an ideal candidate for the evaluation of the newly developed BSVertOzone database. A comparison of ozone concentrations extracted from SWOOSH with ozone concentrations extracted from Tier 0.5 and Tier 1.4 of BSVertOzone, for three different latitude bands and three different pressure levels, is shown in Fig. 9. As the SWOOSH ozone concentrations are provided on a different vertical grid, SWOOSH data were interpolated onto the BSVertOzone pressure levels for a direct comparison. Overall, the ozone concentrations from the tiers of BSVertOzone (red and blue lines) agree well with the ozone values from SWOOSH (black line) for the shown latitude bands and pressure levels. Some differences are apparent in the lower stratosphere of the southern midlatitudes (middle panel), where ozone concentrations from SWOOSH are higher than ozone concentrations from BSVertOzone. These differences are for the most part a result of the different homogenization approaches applied in SWOOSH and BSVertOzone. While in SWOOSH, all satellite-based measurements are adjusted to SAGE II measurements, in BSVertOzone all satellite-based measurements are adjusted to ozonesonde measurements at this pressure level (see Sect. 3.2). The lower panel of Fig. 9 shows some small differences at around 70 hPa over the tropics, where ozone concentrations from SWOOSH are occasionally higher than the ozone concentrations from BSVertOzone. These small differences are most likely a result of applying a different methodology when combining the measurements from different data sources. Additionally, the selection of data sources could explain some of the differences seen in Fig. 9: SWOOSH does not include ozonesonde measurements, as well as SAGE I, POAM II/III, and ILAS-I/II measurements, whereas BSVertOzone does not include any measurements from SAGE III and UARS (Upper Atmosphere Research Satellite) MLS. In the middle to upper stratosphere, due to the good coverage of satellite data, selecting different data sources will not affect the combined data product. However, in the lower stratosphere and upper troposphere where satellite measurements are sparse and more uncertain and ozonesonde measurements are more robust, not including ozonesonde measurements will lead to differences, as can be seen in the comparison between SWOOSH and BSVertOzone (Fig. 9 middle panel).
Comparisons between BSVertOzone Tier 0.5 and Tier 1.4 are in very close agreement, as would be expected. Tier 0.5 shows more interannual variability since its missing monthly mean zonal mean values are filled with regression model output describing the relationship between monthly mean ozone values from Tier 0 and monthly mean total column ozone values (see Sect. 4), and Tier 1.4 is smoother and shows less variation. Overall, however, time series of both tiers track each other closely.
BSVertOzone v1.0, which is described in this paper, is archived and publicly available at Zenodo (Zenodo is a research data repository that was created by OpenAIRE and the European Organization for Nuclear Research, CERN) with the DOI number https://doi.org/10.5281/zenodo.1217184 (Hassler et al., 2018). Additionally, it is available at the Bodeker Scientific website (http://www.bodekerscientific.com/data/monthly-mean-global-vertically-resolved-ozone, last access: 14 August 2018) in its most recent and updated version.
An updated and further developed version of the vertically resolved ozone database, the BDBP v188.8.131.52 (Bodeker et al., 2013), is presented in this study. Like its predecessor, the new database, BSVertOzone, consists of global monthly mean zonal mean ozone values between 1 and 70 km (878.4 to 0.046 hPa) and has several gap-free data sets. Monthly mean zonal mean ozone concentrations are provided in mixing ratio and number density for the period from 1979 to 2016 (note that although only examples for the mixing ratio on pressure level data sets were shown throughout the paper, the other three databases are also available through the DOI given below). The BSVertOzone database is unique within the collection of available vertically resolved ozone data sets (e.g., GOZCARDS, SWOOSH, and SAGE–CCI–OMPS) as BSVertOzone includes ozone profile measurements that cover the troposphere and the lower–mid-stratosphere. Offsets and drifts between the measurements from different instruments are now quantified and accounted for by applying a homogenization method that uses a CTM output as a transfer standard (see Sect. 3.2) and a regression model to adjust the measurements from different sources to a given standard. For levels above 15 km, measurements from SAGE II are the chosen standard, while ozonesonde data are used at 15 km and below as the standard (recognizing the higher data quality of the ozonesonde measurements below 15 km). Ozone concentrations from BSVertOzone compare well to the ozone values from SWOOSH in most latitude bins, although some discrepancies between the two data sets remain. These can be explained by the differences in the methodology of combining measurements from different data sources. The applied homogenization results in an improvement of BSVertOzone compared to the earlier version BDBP v184.108.40.206, i.e., a more realistic representation of the ozone variability in the atmosphere.
Tier 0 contains the monthly mean zonal mean values that are directly calculated from the individual (adjusted) data sources, containing data gaps where no measurements were available.
Tier 0.5 monthly mean zonal means represent an intermediate filled data set that is calculated from Tier 0 data. Missing monthly mean zonal mean ozone values are filled with regression model output describing the relationship between monthly mean ozone values from Tier 0 and monthly mean total column ozone values obtained from the total column ozone database described in Bodeker et al. (2018). The gap-free Tier 0.5 database captures real-world forced and unforced variability, suitable for CCM evaluations where the model was run with prescribed dynamics. Tier 0.5 is used as input for the calculation of the Tier 1 data sets.
Tier 1.1 to Tier 1.4 are based on multiple linear regression model output. They differ in the combination of the contributions of the different basis functions used in the regression model. The ozone variability in these data sets is reduced compared to Tier 0 and Tier 0.5, since it describes only the variability for which basis functions were included in the regression model. Especially Tier 1.4 is therefore well suited for evaluating CCM output, where the CCM is not nudged to real-world dynamics.
A clear improvement compared to BDBP v220.127.116.11 is the provision of uncertainty estimates on each monthly mean zonal mean for all tiers. These uncertainties combine the uncertainties that are provided with each individual measurement and the uncertainties introduced by applying the homogenization method. The provided uncertainties are essential for more realistic comparisons with CCM simulations, and results of ozone variability analyses can be interpreted with more information about the confidence in the results.
There are several improvements that could be implemented when preparing the measurements and for the used homogenization method. In the current version (v1.0) of BSVertOzone, the global troposphere is only covered by ozonesonde profile measurements. These profiles are available for many decades (see Sect. 2.1), but they only cover a limited portion of the globe. As a result, estimating the long-term global tropospheric ozone distribution from these measurements alone using the gap-filling method described above results in larger uncertainties in the troposphere than the stratosphere. Additional measurements in the troposphere would help to better constrain the regression model used to fill the data gaps and therefore reduce the overall uncertainties on the monthly mean zonal mean ozone values. Since the early 2000s there are satellite measurements of vertically resolved tropospheric ozone available (e.g., the Tropospheric Emission Spectrometer, TES; or the Atmospheric Infrared Sounder, AIRS), which could be included in the database. Although both satellite instruments only provide measurements for the last 15 years, the additional information would improve the characterization of the global tropospheric ozone concentrations of the database covering the full period.
Measurements from MLS are the only source of stratospheric ozone data in the last 10 years which were included in the current version of the database. As long as MLS is active and measures ozone, BSVertOzone will be updated regularly to include these MLS measurements. However, when MLS stops measuring ozone, alternative, and possibly additional, data sources for stratospheric ozone will need to be added to BSVertOzone to ensure a continuous time series of vertically resolved ozone into the future. Measurements from NASA's Ozone Mapping Profiler Suite (OMPS), from SCISAT's Atmospheric Chemistry Experiment Fourier transform spectrometer (ACE-FTS), or from the recently launched SAGE III instrument on the International Space Station (ISS) would be possible candidates to be included in BSVertOzone.
In addition to including more ozone measurements from different instruments, there are some improvements in the processing of the measurements that are planned to be implemented in the future. Firstly, as the CTM output used here as a transfer standard to homogenize the satellite and ozonesonde measurements has a temperature bias due to the underlying meteorological ERA-Interim reanalysis (see Sect. 3.1), the use of CTM output forced with the most recent reanalysis data set ERA5 is planned. This most likely will remove the remaining inconsistencies in the ozone concentrations simulated by SLIMCAT in the upper levels.
All available measurements for each latitude band, each level, and each month are most likely not evenly distributed spatially and temporally, which can result in a skewed (unrepresentative) monthly mean value and an underestimation of the monthly mean uncertainty. The individual ozone measurements should therefore undergo a spatial and temporal bias correction before monthly mean zonal means are calculated, to represent the monthly distribution correctly. Additionally, it might be necessary to consider possible existing spatial and temporal autocorrelations between individual data points. As mentioned in Sect. 5, a two-term autocorrelation model was used in the regression model generating the Tier 1 data sets. This only takes into account the temporal dependence of the already calculated monthly mean zonal mean values, but does not correct for the temporal dependence of measurements within 1 month, or any spatial dependence of measurements within the chosen latitude band. While this consideration would not affect the calculated monthly mean zonal means, it would change the number of available independent measurements and therefore would change the uncertainties on the calculated means.
In the upper stratosphere and mesosphere, ozone formation and destruction happens so fast that it follows the availability of sunlight. As a results, diurnal variations in ozone concentrations are observable in the upper stratosphere and lower mesosphere (Schanz et al., 2014). Therefore, differences in ozone concentrations measured in the upper stratosphere–lower mesosphere could result from satellite-based instruments measuring ozone at a different solar zenith angles, i.e., different local time of the day (e.g., Pallister and Tuck, 1983; Schanz et al., 2014; Studer et al., 2014). With the current version of BSVertOzone, the potential differences in ozone measurements caused by the diurnal cycle are ignored as the effect on the monthly mean zonal mean ozone values is expected to be small. However, to test this expectation, and for an update of BSVertOzone, the implementation of methods to account for the diurnal cycle effects on the upper levels of the ozone database is planned.
BH wrote the paper with support and input from all other co-authors. BH, SK, GEB, and JL were involved in combining the satellite measurements and performed the several computations required for the generation of the BSVertOzone database. KN was involved in developing the software for the homogenization method described in the paper. Comparison between the BSVertOzone and SWOOSH data sets was performed by SMD The SLIMCAT simulations were performed by MPC and SSD, who also provided the model output data for this study. MD assisted in writing the paper and provided valuable discussions around the methodology.
The authors declare that they have no conflict of interest.
This work was conducted under subcontract to NIWA under the Deep South
National Science Challenge (CO1X1445). The SLIMCAT modeling work was
supported by the NERC National Centre for Atmospheric Science (NCAS). We
thank Wuhu Feng for help with SLIMCAT. The simulations were performed on the
national Archer and Leeds ARC HPC facilities. We would like to thank Lucien
Froidevaux for providing the measurement uncertainties of MLS ozone data and
many helpful discussions. We would also like to thank two anonymous
reviewers for their helpful comments.
The article processing charges for this open-access
publication were covered by a Research
Centre of the Helmholtz Association.
Edited by: David Carlson
Reviewed by: two anonymous referees
Ball, W. T., Alsing, J., Mortlock, D. J., Rozanov, E .V., Tummon, F., and Haigh, J. D.: Reconciling differences in stratospheric ozone composites, Atmos. Chem. Phys., 17, 12269–12302, https://doi.org/10.5194/acp-17-12269-2017, 2017. a
Bodeker, G. E., Hassler, B., Young, P. J., and Portmann, R. W.: A vertically resolved, global, gap-free ozone database for assessing or constraining global climate model simulations, Earth Syst. Sci. Data, 5, 31–43, https://doi.org/10.5194/essd-5-31-2013, 2013. a, b, c, d, e, f, g, h, i, j, k
Bodeker Scientific: Combined vertical ozone profile database – version 1.0, available at: http://www.bodekerscientific.com/data/monthly-mean-global-vertically-resolved-ozone, last access date: 14 August, 2018.
Bourassa, A. E., Degenstein, D. A., Randel, W. J., Zawodny, J. M., Kyrölä, E., McLinden, C. A., Sioris, C. E., and Roth, C. Z.: Trends in stratospheric ozone derived from merged SAGE II and Odin-OSIRIS satellite observations, Atmos. Chem. Phys., 14, 6983–6994, https://doi.org/10.5194/acp-14-6983, 2014. a
Bourassa, A. E., Roth, C. Z., Zawada, D. J., Rieger, L. A., McLinden, C. A., and Degenstein, D. A.: Drift-corrected Odin-OSIRIS ozone product: algorithm and updated stratospheric ozone trends, Atmos. Meas. Tech., 11, 489–498, https://doi.org/10.5194/amt-11-489-2018, 2018. a
Bozdogan, H.: Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions, Psychometrika, 52, 345–370, 1987. a
Chipperfield, M. P.: New Version of the TOMCAT/SLIMCAT Off-Line Chemical Transport Model: Intercomparison of Stratospheric Tracer Experiments, Q. J. Roy. Meteorol. Soc., 132, 1179–1203, https://doi.org/10.1256/qj.05.51, 2006. a, b
Chipperfield, M. P., Dhomse, S. S., Feng, W., McKenzie, R. L., Velders, G., and Pyle, J. A.: Quantifying the ozone and UV benefits already achieved by the Montreal Protocol, Nat. Commun., 6, 7233, https://doi.org/10.1038/ncomms8233, 2015. a
Chipperfield, M. P., Bekki, S., Dhomse, S., Harris, N. R. P., Hassler, B., Hossaini, R., Steinbrecht, W., Thieblemont, R., and Weber, M.: Detecting recovery of the stratospheric ozone layer, Nature, 549, 211–218, https://doi.org/10.1038/nature23681, 2017. a
Cionni, I., Eyring, V., Lamarque, J. F., Randel, W. J., Stevenson, D. S., Wu, F., Bodeker, G. E., Shepherd, T. G., Shindell, D. T., and Waugh, D. W.: Ozone database in support of CMIP5 simulations: results and corresponding radiative forcing, Atmos. Chem. Phys., 11, 11267–11292, https://doi.org/10.5194/acp-11-11267-2011, 2011. a
Davis, S. M., Rosenlof, K. H., Hassler, B., Hurst, D. F., Read, W. G., Vömel, H., Selkirk, H., Fujiwara, M., and Damadeo, R.: The Stratospheric Water and Ozone Satellite Homogenized (SWOOSH) database: a long-term database for climate studies, Earth Syst. Sci. Data, 8, 461–490, https://doi.org/10.5194/essd-8-461-2016, 2016. a, b, c, d, e
Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P., Beljaars, A. C. M., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler, M., Matricardi, M., McNally, A. P., Monge-Sanz, B. M., Morcrette, J.-J., Park, B.-K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut, J.-N., and Vitart, F.: The ERA-Interim reanalysis: configuration and performance of the data assimilation system, Q. J. Roy. Meteorol. Soc., 137, 553–597, https://doi.org/10.1002/qj.828, 2011. a
deLeeuw, J.: Introduction to Akaike (1973) information theory and an extension of the maximum likelihood principle, in: Breakthroughs in Statistics I, edited by: Kotz, S. and Johnson, N. L., Springer, 599–609, 1992. a
Deshler, T., Stübi, R., Schmidlin, F. J., Mercer, J. L., Smit, H. G. J., Johnson, B. J., Kivi, R., and Nardi, B.: Methods to homogenize electrochemical concentration cell (ECC) ozonesonde measurements across changes in sensing solution concentration or ozonesonde manufacturer, Atmos. Meas. Tech., 10, 2021–2043, https://doi.org/10.5194/amt-10-2021-2017, 2017. a
Dhomse, S. S., Chipperfield, M. P., Feng, W., and Haigh, J. D.: Solar response in tropical stratospheric ozone: a 3-D chemical transport model study using ERA reanalyses, Atmos. Chem. Phys., 11, 12773–12786, https://doi.org/10.5194/acp-11-12773-2011, 2011. a, b
Efron, B. and Tibshirani, R.: Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy, Stat. Sci., 1, 54–77, 1986. a
Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, https://doi.org/10.5194/gmd-9-1937-2016, 2016. a
Foelsche, U., Scherllin-Pirscher, B., Ladstädter, F., Steiner, A. K., and Kirchengast, G.: Refractivity and temperature climate records from multiple radio occultation satellites consistent within 0.05 %, Atmos. Meas. Tech., 4, 2007–2018, https://doi.org/10.5194/amt-4-2007-2011, 2011. a
Forster, P. M. d. F.: Radiative forcing due to stratospheric ozone changes 1979–1997, using updated trend estimates, J. Geophys. Res., 104, 24395–24399, 1999. a
Frith, S. M., Stolarski, R. S., Kramarova, N. A., and McPeters, R. D.: Estimating uncertainties in the SBUV Version 8.6 merged profile ozone data set, Atmos. Chem. Phys., 17, 14695–14707, https://doi.org/10.5194/acp-17-14695-2017, 2017. a
Froidevaux, L., Jiang, Y. B., Lambert, A., Livesey, N. J., Read, W. G., Waters, J. W., Browell, E. V., Hair, J. W., Avery, M. A., McGee, T. J., Twigg, L. W., Sumnicht, G. K., Jucks, K. W., Margitan, J. J., Sen, B., Stachnik, R. A., Toon, G. C., Bernath, P. F., Boone, C. D., Walker, K. A., Filipiak, M. J., Harwood, R. S., Fuller, R. A., Manney, G. L., Schwartz, M. J., Daffer, W. H., Drouin, B. J., Cofield, R. E., Cuddy, D. T., Jarnot, R. F., Knosp, B. W., Perun, V. S., Snyder, W. V., Stek, P. C., Thurstans, R. P., and Wagner, P. A.: Validation of Aura Microwave Limb Sounder stratospheric ozone measurements, J. Geophys. Res., 113, D15S20, https://doi.org/10.1029/2007JD008771, 2008. a, b, c
Froidevaux, L., Anderson, J., Wang, H.-J., Fuller, R. A., Schwartz, M. J., Santee, M. L., Livesey, N. J., Pumphrey, H. C., Bernath, P. F., Russell III, J. M., and McCormick, M. P.: Global OZone Chemistry And Related trace gas Data records for the Stratosphere (GOZCARDS): methodology and sample results with a focus on HCl, H2O, and O3, Atmos. Chem. Phys., 15, 10471–10507, https://doi.org/10.5194/acp-15-10471-2015, 2015. a
Harris, N. R. P., Hassler, B., Tummon, F., Bodeker, G. E., Hubert, D., Petropavlovskikh, I., Steinbrecht, W., Anderson, J., Bhartia, P. K., Boone, C. D., Bourassa, A., Davis, S. M., Degenstein, D., Delcloo, A., Frith, S. M., Froidevaux, L., Godin-Beekmann, S., Jones, N., Kurylo, M. J., Kyrölä, E., Laine, M., Leblanc, S. T., Lambert, J.-C., Liley, B., Mahieu, E., Maycock, A., de Mazière, M., Parrish, A., Querel, R., Rosenlof, K. H., Roth, C., Sioris, C., Staehelin, J., Stolarski, R. S., Stübi, R., Tamminen, J., Vigouroux, C., Walker, K. A., Wang, H. J., Wild, J., and Zawodny, J. M.: Past changes in the vertical distribution of ozone – Part 3: Analysis and interpretation of trends, Atmos. Chem. Phys., 15, 9965–9982, https://doi.org/10.5194/acp-15-9965-2015, 2015. a, b
Hassler, B., Bodeker, G. E., and Dameris, M.: Technical Note: A new global database of trace gases and aerosols from multiple sources of high vertical resolution measurements, Atmos. Chem. Phys., 8, 5403–5421, https://doi.org/https://doi.org/10.5194/acp-8-5403-2008, 2008. a, b, c, d, e
Hassler, B., Petropavlovskikh, I., Staehelin, J., August, T., Bhartia, P. K., Clerbaux, C., Degenstein, D., de Mazière, M., Dinelli, B. M., Dudhia, A., Dufour, G., Frith, S. M., Froidevaux, L., Godin-Beekmann, S., Granville, J., Harris, N. R. P., Hoppel, K., Hubert, D., Kasai, Y., Kurylo, M. J., Kyrölä, E., Lambert, J.-C., Levelt, P. F., McElroy, C. T., McPeters, R. D., Munro, R., Nakajima, H., Parrish, A., Raspollini, P., Remsberg, E. E., Rosenlof, K. H., Rozanov, A., Sano, T., Sasano, Y., Shiotani, M., Smit, H. G. J., Stiller, G., Tamminen, J., Tarasick, D. W., Urban, J., van der A, R. J., Veefkind, J. P., Vigouroux, C., von Clarmann, T., von Savigny, C., Walker, K. A., Weber, M., Wild, J., and Zawodny, J. M.: Past changes in the vertical distribution of ozone – Part 1: Measurement techniques, uncertainties and availability, Atmos. Meas. Tech., 7, 1395–1427, https://doi.org/10.5194/amt-7-1395-2014, 2014. a
Hassler, B., Kremser, S., Bodeker, G., Lewis, J., Nesbit, K., Davis, S., Chipperfield, M., Dhomse, S., and Dameris, M.: BSVerticalOzone database (Version v1.0) [Data set], Zenodo, available at: http://doi.org/10.5281/zenodo.1217184, last access: 14 August 2018.
Hegglin, M. I., Plummer, D. A., Shepherd, T. G., Scinocca, J. F., Anderson, J., Froidevaux, L., Funke, B., Hurst, D., Rozanov, A., Urban, J., von Clarmann, T., Walker, K. A., Wang, H. J., Tegtmeier, S., and Weigel, K.: Vertical structure of stratospheric water vapour trends derived from merged satellite data, Nat. Geosci., 7, 768–776, https://doi.org/10.1038/ngeo2236, 2014. a, b
Hubert, D., Lambert, J.-C., Verhoelst, T., Granville, J., Keppens, A., Baray, J.-L., Bourassa, A. E., Cortesi, U., Degenstein, D. A., Froidevaux, L., Godin-Beekmann, S., Hoppel, K. W., Johnson, B. J., Kyrölä, E., Leblanc, T., Lichtenberg, G., Marchand, M., McElroy, C. T., Murtagh, D., Nakane, H., Portafaix, T., Querel, R., Russell III, J. M., Salvador, J., Smit, H. G. J., Stebel, K., Steinbrecht, W., Strawbridge, K. B., Stübi, R., Swart, D. P. J., Taha, G., Tarasick, D. W., Thompson, A. M., Urban, J., van Gijsel, J. A. E., Van Malderen, R., von der Gathen, P., Walker, K. A., Wolfram, E., and Zawodny, J. M.: Ground-based assessment of the bias and long-term stability of 14 limb and occultation ozone profile data records, Atmos. Meas. Tech., 9, 2497–2534, https://doi.org/10.5194/amt-9-2497-2016, 2016. a, b, c, d
Kyrölä, E., Laine, M., Sofieva, V., Tamminen, J., Päivärinta, S.-M., Tukiainen, S., Zawodny, J., and Thomason, L.: Combined SAGE II-GOMOS ozone profile data set for 1984–2011 and trend analysis of the vertical distribution of ozone, Atmos. Chem. Phys., 13, 10645–10658, https://doi.org/10.5194/acp-13-10645-2013, 2013. a
Manney, G. L., Santee, M. L., Rex, M., Livesey, N. J., Pitts, M. C., Veefkind, P., Nash, E. R., Wohltmann, I., Lehmann, R., Froidevaux, L., Poole, L. R., Schoeberl, M. R., Haffner, D. P., Davies, J., Dorokhov, V., Gernandt, H., Johnson, B., Kivi, R., Kyrö, E., Larsen, N., Levelt, P. F., Makshtas, A., McElroy, C. T., Nakajima, H., Parrondo, M. C., Tarasick, D. W., von der Gathen, P., Walker, K. A., and Zinoviev, N. S.: Unprecedented Arctic ozone loss in 2011, Nature, 478, 469–475, https://doi.org/10.1038/nature10556, 2011. a
McKenzie, R. L., Aucamp, P .J., Bais, A. F., Bjorn, L. O., Ilyas, M., and Madronich, S.: Ozone depletion and climate change: impacts on UV radiation, Photochem. Photobiol. Sci., 10, 182–198, https://doi.org/10.1039/C0PP90034F, 2011. a
Nowack, P. J., Abraham, N. L., Maycock, A. C., Braesicke, P., Gregory, J. M., Joshi, M. M., Osprey, A., and Pyle , J. A.: A large ozone-circulation feedback and its implications for global warming assessments, Nat. Clim. Change, 5, 41–45, https://doi.org/10.1038/nclimate2451, 2015. a
Pallister, R. C. and Tuck, A. F.: The diurnal variation of ozone in the upper stratosphere as a test of photochemical theory, Q. J. Roy. Meteor. Soc., 109, 271–284, https://doi.org/10.1002/qj.49710946002, 1983. a
Randel, W. J. and Wu, F.: A stratospheric ozone profile data set for 1979-2005: Variability, trends, and comparisons with column ozone data, J. Geophys. Res., 112, D06313, https://doi.org/10.1029/2006JD007339, 2007. a
Schanz, A., Hocke, K., and Kämpfer, N.: Daily ozone cycle in the stratosphere: global, regional and seasonal behaviour modelled with the Whole Atmosphere Community Climate Model, Atmos. Chem. Phys., 14, 7645–7663, https://doi.org/10.5194/acp-14-7645-2014, 2014. a, b
Sofieva, V. F., Kalakoski, N., Päivärinta, S.-M., Tamminen, J., Laine, M., and Froidevaux, L.: On sampling uncertainty of satellite ozone profile measurements, Atmos. Meas. Tech., 7, 1891–1900, https://doi.org/10.5194/amt-7-1891-2014, 2014. a, b
Sofieva, V. F., Kyrölä, E., Laine, M., Tamminen, J., Degenstein, D., Bourassa, A., Roth, C., Zawada, D., Weber, M., Rozanov, A., Rahpoe, N., Stiller, G., Laeng, A., von Clarmann, T., Walker, K. A., Sheese, P., Hubert, D., van Roozendael, M., Zehner, C., Damadeo, R., Zawodny, J., Kramarova, N., and Bhartia, P. K.: Merged SAGE II, Ozone_cci and OMPS ozone profiles dataset and evaluation of ozone trends in the stratosphere, Atmos. Chem. Phys., 17, 12533–12552, https://doi.org/10.5194/acp-17-12533-2017, 2017. a
Steinbrecht, W., Froidevaux, L., Fuller, R., Wang, R., Anderson, J., Roth, C., Bourassa, A., Degenstein, D., Damadeo, R., Zawodny, J., Frith, S., McPeters, R., Bhartia, P., Wild, J., Long, C., Davis, S., Rosenlof, K., Sofieva, V., Walker, K., Rahpoe, N., Rozanov, A., Weber, M., Laeng, A., von Clarmann, T., Stiller, G., Kramarova, N., Godin-Beekmann, S., Leblanc, T., Querel, R., Swart, D., Boyd, I., Hocke, K., Kämpfer, N., Maillard Barras, E., Moreira, L., Nedoluha, G., Vigouroux, C., Blumenstock, T., Schneider, M., García, O., Jones, N., Mahieu, E., Smale, D., Kotkamp, M., Robinson, J., Petropavlovskikh, I., Harris, N., Hassler, B., Hubert, D., and Tummon, F.: An update on ozone profile trends for the period 2000 to 2016, Atmos. Chem. Phys., 17, 10675–10690, https://doi.org/10.5194/acp-17-10675-2017, 2017. a
Studer, S., Hocke, K., Schanz, A., Schmidt, H., and Kämpfer, N.: A climatology of the diurnal variations in stratospheric and mesospheric ozone over Bern, Switzerland, Atmos. Chem. Phys., 14, 5905–5919, https://doi.org/10.5194/acp-14-5905-2014, 2014. a
Tarasick, D. W., Davies, J., Smit, H. G. J., and Oltmans, S. J.: A re-evaluated Canadian ozonesonde record: measurements of the vertical distribution of ozone over Canada from 1966 to 2013, Atmos. Meas. Tech., 9, 195–214, https://doi.org/10.5194/amt-9-195-2016, 2016. a
Tegtmeier, S., Hegglin, M. I., Anderson, J., Bourassa, A., Brohede, S., Degenstein, D., Froidevaux, L., Fuller, R., Funke, B., Gille, J., Jones, A., Kasai, Y., Krüger, K., Kyrölä, E., Lingenfelser, G., Lumpe, J., Nardi, B., Neu, J., Pendlebury, D., Remsberg, E., Rozanov, A., Smith,L., Toohey, M., Urban, J., von Clarmann, T., Walker, K. A., and Wang, R. H. J.: SPARC Data Initiative: A comparison of ozone climatologies from international satellite limb sounders, J. Geophys. Res., 118, 1–19, https://doi.org/10.1002/2013JD019877, 2013. a
Thompson, D. W. J., Solomon, S., Kushner, P. J., England, M. H., Grise, K. M., and Karoly, D. J.: Signatures of the Antarctic ozone hole in Southern Hemisphere surface climate change, Nat. Geosci., 4, 741–749, https://doi.org/10.1038/ngeo1296, 2011. a
Toohey, M., Hegglin, M. I., Tegtmeier, S., Anderson, J., Añel, J. A., Bourassa, A., Brohede, S., Degenstein, D., Froidevaux, L., Fuller, R., Funke, B., Gille, J., Jones, A., Kasai, Y., Krüger, K., Kyrölä, E., Neu, J. L., Rozanov, A., Smith, L., Urban, J., von Clarmann, T., Walker, K. A., and Wang, R. H. J.: Characterizing sampling biases in the trace gas climatologies of the SPARC Data Initiative, J. Geophys. Res., 118, 1–16, https://doi.org/10.1002/jgrd.50874, 2013. a
Tummon, F., Hassler, B., Harris, N. R. P., Staehelin, J., Steinbrecht, W., Anderson, J., Bodeker, G. E., Bourassa, A., Davis, S. M., Degenstein, D., Frith, S. M., Froidevaux, L., Kyrölä, E., Laine, M., Long, C., Penckwitt, A. A., Sioris, C. E., Rosenlof, K. H., Roth, C., Wang, H.-J., and Wild, J.: Intercomparison of vertically resolved merged satellite ozone data sets: interannual variability and long-term trends, Atmos. Chem. Phys., 15, 3021–3043, https://doi.org/10.5194/acp-15-3021-2015, 2015. a
Wang, H. J., Cunnold, D. M. and Bao, X.: A critical analysis of stratospheric aerosol and gas experiment ozone trends, J. Geophys. Res., 101, 12495–12514, 1996. a
Waters, J. W., Froidevaux, L., Harwood, R. S., Jarnot, R. F., Pickett, H. M., Read, W. G., Siegel, P. H., Cofield, R. E., Filipiak, M. J., Flower, D. A., Holden, J. R., Lau, G. K., Livesey, N. J., Manney, G. L., Pumphrey, H. C., Santee, M. L., Wu, D. L., Cuddy, D. T., Lay, R. R., Loo, M. S., Perun, V. S., Schwartz, M. J., Stek, P. C., Thurstans, R. P., Boyles, M. A., Chandra, S., Chavez, M. C., Chen, G.-S., Chudasama, B. V., Dodge, R., Fuller, R. A., Girard, M. A., Jiang, J. H., Jiang, Y., Knosp, B. W., LaBelle, R. C., Lam, J. C., Lee, K. A., Miller, D., Oswald, J. E., Patel, N. C., Pukala, D. M., Quintero, O., Scaff, D. M., Snyder, W. V., Tope, M. C., Wagner, P. A., and Walch, M. J.: The Earth Observing System Microwave Limb Sounder (EOS MLS) on the Aura satellite, IEEE T. Geosci. Remote, 44, 1075–1092, https://doi.org/10.1109/TGRS.2006.873771, 2006. a, b
Wild, J. D. and Long, C. S.: A Coherent Ozone Profile Dataset from SBUV, SBUV/2: 1979 to 2015, in preparation, 2018. a
Witte, J. C., Thompson, A. M., Smit, H. G. J., Fujiwara, M., Posny, F., Coetzee, G. J. R., Northam, E. T., Johnson, B. J., Sterling, C. W., Mohamad, M., Ogino, S.-Y., Jordan, A., and da Silva, F. R.: First reprocessing of Southern Hemisphere ADditional OZonesondes (SHADOZ) profile records (1998–2015): 1. Methodology and evaluation, J. Geophys. Res.-Atmos., 122, 6611–6636, https://doi.org/10.1002/2016JD026403, 2017. a
Zawada, D. J., Rieger, L. A., Bourassa, A. E., and Degenstein, D. A.: Tomographic retrievals of ozone with the OMPS Limb Profiler: algorithm description and preliminary results, Atmos. Meas. Tech., 11, 2375–2393, https://doi.org/10.5194/amt-11-2375-2018, 2018. a