<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="data-paper">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">ESSD</journal-id><journal-title-group>
    <journal-title>Earth System Science Data</journal-title>
    <abbrev-journal-title abbrev-type="publisher">ESSD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Earth Syst. Sci. Data</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1866-3516</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/essd-13-4385-2021</article-id><title-group><article-title>Development of observation-based global multilayer soil moisture products for 1970 to 2016</article-title><alt-title>Development of observation-based global multilayer SM products (1970–2016)</alt-title>
      </title-group><?xmltex \runningtitle{Development of observation-based global multilayer SM products (1970--2016)}?><?xmltex \runningauthor{Y. Wang et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Wang</surname><given-names>Yaoping</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff2">
          <name><surname>Mao</surname><given-names>Jiafu</given-names></name>
          <email>maoj@ornl.gov</email>
        <ext-link>https://orcid.org/0000-0002-2050-7373</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff3">
          <name><surname>Jin</surname><given-names>Mingzhou</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4">
          <name><surname>Hoffman</surname><given-names>Forrest M.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Shi</surname><given-names>Xiaoying</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-8994-5032</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Wullschleger</surname><given-names>Stan D.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff5">
          <name><surname>Dai</surname><given-names>Yongjiu</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Institute for a Secure and Sustainable Environment, University of
Tennessee, Knoxville, TN 37902, USA</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Environmental Sciences Division and Climate Change Science Institute, Oak Ridge National Laboratory, <?xmltex \hack{\break}?>Oak Ridge, TN 37830, USA</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Department of Industrial and Systems Engineering, University of
Tennessee, Knoxville, TN 37996, USA</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Computational Sciences and Engineering Division and Climate Change
Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>School of Atmospheric Sciences, Sun Yat-sen University, Guangzhou, 519082, China</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Jiafu Mao (maoj@ornl.gov)</corresp></author-notes><pub-date><day>7</day><month>September</month><year>2021</year></pub-date>
      
      <volume>13</volume>
      <issue>9</issue>
      <fpage>4385</fpage><lpage>4405</lpage>
      <history>
        <date date-type="received"><day>10</day><month>March</month><year>2021</year></date>
           <date date-type="rev-request"><day>16</day><month>March</month><year>2021</year></date>
           <date date-type="rev-recd"><day>14</day><month>July</month><year>2021</year></date>
           <date date-type="accepted"><day>25</day><month>July</month><year>2021</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2021 Yaoping Wang et al.</copyright-statement>
        <copyright-year>2021</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021.html">This article is available from https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021.html</self-uri><self-uri xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021.pdf">The full text article is available as a PDF file from https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e165">Soil moisture (SM) datasets are critical to understanding
the global water, energy, and biogeochemical cycles and benefit extensive
societal applications. However, individual sources of SM data (e.g., in situ
and satellite observations, reanalysis, offline land surface model
simulations, Earth system model – ESM – simulations) have source-specific
limitations and biases related to the spatiotemporal continuity,
resolutions, and modeling and retrieval assumptions. Here, we developed seven
global, gap-free, long-term (1970–2016), multilayer (0–10, 10–30,
30–50, and 50–100 cm) SM products at monthly 0.5<inline-formula><mml:math id="M1" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> resolution
(available at <ext-link xlink:href="https://doi.org/10.6084/m9.figshare.13661312.v1" ext-link-type="DOI">10.6084/m9.figshare.13661312.v1</ext-link>; Wang and Mao, 2021) by
synthesizing a wide range of SM datasets using three statistical methods
(unweighted averaging, optimal linear combination, and emergent constraint).
The merged products outperformed their source datasets when evaluated with
in situ observations (mean bias from <inline-formula><mml:math id="M2" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.044 to 0.033 m<inline-formula><mml:math id="M3" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M4" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, root
mean square errors from 0.076 to 0.104 m<inline-formula><mml:math id="M5" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M6" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, Pearson
correlations from 0.35 to 0.67) and multiple gridded datasets that did not
enter merging because of insufficient spatial, temporal, or soil layer
coverage. Three of the new SM products, which were produced by applying any
of the three merging methods to the source datasets excluding the ESMs,
had lower bias and root mean square errors and higher correlations than the
ESM-dependent merged products. The ESM-independent products also showed a
better ability to capture historical large-scale drought events than the
ESM-dependent products. The merged products generally showed reasonable
temporal homogeneity and physically plausible global sensitivities to
observed meteorological factors, except that the ESM-dependent products
underestimated the low-frequency temporal variability in SM and
overestimated the high-frequency variability for the 50–100 cm depth.
Based on these evaluation results, the three ESM-independent products were
finally recommended for future applications because of their better
performances than the ESM-dependent ones. Despite uncertainties in the raw
SM datasets and fusion methods, these hybrid products create added value
over existing SM datasets because of the performance improvement and
harmonized spatial, temporal, and vertical coverages, and they provide a new
foundation for scientific investigation and resource management.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<?pagebreak page4386?><sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e239">High-quality global soil moisture (SM) datasets benefit many applications,
such as understanding drought changes and ecosystem dynamics (Green et al.,
2019; Kumar et al., 2019), studying land–atmosphere feedbacks (Li et al.,
2020a), benchmarking model capabilities (Loew et al., 2013), and
initializing weather and climate forecast systems (Sospedra-Alfonso and
Merryfield, 2018). The majority of SM products fall into five categories: in
situ measurements, satellite observations, offline land surface model (LSM)
simulations, reanalysis, and Earth system model (ESM) simulations. In situ
measurements provide the most direct SM observations at the point scale but
are too sparse to be interpolated to the global level (spatial
autocorrelation dies <inline-formula><mml:math id="M7" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 300 km; Gruber et al., 2016).
Satellite-derived SM records only penetrate the top few centimeters of soil
and contain errors and spatial gaps typically caused by factors such as a
change in path, dense vegetation, frozen soil, water bodies, and radio
frequency interference (Llamas et al., 2020; Wang et al., 2012). Although a
long-term (1979–present) concatenated SM dataset was developed by merging
data from multiple satellites, the merged product did not fill the spatial
gaps that existed in the source satellite datasets (Dorigo et al., 2012;
EODC, 2021). The SM in LSM simulations usually spans multiple soil layers
and has no spatial or temporal gaps, which is convenient for regional and
global analysis (Gu et al., 2019); however, LSM simulations may contain
considerable errors because of inadequacies in the model physics,
parameterization, and drivers (Andresen et al., 2020). Reanalysis datasets
assimilate observations into LSMs or coupled forecast systems that have LSMs
as a component and are gap-free. Direct assimilation of remote sensing SM,
which has been the practice for some recent reanalysis – such as the ECMWF
Reanalysis 5 (ERA5) (de Rosnay et al., 2013) and Global Land Evaporation
Amsterdam Model (GLEAM) (Martens et al., 2017) – is likely to improve the
performance relative to free-running LSMs. Still, many reanalyses do not
directly assimilate the observational SM, such as the Japanese 55-Year
Reanalysis (JRA55) (Kobayashi et al., 2015) and Modern-Era Retrospective
Analysis for Research and Applications Version 2 (MERRA2) (McCarty et al.,
2016). Also, the meteorological variables, especially precipitation,
simulated by the atmosphere model of the coupled reanalysis system may be
biased, leading to inaccurate SM estimates by the intrinsic LSM component
(Balsamo et al., 2015). Fully coupled ESMs, such as those for the Coupled
Model Intercomparison Project phases 5 and 6 (CMIP5 and CMIP6) (Eyring et
al., 2016; Taylor et al., 2012), provide SM simulations for both historical
and future periods. ESMs, however, share the same uncertainty sources for
the SM estimates as the LSMs; moreover, the SM in ESM simulations has
internal variability-related uncertainties induced by unrealistic
initialization from the preindustrial conditions rather than the real world
(Eyring et al., 2016; Taylor et al., 2012).</p>
      <p id="d1e249">There is active development toward generating more accurate, gap-free SM
datasets. Methods for filling the spatial gaps in satellite observations
have been under investigation, but the resulting estimates either cover
short time periods or target only parts of the globe (Llamas et al., 2020;
Wang et al., 2012). One global multilayer SM product was generated by
upscaling in situ observations using machine learning and selected SM
predictors; however, it only focused on 2000–2019 (O and Orth, 2021).
Unlike current global reanalyses that directly assimilate satellite SM
(Martens et al., 2017; de Rosnay et al., 2013), some studies merged in situ observations,
or both in situ and satellite observations, with offline LSMs to improve
accuracy while retaining complete spatiotemporal coverage. Nevertheless,
these efforts were mainly conducted on the regional scale using limited sets
of LSMs (Wu et al., 2018; Zeng et al., 2016; Gruber et al., 2016).</p>
      <p id="d1e252">Therefore, there is a need to develop global merged SM products that
comprehensively combine the information from the latest in situ and
satellite observations, offline LSMs, reanalysis, and ESMs using advanced
fusion methods. Because of the incorporation of various quality-controlled
observations in the merging process, the merged products would likely
perform better than the SM in the original LSMs or ESMs, while keeping the
benefits of being gap-free in space and having long temporal and
multi-soil-layer coverage. The fusion of multiple LSMs, reanalysis, and
ESMs also involves ensemble averaging, which may reduce the SM uncertainties
from individual models by canceling the model-specific errors (Giorgi and
Mearns, 2002). This study presents a group of SM products derived using
three merging methods: unweighted  averaging (Mean), optimal linear combination
(OLC), and emergent constraint (EC). Unweighted averaging assigns equal
weight to all the source datasets and does not use in situ information (see
Sect. 2.1 for explanation for the exclusion). The OLC is an ensemble
weighting and rescaling algorithm that is optimal in the sense that the
weighted average minimizes the mean squared difference with respect to the
site-level observations (Bishop and Abramowitz, 2013). The OLC method was
previously found to lead to improved performance in the merged product
relative to the source datasets in terms of the global evapotranspiration
and runoff (Hobeichi et al., 2018, 2019). The EC method is common for
reducing uncertainty in future ESM simulations (Mystakidis et al., 2016;
Padrón et al., 2019). This method first uses data from multiple ESMs to
establish physically meaningful and statistically significant relationships
between the constraint variables that have observations and a target
variable that has no observations, and it then uses the relationship and actual
observations to derive a constrained target variable (Mystakidis et al.,
2016; Padrón et al., 2019). Given the clear physical relationships
between the SM and meteorological variables, we hypothesized in this study
that the EC method can be applied to reduce forcing-related biases in
offline LSMs and<?pagebreak page4387?> reanalysis and to align the natural internal variability
in ESMs with the real world. Seven new monthly multilayer SM datasets at
<inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.5</mml:mn><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> resolution for 1971–2016 were
produced by applying these merging algorithms to  different
combinations of the mentioned raw SM estimates. The merged products with
different setups were then systematically evaluated against in situ measurements
that were reserved for validation, semi-independent gridded SM datasets,
drought indices, and meteorological variables.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e278">Procedure of creating the merged SM products using different
methods and source products.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021-f01.png"/>

      </fig>

</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methods and source datasets</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Overview</title>
      <p id="d1e302">Figure 1 shows the schematic of the merging procedure to create the seven SM
products. The unweighted averaging and OLC (Hobeichi et al., 2018, 2019)
methods were applied to the observational or observation-forced datasets
(i.e., offline LSMs, reanalysis, satellite – ORS). The unweighted averaging
did not use any in situ observations because the in situ observations were
sparse (<inline-formula><mml:math id="M9" display="inline"><mml:mo lspace="0mm">∼</mml:mo></mml:math></inline-formula> 1400 stations compared with <inline-formula><mml:math id="M10" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 60 000
grids in a 0.5<inline-formula><mml:math id="M11" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> gap-free dataset over the global land surface;
Sect. 2.2). In unweighted averaging, the in situ observations can only
influence the merged values in the time steps and grids that coincide with
the observations. Therefore, the inclusion of in situ observations would
have little influence on the results of unweighted averaging. Also, to
validate a merged time step and grid, an un-merged observation must be
available at the same time step and grid, which would be difficult to
achieve in data-sparse situations. The OLC method used in situ observations
to constrain the ORS datasets. The EC method (Mystakidis et al., 2016) was
applied to the ORS, CMIP5, CMIP6, combination of CMIP5 and CMIP6
(CMIP5<inline-formula><mml:math id="M12" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6), and combination of ORS, CMIP5, and CMIP6 (ALL) datasets
(Eyring et al., 2016; Taylor et al., 2012) using gridded global
meteorological observations as constraints. The use of ESM simulations with
the EC method, but not with the unweighted averaging or OLC methods, was
because the latter two methods resulted in very inadequate performances when
applied to the ESM simulations in a preliminary analysis (results not
shown). Because the ORS datasets do not have uniform temporal coverage
(Tables S1–S3), the unweighted averaging only used the ORS datasets that
cover 1970–2016. For the OLC method, the ORS datasets were grouped based on
three time ranges (1970–2010, 1981–2010, and 1981–2016) that were
selected to maximize the available ORS datasets in each time range. For each
time range, the ORS datasets that fully cover the time range were merged
with the OLC method; if an ORS dataset fully covers two or three time
ranges, it was used in all the covered time ranges (see the “time
period used”  in Tables S1–S3). Then, the merged results for all three time
ranges were concatenated into a consistent dataset covering the whole target
period following a previous method for concatenating the remote sensing SM
(Dorigo et al., 2017; Liu et al., 2011, 2012) (Sect. 2.9). The CMIP5 and
CMIP6 datasets always cover 1970–2016; when they were used jointly with the
ORS datasets to produce the EC ALL (i.e., including all the source datasets)
product, they were subset to the same time ranges as the ORS datasets,
separately processed, and concatenated (bottom of Fig. 1). All the
synthesized monthly SM datasets are at 0.5<inline-formula><mml:math id="M13" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> resolution, cover
1970–2016, and contain four depths (0–10, 10–30, 30–50, and 50–100 cm).
The following sections provide more details on the datasets, merging methods,
and processing procedures.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>In situ SM observations</title>
      <p id="d1e352">In situ SM observations were obtained from the International Soil Moisture
Network (ISMN) (Dorigo et al., 2011, 2013). Only the observations associated
with the ISMN quality flags “G” (good) or “M” (parameter value missing)
were retained. The resulting dataset contains <inline-formula><mml:math id="M14" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 1400 stations
worldwide and spans  1964 to the present. Because only a few stations
were available at the beginning of the time period, only the observations in
1970 or later were used. To facilitate processing by the OLC method, the
ISMN observations were aggregated to monthly 0.5<inline-formula><mml:math id="M15" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> resolution and
regular depths (0–10, 10–30, 30–50, and 50–100 cm). The aggregation to
monthly resolution was simply averaging over all the available observations
in each month at each station. Although it is desirable to apply a stricter
criterion in the monthly averaging, such as treating a month as missing if
observations are for fewer than 15 d in the month, applying such a
criterion would exclude most of the stations in northern and eastern Asia,
which only have one to three observations per month. Multiple methods were tested
for the aggregation to 0.5<inline-formula><mml:math id="M16" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> resolution: (1) simply averaging all
the stations in each grid; (2) weighted-averaging the stations based on the
percentage<?pagebreak page4388?> of grid area that the land cover of each station represents
using the Moderate Resolution Imaging Spectroradiometer (MODIS) MCD12C1
product (Friedl and Sulla-Menashe, 2015); and (3) the same as (2) except
that if the total area of all the land cover types that the stations represent
does not account for a sufficient percentage of the grid area (40 %), the
grid was set to missing. Because all three methods resulted in similar
performance in the merged products (results not shown), only the second
method was adopted for the final products. The aggregation to regular depths
was simply to average over all the available observations in each depth
interval. When an observation was taken on the interface of two depth
intervals (e.g., exactly at 10 cm), the observation was assigned to the
shallower depth interval (e.g., 0–10 cm). Figure S1 shows the aggregated
ISMN observations at the 0.5<inline-formula><mml:math id="M17" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> grid scale and the number of
observations that falls into each land cover type. The observations are
overrepresented in the developed part of the world; the most overrepresented
land cover types include deciduous broadleaf forests, grasslands, and
croplands, whereas the most underrepresented land cover types are evergreen
broadleaf forests, mixed forests, closed shrublands, and permanent wetlands.
The number of available monthly observations did not decrease with deeper
soil layers (global totals are 19 317 station months for the 0–10 cm layer,
25 307 for the 10–30 cm layer, 25,011 for the 30–50 cm layer, and 20 660
for the 50–100 cm layer), although no observations were available for the
closed shrublands and permanent wetlands across the deeper soil layers
(10–30, 30–50, and 50–100 cm). After the ISMN observations were
aggregated to monthly 0.5<inline-formula><mml:math id="M18" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> resolutions, 60 % of the month grids
were used as the observed SM values in the OLC method (the <inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:msup><mml:mi>o</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>
variable; see Sect. 2.7), and the remainder were reserved for evaluating all
the merged products. The training month grids were uniformly randomly
selected without distinguishing between space and time.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>ORS, CMIP5, and CMIP6 SM products</title>
      <p id="d1e421">Tables S1–S4 list the monthly gridded ORS, CMIP5, and CMIP6 results that
were used as the SM source datasets for the merged products, as well as the
depths (0–10, 10–30, 30–50, or 50–100 cm) and time ranges (1970–2016,
1970–2010, 1981–2010, or 1981–2016) for which the datasets were used. The
name of the used SM variable was “mrlsl” in the CMIP5 collection and
“mrsol” in the CMIP6 collection. Prior to merging, all the ORS, CMIP5, and
CMIP6 datasets were bilinearly interpolated to 0.5<inline-formula><mml:math id="M20" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> resolution, linearly interpolated to the target soil depths (0–10,
10–30, 30–50, or 50–100 cm), and masked with a common land mask using the
National Center for Atmospheric Research (NCAR) Command Language 6.6.2
(UCAR/NCAR/CISL/TDD, 2019). Linear interpolation to all four soil depths
could not be achieved for all the source datasets because the soil layers in
some models are too shallow or too coarse. For example, the bottom depths of
the Joint UK Land Environment Simulator (JULES) LSM are 10 cm, 35 cm, 1 m,
and 3 m (Sebastian Lienert, personal communication, 2019). Therefore, the
10–30 and 50–100 cm depths are contained within single layers in the JULES
model and could not be directly interpolated. The four target soil depths
here were selected to maximize the number of source datasets that could be
interpolated to each depth. Although the Community Land Model version 4
(CLM4) simulates SM as deep as 421 cm, the model was only used for 0–10 and
10–30 cm (Table S3). This was because the SM values at levels deeper than 38 cm
were very low (<inline-formula><mml:math id="M21" display="inline"><mml:mo lspace="0mm">&lt;</mml:mo></mml:math></inline-formula> 0.0005 m<inline-formula><mml:math id="M22" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M23" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> on global average) and
even lower than the SM values at levels shallower than 38 cm in the same model
(<inline-formula><mml:math id="M24" display="inline"><mml:mo lspace="0mm">&gt;</mml:mo></mml:math></inline-formula> 0.0025 m<inline-formula><mml:math id="M25" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M26" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> on global average). The common land
mask was created by intersecting the grid points that satisfy two criteria:
(1) all the datasets except the European Space Agency Climate Change
Initiative (ESA CCI) v4.5, after being interpolated to 0.5<inline-formula><mml:math id="M27" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>, have
valid values; and (2) at least 50 % of the land cover is not water bodies
or permanent snow and ice in the MODIS MCD12C1 product (Friedl and
Sulla-Menashe, 2015). Because of many spatial gaps, the ESA CCI v4.5 dataset
was not used to create the common land mask and received special handling.
In addition to being masked with the common land mask, the ESA CCI v4.5
dataset was masked using its accompanying quality flags, meaning the time
steps and grids that have snow coverage or temperature below zero (flag
1), dense vegetation (flag 2), no valid SM estimates (flag 4), SM
values above the physical boundary (flag 8), or only unreliable SM values
(flag 16). The ESA CCI v4.5 dataset was excluded from merging at the
time steps and grids in which there are missing values.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Observed, CMIP5, and CMIP6 temperature and precipitation</title>
      <p id="d1e507">The EC method, as implemented in this study (Sect. 2.8), requires the air
temperature and precipitation forcings that correspond to each SM dataset
and observed air temperature and precipitation as inputs. The temperature
and precipitation forcings that correspond to the ORS SM datasets are listed
in Table S5. Because the ESA CCI v4.5 dataset is observational and GLEAM
v3.3a directly assimilates the ESA CCI dataset (Dorigo et al., 2017), these
two datasets were assumed to correspond to the observed temperature and
precipitation in the Climate Research Unit (CRU) TS v4.03 dataset (Harris et
al., 2014). The temperature and precipitation forcings for the various
reanalysis datasets were obtained from the same reanalysis. The temperature
and precipitation forcings for the Multi-scale Synthesis and Terrestrial
Model Intercomparison Project (MsTMIP) collection of LSMs were from the CRU
NCEP v4 dataset, which, at monthly level, is equal to the CRU TS v3.20
dataset (Huntzinger et al., 2018). The temperature and precipitation
forcings for the Trends and Drivers of the Regional Scale Sources and Sinks
of Carbon Dioxide version 7 (TRENDY v7) collection of LSMs were from the CRU
TS v3.26 dataset (Stephen Sitch and<?pagebreak page4389?> Pierre Friedlingstein, personal communication, 2019).
The temperature and precipitation datasets that correspond to the CMIP5 and
CMIP6 SM were from the same ESMs and ensemble members (Table S4). For
observed air temperature and precipitation, the CRU TS v4.03 dataset (Harris
et al., 2014) was used. All the temperature datasets were bilinearly
interpolated, and the precipitation datasets conservatively interpolated, to
0.5<inline-formula><mml:math id="M28" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> resolution using the NCAR Command Language 6.6.2
(UCAR/NCAR/CISL/TDD, 2019). All the temperature and precipitation datasets
were limited to the same common land mask as the SM products.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e522">Global and regional datasets that were compared against the merged
SM products.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="justify" colwidth="2.5cm"/>
     <oasis:colspec colnum="2" colname="col2" align="justify" colwidth="2.5cm"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="justify" colwidth="1cm"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="justify" colwidth="2.3cm"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Dataset</oasis:entry>
         <oasis:entry colname="col2">Type</oasis:entry>
         <oasis:entry colname="col3">Period</oasis:entry>
         <oasis:entry colname="col4">Depth (cm)</oasis:entry>
         <oasis:entry colname="col5">Resolution</oasis:entry>
         <oasis:entry colname="col6">Coverage</oasis:entry>
         <oasis:entry colname="col7">Reference</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">SMOS L3 RE04<?xmltex \hack{\hfill\break}?>MIR_CLF3MA, MIR_CLF3MD</oasis:entry>
         <oasis:entry colname="col2">Satellite</oasis:entry>
         <oasis:entry colname="col3">2010–2020</oasis:entry>
         <oasis:entry colname="col4">Surface (0–5)</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M29" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 25 km</oasis:entry>
         <oasis:entry colname="col6">Global with missing values</oasis:entry>
         <oasis:entry colname="col7">Al Bitar et al. (2017)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">SMOS L4 SCIE<?xmltex \hack{\hfill\break}?>MIR_CLM4RD</oasis:entry>
         <oasis:entry colname="col2">Reanalysis</oasis:entry>
         <oasis:entry colname="col3">2010–2020</oasis:entry>
         <oasis:entry colname="col4">0–100</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M30" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 25 km</oasis:entry>
         <oasis:entry colname="col6">Global with missing values</oasis:entry>
         <oasis:entry colname="col7">Al Bitar et al. (2013)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">GLEAM v3.3a</oasis:entry>
         <oasis:entry colname="col2">Reanalysis</oasis:entry>
         <oasis:entry colname="col3">1980–2018</oasis:entry>
         <oasis:entry colname="col4">0–100</oasis:entry>
         <oasis:entry colname="col5">0.25<inline-formula><mml:math id="M31" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">Global</oasis:entry>
         <oasis:entry colname="col7">Martens et al. (2017)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">SMERGE v2</oasis:entry>
         <oasis:entry colname="col2">Reanalysis</oasis:entry>
         <oasis:entry colname="col3">1979–2019</oasis:entry>
         <oasis:entry colname="col4">0–40</oasis:entry>
         <oasis:entry colname="col5">0.125<inline-formula><mml:math id="M32" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">Contiguous United States</oasis:entry>
         <oasis:entry colname="col7">Tobin et al. (2017)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SoMo.ml</oasis:entry>
         <oasis:entry colname="col2">Machine learning<?xmltex \hack{\hfill\break}?>upscaled from in<?xmltex \hack{\hfill\break}?>situ observations</oasis:entry>
         <oasis:entry colname="col3">2000–2019</oasis:entry>
         <oasis:entry colname="col4">0–10, 10–30, 30–50</oasis:entry>
         <oasis:entry colname="col5">0.25<inline-formula><mml:math id="M33" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">Global</oasis:entry>
         <oasis:entry colname="col7">O and Orth (2021)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS5">
  <label>2.5</label><title>In situ and gridded SM datasets for evaluation of the merged products</title>
      <p id="d1e754">A recent discussion on the evaluation of coarse-scale soil moisture datasets
noted that neither in situ observations, which have limited coverage and
a small spatial footprint, nor satellite and LSMs, which have retrieval or
modeling errors, can be considered fully adequate for evaluation at the
global scale; as such, a sound evaluation practice would require combining
multiple sources of data (Gruber et al. 2020). Following this
recommendation, the merged SM products were evaluated against the reserved
40 % in situ observations (Sect. 2.2), as well as a few gridded reanalysis,
satellite, and machine-learning upscaled SM datasets. Although the merging
process aimed to use as many existing SM datasets as possible, the gridded
datasets in Table 1 were not used in the merging because of incompatible
vertical resolution, non-global spatial coverage, or short temporal coverage
(Table 1). Such evaluation against multisource gridded datasets complements
the evaluation against in situ observations by providing sanity checks on
the behavior of the merged products at large scales. All the evaluation
datasets were bilinearly interpolated to <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.5</mml:mn><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> and aggregated to a monthly level using the NCAR Command
Language 6.6.2 (UCAR/NCAR/CISL/TDD, 2019). For the Soil Moisture and Ocean
Salinity (SMOS) L3 dataset, which was available as monthly aggregates
(<uri>https://www.catds.fr/sipad/</uri>, last access:  30 July 2021), only the data points with retrieval error
(i.e., the “DQX” field) <inline-formula><mml:math id="M35" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 0.07 m<inline-formula><mml:math id="M36" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M37" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> were
used, and the ascending (MIR_CLM4RA) and descending
(MIR_CLM4RD) orbits were averaged. For the SMOS L4 dataset,
which was only available at daily resolution, the days with a quality index
of 1 (highest quality) were used, and the monthly averaging was restricted
to the months with fewer than 13 missing days to be consistent with the
downloaded monthly SMOS L3 dataset. The other evaluation datasets do not
have gaps, so the aggregation to monthly level was straightforward. The SMOS
L3 and L4 datasets were independent from the merged products. The SoMo.ml SM
is mainly upscaled from the ISMN dataset (O and Orth, 2021,) and it is therefore
only semi-independent from the OLC ORS merged product but independent from
the unweighted averaging- or EC-based merged products. The GLEAM v3.3a
0–100 cm dataset is not independent from the ORS- or ALL-based merged
products, which use the 0–10 cm part of the GLEAM v3.3a dataset (Table S2),
but is independent from the CMIP5- or CMIP6-based merged products. The
SMERGE v2 dataset uses the ESA CCI satellite data and is therefore
nonindependent from the ORS- or ALL-based merged products (Table S1), but it is
independent from the CMIP5- or CMIP6-based merged products.</p>
      <p id="d1e808">The merged products were evaluated against the validation set of in situ
observations and the gridded SM datasets using three common metrics: mean
bias (bias), root mean squared error (RMSE), and Pearson correlation
coefficient (Corr). For evaluation against the in situ observations, the
metrics were calculated for both the whole validation set and for each land
cover type in consideration of the uneven distribution of ISMN observations
across land cover types (Fig. S1). The observational values used in each
calculation were the land-cover-weighted averages (see Sect. 2.2), and the
merged values were from the grids and time steps that have the observational
values. For evaluation against the SMOS L3 gridded dataset, the 0–10 cm
layers of the merged products and the source datasets (ORS, CMIP5, and CMIP6)
were  used. For evaluation against the other evaluation datasets, the merged
and source datasets were linearly interpolated to the depths of the evaluation
datasets. The annual climatology, mean seasonal anomalies (i.e., the
climatology of individual months minus the annual climatology),
least-squares linear trends, and anomalies (i.e., the original values minus
the mean seasonal cycle and trends) were calculated for each common grid
cell and over the common time period between each pair of evaluated and
evaluation datasets. Then, for each characteristic (climatology, seasonal
cycle, linear trends, or anomalies), the bias, RMSE, and Corr were
calculated using the values of the characteristic pooled over all the common
grid cells. When calculating the bias, RMSE, and Corr for the trends, the
insignificant trends at <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn></mml:mrow></mml:math></inline-formula> were set to zero to prevent small random
variability from influencing the results.</p>
</sec>
<sec id="Ch1.S2.SS6">
  <label>2.6</label><title>Drought and meteorological datasets for evaluation of the merged products</title>
      <p id="d1e831">In situ observations are sparse and represent much smaller spatial scales
than the 0.5<inline-formula><mml:math id="M39" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> grid of the merged products. Other global and
regional SM datasets have short temporal coverage and spatial gaps in the
evaluation datasets, as well as nonindependence between the merged products and
evaluation datasets. Given these limitations and to further ascertain the
quality of the merged products, the new SM products were evaluated using
process-based observational metrics, including the responses to prominent
historical drought events and historical climate change (e.g.,
precipitation, temperature, downwelling shortwave radiation). The selected
historical drought events were the United States drought of 1985–1992 and
the Australian millennium drought of 2002–2009<?pagebreak page4390?> because of their
macro-regional spatial scale and high severity; many other drought events
would also fit these criteria (Spinoni et al., 2019), but conducting a
comprehensive assessment of drought events is beyond the scope of this
study. A self-calibrated Palmer Drought Severity Index (scPDSI) dataset (Dai
et al., 2004) was used as the benchmark, and the spatial patterns of SM
anomalies and scPDSI were compared year by year during these two drought
events. The precipitation, temperature, and downwelling shortwave radiation
datasets were from the Global Soil Wetness Project (GWSP) version 3
reanalysis (Dirmeyer et al., 2006), which provides some independence from the
CRU TS v4.03 temperature and precipitation used in the EC method (Sect. 2.4). The SM climatic sensitivities were derived using the partial
correlations with each meteorological variable calculated as conditional on the
other two variables.</p>
</sec>
<sec id="Ch1.S2.SS7">
  <label>2.7</label><title>OLC method</title>
      <p id="d1e851">Let <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> stand for the SM value of the source dataset <inline-formula><mml:math id="M41" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> (<inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>K</mml:mi></mml:mrow></mml:math></inline-formula>) at time step <inline-formula><mml:math id="M43" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> (<inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:math></inline-formula>) and grid <inline-formula><mml:math id="M45" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> (<inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>S</mml:mi></mml:mrow></mml:math></inline-formula>), and let <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msup><mml:mi>o</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> be the observed SM values at time step
<inline-formula><mml:math id="M48" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> and grid <inline-formula><mml:math id="M49" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, where <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>∈</mml:mo><mml:mi>V</mml:mi></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M51" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula> is the subset of grids and time steps that have observed SM. OLC calculates the final estimated SM (<inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as a weighted average, with <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> denoting the weight of
source dataset <inline-formula><mml:math id="M54" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>, using Eq. (1).
            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M55" display="block"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e1098">The optimal vector of weights for the source datasets, <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mfenced close="]" open="["><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>w</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>w</mml:mi><mml:mi>K</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, which minimizes the mean
squared error subject to <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, is a function of
the error covariance matrix of the source datasets (<inline-formula><mml:math id="M58" display="inline"><mml:mi mathvariant="bold">A</mml:mi></mml:math></inline-formula>)
following Eq. (2).
            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M59" display="block"><mml:mrow><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mi mathvariant="bold">A</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mn mathvariant="normal">1</mml:mn><mml:mi>T</mml:mi></mml:msup><mml:msup><mml:mi mathvariant="bold">A</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e1212">The OLC procedure without a constant term requires the source datasets to be
unbiased (Bishop and Abramowitz, 2013), but the ORS datasets are biased
relative to the in situ observations. Therefore, to prevent the biases from
influencing the weights, the error covariance matrix <inline-formula><mml:math id="M60" display="inline"><mml:mi mathvariant="bold">A</mml:mi></mml:math></inline-formula> was
the covariance matrix of locally centered errors (<inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:msubsup><mml:mi>e</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, which were
calculated following Eq. (3):
            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M62" display="block"><mml:mrow><mml:msubsup><mml:mi>e</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>.</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:msup><mml:mi>o</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi>o</mml:mi><mml:mrow><mml:mo>.</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfenced><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>∈</mml:mo><mml:mi>V</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>.</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> is the time-averaged SM value of the source dataset <inline-formula><mml:math id="M64" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> at
grid <inline-formula><mml:math id="M65" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msup><mml:mi>o</mml:mi><mml:mrow><mml:mo>.</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> is the time-averaged observed SM value at grid <inline-formula><mml:math id="M67" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>. The
time averaging was over the time steps in which observed SM values exist in
each grid <inline-formula><mml:math id="M68" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>. The grids and time steps for which the centered errors exist
were pooled together to create a single vector of errors for each source
dataset. This vector of errors <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:msubsup><mml:mi>e</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> was then used to calculate the
error covariance matrix <inline-formula><mml:math id="M70" display="inline"><mml:mi mathvariant="bold">A</mml:mi></mml:math></inline-formula>. Therefore, the derived weights
were optimal with regard to the locally centered errors, not the un-centered
errors <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mi>o</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>. However, this limitation had to
be accepted because the ISMN observations were too sparse to enable
estimating the biases at every grid in space, and a constant bias could not
be assumed over the large spatial domain of the study, which spans very dry
to very wet climate zones.</p>
      <p id="d1e1434">The OLC procedure was implemented in Python 3.6.3 under a CentOS Linux
environment. Different functions for calculating the error covariance matrix
in Python were compared to eliminate potential numerical instability in
matrix inversion. The results were found to be similar, but the
ShrunkCovariance function in Scikit-learn v0.21.3 (Pedregosa et al., 2011)
generated slightly better validation performance for the estimated SM than
the other tested functions. Therefore, ShrunkCovariance was selected for
calculating <inline-formula><mml:math id="M72" display="inline"><mml:mi mathvariant="bold">A</mml:mi></mml:math></inline-formula>.</p>
      <?pagebreak page4391?><p id="d1e1445">In addition to estimating SM, the OLC procedure also calculates the
associated uncertainty in the form of standard deviation (<inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> based on adjusted weights (<inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mi>k</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, adjusted source SM
datasets (<inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and the estimated SM (<inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>
following Eq. (4).
            <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M77" display="block"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:msub><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mi>k</mml:mi></mml:msub><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e1590">The adjusted weights are a function of the original weights (<inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) and a
parameter <inline-formula><mml:math id="M79" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> to have <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mi>k</mml:mi></mml:msub><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> and maintain
<inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:msub><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, following Eqs. (5) and (6).

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M82" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E5"><mml:mtd><mml:mtext>5</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfenced><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>K</mml:mi></mml:mfrac></mml:mstyle></mml:mrow><mml:mi mathvariant="italic">α</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E6"><mml:mtd><mml:mtext>6</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mfenced close="" open="{"><mml:mtable class="array" columnalign="left left"><mml:mtr><mml:mtd><mml:mn mathvariant="normal">1</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="normal">if</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">all</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">are</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">nonnegative</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>K</mml:mi><mml:mo>min⁡</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="normal">if</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">the</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">minimum</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi>w</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">is</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">negative</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula></p>
      <p id="d1e1787">The adjusted source SM datasets (<inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are linear functions
of the source SM datasets (<inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and the estimated SM (<inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> through parameters <inline-formula><mml:math id="M86" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M87" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>, where the parameter <inline-formula><mml:math id="M88" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula> is a function of the discrepancy between the observations and the estimated SM (<inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:msubsup><mml:mi>s</mml:mi><mml:mtext>e</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, following Eqs. (7)–(9).

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M90" display="block"><mml:mtable rowspacing="8.535827pt 8.535827pt" displaystyle="true"><mml:mlabeledtr id="Ch1.E7"><mml:mtd><mml:mtext>7</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msubsup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mi mathvariant="italic">β</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>.</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>.</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E8"><mml:mtd><mml:mtext>8</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi mathvariant="italic">β</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi>s</mml:mi><mml:mtext>e</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac></mml:mstyle><mml:msub><mml:mo>∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>∈</mml:mo><mml:mi>V</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:msub><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mi>k</mml:mi></mml:msub><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>.</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>.</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:msqrt></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E9"><mml:mtd><mml:mtext>9</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msubsup><mml:mi>s</mml:mi><mml:mtext>e</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mo>∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>∈</mml:mo><mml:mi>V</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mi>o</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            Here, <inline-formula><mml:math id="M91" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula> is the subset of grid and time step combinations that have observed
SM, and <inline-formula><mml:math id="M92" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> is the total number of grid time steps that have observed SM (i.e., <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mfenced open="|" close="|"><mml:mi>V</mml:mi></mml:mfenced><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec id="Ch1.S2.SS8">
  <label>2.8</label><title>EC method</title>
      <p id="d1e2204">For establishing the EC relationship, temperature and precipitation were
selected to be the constraint variables because of their significant roles
in controlling evapotranspiration from and recharge to the soil water. SM
was the target variable. For each target year, month, grid, and soil depth,
a linear regression relationship was fitted using SM anomalies as the
predictand and temperature and precipitation anomalies as the predictors.
The SM, temperature, and precipitation anomalies of each source dataset
(i.e., the datasets in the ORS, CMIP5, CMIP6, CMIP5<inline-formula><mml:math id="M94" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6, or ALL group;
Fig. 1) were calculated by removing the monthly climatology of 1981–2010.
The vectors of SM, temperature, and precipitation anomalies in each
regression relationship consisted of the anomalies for the target year,
month, and soil depth over the nine nearest grids to the target grid and over
all the source datasets. If the fitted regression slopes of both temperature
and precipitation anomalies were significant at <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>, the fitted
regression was used as the EC relationship. If either slope was
insignificant, the regression was refitted using only precipitation or
temperature anomalies as the predictor. If the refitted slope of
precipitation (temperature) anomalies was significant at <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula> and had a
lower <inline-formula><mml:math id="M97" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> value than the refitted slope of temperature (precipitation)
anomalies, the refitted regression with precipitation (temperature)
anomalies was used as the EC relationship. If neither of the refitted slopes
was significant at <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>, the EC relationship was deemed insignificant
for this year, month, grid, and soil depth. After the significant EC
relationships were obtained, the observed temperature and precipitation were
converted to anomalies relative to the monthly climatology of 1981–2010
and fed into the EC relationships to generate constrained SM anomalies.
Finally, the constrained SM anomalies were added to the mean monthly
climatology over all the source datasets to generate constrained SM values.
For the combinations of year, month, grid, and soil depth that did not have
significant EC relationships, the mean monthly climatology over all the
source datasets was used as the constrained SM values. Uncertainties in the
EC-constrained SM values were estimated using the standard deviation of the
prediction of the linear regressions, calculated by the “wls_prediction_std” function of the Python package statsmodels
(Seabold and Perktold, 2010). Uncertainties in which there were no
significant EC relationships were estimated using the standard deviation of
the source datasets.</p>
      <p id="d1e2257">The fitted EC relationships are summarized in Figs. S3 and S4 using the
average values of the significant regression coefficients and the
percentage of significant regression coefficients for temperature and
precipitation, respectively. The regression coefficients for temperature
were mostly negative and for precipitation mostly positive, which can be
explained by the fact that higher temperature causes higher evaporative loss
of water from soil, and higher precipitation causes more recharge of water
into soil. In the Sahara region, the average regression coefficients were
mostly positive for temperature (Fig. S3), which might be related to
interannual correlation between precipitation and temperature caused by the
West African monsoon (Zhang and Cook, 2014). For the ORS datasets at 30–50
and 50–100 cm, the regression coefficients were also mostly positive for
temperature (Fig. S3). The ORS datasets at these depths only represent
three sets of meteorological forcings (GLDAS NOAH025_M2.0,
CRU TS v3.20 for MsTMIP, and CRU TS v3.26 for TRENDY v7; Tables S1–S3 and S5). Therefore, the EC relationships for the ORS datasets at these depths
likely have high uncertainty. Because only small percentages of EC
relationships were significant at these depths (Figs. S3 and S4), the
counterintuitive regression coefficients were unlikely to have a large
impact<?pagebreak page4392?> on the merged product. In general, the percentages of significant EC
relationships were higher for the CMIP5, CMIP6, CMIP5<inline-formula><mml:math id="M99" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6, and ALL datasets
than for the ORS datasets, suggesting that more diverse source datasets lead
to stronger EC relationships (Figs. S3 and S4). In the preliminary
analysis, additional setups for the regression were tested, including
whether the regression should use the actual values of SM, temperature, and
precipitation or the anomalies and whether the regression should use only
the target grid or the nine nearest grids. The setup with anomalies and the nine
nearest grids was found to result in more significant regression
coefficients and better performance (results not shown).</p>
</sec>
<sec id="Ch1.S2.SS9">
  <label>2.9</label><title>Temporal concatenation and homogeneity test against in situ and gridded SM datasets</title>
      <p id="d1e2275">Because some ORS datasets were not available for the entire 1970–2016
period, separate OLC ORS, EC ORS, and EC ALL datasets were produced for
three different periods (1970–2010, 1981–2016, and 1981–2010) and were
concatenated into a continuous 1970–2016 product using a previous
intercalibration approach (Fig. 1) (Dorigo et al., 2017; Liu et al., 2011,
2012). To perform the concatenation on the estimated SM values, the merged
product for each of the three periods was decomposed into monthly
climatology and monthly anomalies, with the monthly climatology being
calculated for the overlapping period (1981–2010). Then, the anomalies of
the 1970–2010 and 1981–2010 product were rescaled to have the same
cumulative distribution function (CDF) as the anomalies of the 1981–2016
product during their overlapping period (1981–2010) using the piecewise
linear CDF matching technique (Liu et al., 2011). In the first step of the
CDF matching technique, the 0th, 5th, 10th, 20th, 30th, 40th, 50th, 60th,
70th, 80th, 90th, 95th, and 100th percentiles of the anomalies of each
product during the overlapping period were identified on their CDF curves.
In the second step, the percentiles of the 1970–2010 and 1981–2010
products were plotted against the percentiles of the 1981–2016 product. A
linear line was drawn between each two adjacent percentiles (e.g., the 5th
and 10th percentiles), resulting in 12 linear segments. In the last step,
the anomalies of the 1970–2010 and 1981–2010 datasets that fell into each
interval of percentiles (e.g., 5th–10th) were rescaled using the equations
of the linear segments. Values outside the range of the monthly anomalies
during the overlapping period were rescaled using the equation of the
closest linear segment. A graphic illustration of the CDF matching technique
can be found in Fig. 3 of Liu et al. (2011). The CDF matching was conducted
for all the months as a whole, rather than for each month separately,
because the latter setup would result in too few data points (36 data points
during 1981–2016) to robustly determine the percentiles. The rescaled
anomalies were added back to the monthly climatology of each product.
Finally, the three added-back products were concatenated by using the
1970–2010 product for 1970–1980, the 1981–2010 product for 1981–2010,
and the 1981–2016 product for 2010–2016. To concatenate the estimated SM
uncertainty of the OLC method (<inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mtext>e</mml:mtext><mml:mrow><mml:mi>t</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>; see Sect. 2.7) and the EC
method (see Sect. 2.8), the uncertainty values of 1970–2010 and 1981–2010
were directly rescaled to the 1981–2016 values using CDF matching
without prior conversion to monthly anomalies, and the rescaled uncertainty
values were concatenated like the mean values.</p>
      <p id="d1e2294">Despite the intercalibration procedure, temporal discontinuity may still
exist in the OLC ORS, EC ORS, and EC ALL products because their source
datasets were different in the 1970–1980, 1981–2010, and 1981–2016
periods. To test this possibility, a previously demonstrated homogeneity
test procedure (Su et al., 2016) was applied to determine whether
statistically significant discontinuities in mean or variance exist between
the 1970–1980 and 1981–2010 periods or between the 1981–2010 and
1981–2016 periods. The procedure involves calculating <inline-formula><mml:math id="M101" display="inline"><mml:mi mathvariant="bold-italic">Q</mml:mi></mml:math></inline-formula> values between
the time series of interest, <inline-formula><mml:math id="M102" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:math></inline-formula>, and one or multiple reference time series,
<inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>K</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, that are in the same grid as <inline-formula><mml:math id="M105" display="inline"><mml:mi>Y</mml:mi></mml:math></inline-formula>
using Eq. (10):
            <disp-formula id="Ch1.E10" content-type="numbered"><label>10</label><mml:math id="M106" display="block"><mml:mrow><mml:mi mathvariant="bold-italic">Q</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:msub><mml:mi>V</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:msub><mml:mi>V</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the linear regression coefficients
between <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M110" display="inline"><mml:mi mathvariant="bold">Y</mml:mi></mml:math></inline-formula>, and the weighting coefficient, <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msub><mml:mi>V</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, is equal to
the square of <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> if <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is positive and zero if <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is negative. Then, the procedure uses the Wilcoxon rank-sum test to
determine if the mean values of <inline-formula><mml:math id="M115" display="inline"><mml:mi mathvariant="bold-italic">Q</mml:mi></mml:math></inline-formula> are significantly different between two
time periods and the Fligner–Killeen test to determine if the variances of
<inline-formula><mml:math id="M116" display="inline"><mml:mi mathvariant="bold-italic">Q</mml:mi></mml:math></inline-formula> are significantly different between two time periods. Like in the
original study (Su et al., 2016), a time series <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was only selected
for comparison if the Pearson correlation between <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M119" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:math></inline-formula> was
greater than 0.8 and significant at a <inline-formula><mml:math id="M120" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> value of 0.01 or smaller and if at
least three <inline-formula><mml:math id="M121" display="inline"><mml:mi mathvariant="bold-italic">Q</mml:mi></mml:math></inline-formula> values could be calculated in each of the two compared time
periods. In this study, the <inline-formula><mml:math id="M122" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:math></inline-formula> time series were the time series in each
grid in a merged product. Two types of time series <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> were used. One
was the time series of original ISMN observations (i.e., not interpolated to
0–10, 10–30, 30–50, or 50–100 cm, aggregated to grid level, or split
into validation or training sets) that were located in the same grid (Sect. 2.2). The other type was the time series of the gridded SM datasets for
evaluation (Sect. 2.5) in the same grid. For the latter type, the merged
products were linearly interpolated to the same depths as the gridded SM
datasets.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e2592">Performance of the original ORS, CMIP5, and CMIP6 datasets
(box plots) and the merged products (scatter plots) for the validation set of
observations. The box plots show (from top to bottom) maximum, 75th
percentile, median, 25th percentile, and minimum. The ORS box plot includes
all the ORS datasets evaluated for their available years.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021-f02.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Evaluation against the validation set of ISMN SM observations</title>
      <p id="d1e2617">When evaluated for the whole validation set, the bias of the merged products
ranged from <inline-formula><mml:math id="M124" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.044 to 0.033, the RMSE ranged from 0.076 to 0.104 m<inline-formula><mml:math id="M125" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M126" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and the Corr ranged<?pagebreak page4393?> from 0.35 to 0.67 across the
four soil depths (Fig. 2). The merged products generally showed a smaller
magnitude of bias, smaller RMSE, and higher Corr than the source datasets
from which the products were merged (Fig. 2). The bias of both the source
and merged datasets shifted from mostly positive to mostly negative from the
shallowest to the deepest soil layer, indicating a tendency toward the
overestimation of the vertical SM gradient; the shallower soil layers also
tended to have lower RMSE and higher Corr than the deeper layers (Fig. 2).
The bias values of individual merged products were similar; the RMSE and
Corr values of the ORS-based merged products (Mean ORS, OLC ORS, EC ORS)
were better than the EC ALL product, and the RMSE and Corr values of the EC
ALL product were better than the CMIP5- or CMIP6-based merged products (EC
CMIP5, EC CMIP6, EC CMIP5<inline-formula><mml:math id="M127" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6) (Fig. 2).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e2657">The normalized bias, RMSE, and Corr among the annual climatology,
seasonal cycle, linear trends, and anomalies of the individual merged
products (Mean ORS through EC ALL) and source datasets (ORS, CMIP5, CMIP6),
as well as the global and regional datasets for evaluation (SMOS L3, SoMo, SMERGE
v2, SMOS L4, GLEAMv 3.3a). The normalization involves dividing each value by
the column-wise maximum in each panel and multiplying by 100 %, and it was
performed to prevent all the values in each column from showing the same
color. The blue number at the top of each column is the column-wise maximum
to the precision of two decimal points. The asterisk (*) indicates that the
magnitude of the bias, RMSE, or Corr of the merged product is better than
the product's source datasets in the same column (for EC CMIP5<inline-formula><mml:math id="M128" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6, the
comparison was made against the average of CMIP5 and CMIP6; for ALL, against
the average of ORS, CMIP5, and CMIP6). The displayed evaluation metrics of
the ORS, CMIP5, and CMIP6 are the average value over the individual source
datasets in each group.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021-f03.png"/>

        </fig>

      <p id="d1e2673">The merged products showed a lower magnitude of bias, lower RMSE, and higher
Corr than the source datasets across most of the land cover types (Fig. S5). The exceptions were the shallower (0–10 and 10–30 cm) soil layers
over the water bodies, evergreen needleleaf forests, and evergreen broadleaf
forests, for which the merged products produced RMSE similar to the bulk of the
source datasets (Fig. S5e–h), and the deeper (30–50 and/or 50–100 cm)
soil layers over the open shrublands, urban and built-up lands,
cropland–natural vegetation mosaics, and barren lands, for which the merged
products showed similar or even lower Corr and RMSE similar to the bulk of
the source datasets (Fig. S5e–i). Although the merged datasets
considerably overestimated the SM in the water bodies and evergreen
needleleaf forests (bias <inline-formula><mml:math id="M129" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.016 to 0.146 m<inline-formula><mml:math id="M130" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M131" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>; Fig. S5a–d),
the high Corr for these two land cover types (0.30 to 0.72 for the ORS-based
merged products, <inline-formula><mml:math id="M132" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.27 to 0.55 for the other merged products; Fig. S5i–l) indicated good ability to track the spatiotemporal variability.
The hybrid products underestimated the SM in the 0–10 cm layer of the
evergreen broadleaf forests (bias <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mo>=</mml:mo><mml:mo>-</mml:mo></mml:mrow></mml:math></inline-formula>0.174 to <inline-formula><mml:math id="M134" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.095 m<inline-formula><mml:math id="M135" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M136" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>),
the deciduous needleleaf forests (bias <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:mo>=</mml:mo><mml:mo>-</mml:mo></mml:mrow></mml:math></inline-formula>0.162 to <inline-formula><mml:math id="M138" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.055 m<inline-formula><mml:math id="M139" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and the deeper soil layers of many other land cover types
(Fig. S5a–d). The Corr values of the merged products were also very low
in the evergreen broadleaf forests (<inline-formula><mml:math id="M141" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>0.81 to 0.05; Fig. S5i–l). Similar
to<?pagebreak page4394?> the global level, the ORS-based merged SM tended to outperform the EC
ALL and the CMIP5- and CMIP6-based merged products (Fig. S5). The three
merging methods performed similarly over most land cover types, but the OLC
method (OLC ORS product) had lower RMSE and higher Corr than the other two
methods (Mean ORS and EC ORS products) over the urban and built-up lands,
crop–natural vegetation mosaic, and barren land cover types (Fig. S5).</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Evaluation against global and regional gridded SM datasets</title>
      <p id="d1e2807">The evaluating results for the merged SM products against independent or
semi-independent gridded SM datasets were highly dependent on the
evaluation dataset. For example, as shown above each panel in Fig. 3, the
maximum bias, RMSE, and Corr among the merged and source datasets in the
evaluation of climatology against the SMOS L4 0–100 cm dataset were 0.12 m<inline-formula><mml:math id="M142" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M143" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, 0.16 m<inline-formula><mml:math id="M144" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M145" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and 0.16,
respectively, whereas
the same metrics were <inline-formula><mml:math id="M146" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.02 m<inline-formula><mml:math id="M147" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M148" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, 0.11 m<inline-formula><mml:math id="M149" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M150" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and
0.81 in the evaluation against the GLEAM v3.3a 0–100 cm dataset. To
emphasize the differences<?pagebreak page4395?> between the merged and the source datasets rather
than across the evaluation datasets, Fig. 3 displays the evaluation
metrics in normalized units, with the maximum value across the merged and
source datasets of each matric set to 100 %. Within each column of each
panel in Fig. 3, the merged products generally had lower RMSE and higher
Corr than the average RMSE and Corr of corresponding source datasets, and
the ORS-based products generally had lower RMSE and higher Corr than the
CMIP5- or CMIP6-based products. The magnitudes of bias were often similar
between the merged and the source datasets for climatology, trend, and
anomalies, but the bias in seasonality of the merged datasets was generally
better than that of the source datasets.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Homogeneity test against in situ and gridded SM datasets</title>
      <p id="d1e2910">Because of the screening criteria (Sect. 2.9), few in situ observational
time series were available for conducting the homogeneity test. Therefore,
homogeneity tests using in situ SM data were only performed for between 1
and 10 grids (exact numbers not shown) for the various combinations of time
periods (between 1970–1980 and 1981–2010 or between 1981–2010 and
1981–2016) and depths (0–10, 10–30, 30–50, and 50–100 cm). None of
these test results showed significant discontinuity, but the scarcity of
tested grids rendered the finding inconclusive.</p>
      <p id="d1e2913">With the gridded SM datasets, the majority of the global grids satisfied the
screening criteria. The left two columns of Fig. 6 show that for all the
merged products, no significant discontinuity in mean existed between the
time periods 1970–1980 and 1981–2010. The right two columns of Fig. 6
show that discontinuity in variance existed between the time periods
1970–1980 and 1981–2010, but the percentages of discontinuous grids were
similar between the concatenated products (OLC ORS, EC ORS, EC ALL) and the
other products that were based on the same source datasets throughout
1970–2016 (Mean ORS, EC CMIP5, EC CMIP6, and EC CMIP5<inline-formula><mml:math id="M151" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6). Although the
fitting of a separate regression for each year and month in the EC procedure
(Sect. 2.8) might have introduced inhomogeneities into to the EC-based
products, the unweighted averaging method behind mean ORS did not have the
same concern. However, the Mean ORS product had the highest percentages of
discontinuous grids among all the merged products. These results indicate
that the identified discontinuities were more likely caused by systematic
differences between the evaluation datasets (SMERGE v2 and GLEAM v3.3a) and
the source datasets for merging (ORS, CMIP5, and CMIP6) rather than the
concatenation procedure. The homogeneity test between 1981–2010 and
2011<inline-formula><mml:math id="M152" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>2016 had similar results as the test between 1970–1980 and
1981–2010. Virtually no discontinuities in mean were identified, and
similar percentages of discontinuous grids were identified in the
concatenated products (OLC ORS, EC ORS, EC ALL) and the others (Figs. S6–S7).</p><?xmltex \hack{\newpage}?>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Evaluation against selected drought events</title>
      <p id="d1e2939">Lower values in scPDSI and SM anomalies are indicative of drier conditions,
and higher values indicate wetter conditions. For the United States
1985–1992 drought, the scPDSI, 0–10 cm SM anomalies, and 0–100 cm SM
anomalies all showed gradual expansion of drought from 1985 to 1988 and
gradual alleviation from 1989 to 1992, with the most severe drought occurring in the northern Great Plains in 1988 (Figs. 4 and S9). For the
Australia 2002–2009 drought, the ORS-based 0–10 and 0–100 cm SM
anomalies captured the pan-Australian drought shown by the scPDSI in
2002–2003, 2005, and 2007–2009, as well as the eastern Australian drought in
2004 and 2006 (Figs. S8 and S10). The CMIP5- and CMIP6-based SM anomalies
also mostly captured the Australian drought patterns but did not capture the
pan-Australian drought in 2007 and 2008 (Figs. S8 and S10).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e2944">The <inline-formula><mml:math id="M153" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values of the homogeneity test on the discontinuity in the mean
(via Wilcoxon rank-sum test) and in the variance (via Fligner–Killeen test)
between the time periods 1970–1980 and 1981–2010 in the merged
products. The SoMo datasets could not be used to evaluate discontinuity
between these two time periods because SoMo only spans 2000 to 2019. The red
numbers beneath each panel indicate the percentage of grids that had
significant discontinuity (<inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>) in the panel. Blank grids exist
because the SMERGE v2 or GLEAMv3.3a 0–100 cm data in these grids did not
satisfy the screening criteria for the homogeneity test.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021-f04.png"/>

        </fig>

      <p id="d1e2972">To better quantify the similarity between the scPDSI and SM anomalies,
Spearman correlations (Hollander et al., 2013) were calculated and are shown
above each panel in Figs. 4 and S8–S10. The Spearman correlation metric
was deemed suitable for measuring the similarity because the magnitudes of
scPDSI, which is a unitless standardized index, and of SM anomalies
(m<inline-formula><mml:math id="M155" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M156" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) are not comparable. The Spearman correlation is not sensitive
to magnitudes because the metric is calculated using the rank of each
<inline-formula><mml:math id="M157" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> value among all the <inline-formula><mml:math id="M158" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> values and the rank of each <inline-formula><mml:math id="M159" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> value among all the
<inline-formula><mml:math id="M160" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> values for an <inline-formula><mml:math id="M161" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>–<inline-formula><mml:math id="M162" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> pair of time series (Hollander et al., 2013). The Spearman
correlations between scPDSI and the ORS-based SM anomalies were between
0.698 and 0.890 for the United States (Figs. 4 and S9) and 0.427 to 0.872
for Australia (Figs. S8 and S10). For the purely CMIP5- or CMIP6-based
products, the Spearman correlations were between <inline-formula><mml:math id="M163" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.147 and 0.850 for the
United States (Figs. 4 and S9) and 0.005 to 0.872 for Australia (Figs. S8 and S10). The EC ALL product, which combines ORS, CMIP5, and CMIP6 source
datasets, had Spearman correlations that tended to be in the middle of the
ORS- and the CMIP5- and CMIP6-based products. The better performances of the
ORS-based than the CMIP5- and CMIP6-based merged products were consistent
with the evaluation results against in situ observations (Figs. 2 and S5).</p>
</sec>
<sec id="Ch1.S3.SS5">
  <label>3.5</label><title>The spatial and temporal characteristics of the merged SM products</title>
      <p id="d1e3055">Because all the merging procedures essentially involve averaging over
multiple source datasets (see Sect. 2.1, 2.7, and 2.8), the variability and
trends of the merged products may be damped compared with the source
datasets because of mutual cancellation. To ascertain whether this is the
case, power spectral densities were calculated from regionally averaged time
series of the merged products and the source datasets (i.e., ORS, CMIP5, and
CMIP6), and they are compared in Figs. 6 and S10. The regions used were<?pagebreak page4396?> the
Intergovernmental Panel on Climate Change (IPCC) Special Report on Managing
the Risks of Extreme Events and Disasters to Advance Climate Change
Adaptation (SREX) regions (Field et al., 2012; Fig. S11). The power
spectral densities of the Mean ORS, OLC ORS, and EC ORS products very rarely
exceeded the boundaries of the source datasets (e.g., panel c2 of Fig. 6,
panel j0 of Fig. S12), showing that at least at a regional level, these
merged products did not underestimate the temporal variability in SM. The
variabilities of the EC CMIP5, EC CMIP6, EC CMIP5<inline-formula><mml:math id="M164" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6, and EC ALL products
were generally within the boundaries of the source datasets at the 0<inline-formula><mml:math id="M165" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>10,
10–30, and 30–50 cm depths. However, the variability of these products
tended to be too high at the month-to-month scale (i.e., high-frequency,
short-period end of the spectrum) and too low at the decadal scale (i.e.,
low-frequency, long-period end of the spectrum) at the 50–100 cm depth
(Figs. 6 and S12).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e3074">The annual mean scPDSI anomalies (no unit) and annual mean SM
anomalies (m<inline-formula><mml:math id="M166" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> during the US drought in 1985–1992. The numbers
above the plots are the Spearman correlation between the anomalies of the
merged product and scPDSI, and the asterisk (*) indicates that the
correlation is significant at <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo></mml:mrow></mml:math></inline-formula> 0.05. The anomalies were calculated
relative to the climatology of 1970–2016. The SM anomalies are for
0–10 cm.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021-f05.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e3119">The power spectral density of the spatially averaged time series
of monthly SM over selected IPCC SREX regions (Field et al., 2012). The
power spectral densities of the source datasets (ORS, CMIP5, and CMIP6) were
calculated for each individual source dataset, and the displayed envelopes
encompass the minimum to maximum ranges. Abbreviations: W – west, N –
north, S – south, E – east.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021-f06.png"/>

        </fig>

      <p id="d1e3129">The long-term trends in the regionally averaged SM of the merged products
showed ranges that were centered around similar values as the source
datasets and were within the ranges of the source datasets except for a few
cases (EC ORS 10–30 cm Sahara and West Asia, 30–50 cm Sahara) (Fig. S13). For both the merged products and the source datasets, the most
negative trends occurred in northeastern Brazil for 0–10, 10–30, and
30–50 cm and in southern Australia–New Zealand for 50–100 cm, and the most
positive<?pagebreak page4397?> values occurred in Alaska–northwestern Canada, southeastern South
America, and northern Asia (Fig. S13). Therefore, the merging procedure was
unlikely to have caused underestimation of the trends of the merged
products. The occasional underestimations by the EC ORS dataset might be
caused by uncertainty in the precipitation and temperature trends in arid
regions. In the Sahara and West Asian regions, the CRU TS v4.03 dataset,
which was used as an observational constraint in the EC procedure (Sect. 2.4),
had either more positive or more negative trends in precipitation than the
drivers of the source datasets and mostly less positive trends in
temperature. Since temperature was positively correlated with the 10–30 cm
and 30–50 cm SM in the EC relationship in the Sahara and West Asian
regions (Fig. S3), the negative bias in temperature trends would be
consistent with the negative bias in SM trends.</p>
      <p id="d1e3132">The SM climatology showed reasonable spatial patterns in all the merged
products, with the lowest values occurring in the arid regions of the Sahara,
western United States, central Asia, and interior Australia and the highest
values occurring in the high latitudes and tropical rainforest regions
(Fig. S15). The OLC merging method caused an increase in absolute SM
values, especially in the 0–10 and 10–30 cm soil layers, relative to
unweighted averaging (Fig. S14, first and second rows). The EC method did not
induce a similar increase (Fig. S15, first and third rows), which was expected
because the procedure did not change the 1981–2010 climatology of the
source datasets (Sect. 2.8). The Mean ORS, EC ORS, EC CMIP5<inline-formula><mml:math id="M169" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6, and EC ALL
products showed little difference in SM climatology across the soil layers,
but the OLC ORS and EC CMIP6 products showed decreased SM from the shallower
to deeper soil layers (Fig. S15). The EC CMIP5 had the highest SM values
at the 30–50 cm soil layer (Fig. S15).</p>
      <p id="d1e3142">The timings of annual maximum SM were mostly consistent across different
merged products, with exceptions occurring in northeastern Asia, eastern
Canada, and Alaska (Fig. S16). The maximum SM occurred around February in
the southern subtropics, southern North America, and the Mediterranean;
around September in the monsoonal regions of Africa and southern and eastern
Asia; and around May in northern North America and most of Eurasia. At
deeper<?pagebreak page4398?> soil layers (30–50 and 50–100 cm), the CMIP5- and CMIP6-based
merged SM showed an earlier occurrence of the annual maximum SM (around June) than
the other merged datasets (around September) in eastern Asia.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><?xmltex \def\figurename{Figure}?><label>Figure 7</label><caption><p id="d1e3147">The variable that has the best explanatory power for the
interannual variability in SM in each grid for the different merged
products and depths. The best explanatory power was defined as having the
highest absolute partial correlation in the partial correlation analysis
between annual mean SM and the annual mean meteorological variables.
Hatching indicates that the partial correlation of the best explanatory
variable was significant at <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo></mml:mrow></mml:math></inline-formula> 0.05.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://essd.copernicus.org/articles/13/4385/2021/essd-13-4385-2021-f07.png"/>

        </fig>

      <p id="d1e3166">All the merged products showed increasing SM trends in the northern
high latitudes, central Eurasia, and northern Africa and decreasing trends
in eastern South America, southern Africa, and eastern Australia (Fig. S17). The CMIP5- and CMIP6-based merged datasets showed greater drying in
eastern North America and Europe than the ORS-based estimates and less
drying near the North China Plain than the ORS-based products. A major
difference existed between the CMIP6-based merged products (EC CMIP6, EC
CMIP5<inline-formula><mml:math id="M171" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6, EC ALL) and the other products in northeastern Asia in the
50–100 cm soil layer; the former displayed strong drying trends and
the latter did not. The estimated uncertainty intervals of the merged
products were slightly larger for the OLC method than the unweighted
standard deviation of the source datasets, and both were considerably larger
than the EC method (Fig. S18). For all the methods, the uncertainty
intervals were greater in the temperate regions than in the arid regions,
which was consistent with the higher SM absolute values in the temperature
regions.</p>
</sec>
<?pagebreak page4399?><sec id="Ch1.S3.SS6">
  <label>3.6</label><title>Sensitivity to precipitation, air temperature, and surface downwelling shortwave radiation</title>
      <p id="d1e3184">Based on partial correlation, precipitation was the dominant control of SM
variability in the ORS-based products and in EC ALL over most of the globe
(Fig. 7), and it generally had significant positive partial correlations
(Fig. S19). Precipitation was also the dominant control of SM variability
in the CMIP5- and CMIP6-based products in the 0–10 and 10–30 cm layers,
but not in the 30–50 and 50–100 cm layers (Fig. 7), where the partial
correlations between precipitation and SM were insignificant across most of
the global land surface (Fig. S18). In all the merged products, air
temperature had significant negative partial correlations with SM in
the southwestern United States, eastern South America, southern Africa, the
Mediterranean, and Australia (Fig. S20). Some significantly positive
correlations between temperature and SM existed in the Sahara, central Asia,
and Tibetan Plateau regions (Fig. S20). The primarily negative
correlations were consistent with the physical expectation that higher
temperatures induce higher evaporative demand and thus lower SM. The CMIP5-
and CMIP6-based products had stronger negative correlations between
temperature and SM than the ORS-based products in Europe, which may explain
the former products' more negative trends in SM in this<?pagebreak page4400?> region (Fig. S17).
Downwelling shortwave radiation was rarely a dominant control of SM
variability in the ORS-based products (Fig. 7). For the CMIP5- and
CMIP6-based products, downwelling shortwave radiation was only a dominant
control of SM variability in some of the midlatitude to high-latitude and tropical
rainforest regions (Fig. 7), which were consistent with the distribution
of light-limited ecosystems.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Discussion</title>
      <p id="d1e3196">Overall, the merged SM products showed better performances than their source
datasets (Sect. 3.1 and 3.2), temporal homogeneity (Sect. 3.3), the ability
to capture large-scale drought events (Sect. 3.4), reasonable spatiotemporal
patterns (Sect. 3.5), and reasonable climatic response characteristics
(Sect. 3.6) across the globe and multiple soil layers. The ranges of
performance metrics of the new datasets against in situ data (Figs. 2 and S5) were broadly within the estimates reported by previous SM evaluations,
although making a strict comparison is difficult because of the widely
different spatiotemporal coverages and resolutions (Beck et al., 2021;
Karthikeyan et al., 2017; Li et al., 2020b; Wang et al., 2021a; Yuan and
Quiring, 2017). These results demonstrated that the merging procedures
(unweighted averaging, OLC, EC) used were effective in creating relatively
accurate long-term multilayer SM data at the global scale.</p>
      <p id="d1e3199">Regarding the three merging methods, the OLC method only showed better
performance than unweighted averaging over the urban and built-up lands,
crop–natural vegetation mosaic, and barren land cover types (Fig. S5),
which may be a benefit rendered by the overrepresentation of these land
cover types in the in situ observations (Fig. S1). The ISMN stations are
very sparse (Fig. S1), and a previous study suggested that denser
observations may lead to a better-performing merged product (Gruber et al.,
2018). In the future, data sources such as FLUXNET and local SM networks
that are not included in the ISMN may be exploited to improve the OLC ORS
product. Future extension of the OLC method may aim to account for the
spatial representativeness of individual stations (Molero et al., 2018) and
to test alternative error estimation methods such as extended
collocation (Gruber et al., 2016). The EC method showed similar performance
as the unweighted averaging when applied to the ORS source datasets, which
may be because the meteorological forcings for these datasets were already
realistic (Table S5). However, the effectiveness of the EC method was clear
when applied to the online CMIP5 and CMIP6 simulations. Despite such
EC-based improvement, the ORS-based merged products tended to perform better
than the EC CMIP5, CMIP6, CMIP5<inline-formula><mml:math id="M172" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>6, and ALL products (Figs. 2 and S5).
The current EC procedure used simple linear regression and only
temperature and precipitation as constraint variables (Sect. 2.8). In
addition to temperature and precipitation, the SM was influenced by other
atmospheric and land conditions (e.g., wind, leaf area and stomata closure,
snow cover and melt, groundwater). Therefore, future studies may achieve
better EC outcomes by incorporating more influencing factors into the EC
procedure and by using nonlinear regression methods such as machine
learning. Another drawback of the current EC method is the low uncertainty
interval (Fig. S18), which is likely an underestimation. Whereas the OLC
method accounts for the difference between in situ and source datasets in
estimating the uncertainty interval (Sect. 2.7), the EC method does not.
Future studies should also aim to better incorporate this information into
the estimation of the EC-based uncertainty interval and to better account for
the structural uncertainty introduced by the regression form and limited
range of predictors in the EC procedure.</p>
      <p id="d1e3209">The high performance variability of merged products across space (Fig. S5)
is consistent with previous studies (Beck et al., 2021; Karthikeyan et al.,
2017; Li et al., 2020b; Wang et al., 2021a; Yuan and Quiring, 2017). The
high RMSEs of the merged products in the shallower soil layers across the
water bodies and evergreen needleleaf forests (Fig. S5e–h) were likely
caused by the high positive bias in these land cover types (Fig. S5a–d)
since the corresponding Corr values were relatively high (Fig. S5i–l).
The positive bias over water bodies may be caused by inaccurate land–water
classification at the resolution of the source datasets (Tables S1–S4). The
positive and negative bias over the forested land cover types in
high-latitude and tropical regions (e.g., evergreen needleleaf forests,
evergreen broadleaf forests, and deciduous needleleaf forests; Fig. S2)
may be due to biases in evapotranspiration and leaf area index in the source
LSMs, reanalysis, and ESMs (Tables S2–S4), and it may be further related to processes
such as rooting depth and hydraulic redistribution (Pan et al., 2020; Wang
et al., 2021b). Low Corr occurred over some land cover types in high
latitudes, semi-arid to arid regions, and urban areas (e.g., open
shrublands, urban and built-up lands, cropland–natural vegetation mosaics,
and barren lands; Fig. S2). In the high latitudes, the low Corr may be
associated with inadequate frozen soil processes in the source LSMs,
reanalysis, and ESMs (Andresen et al., 2020). In the semi-arid to arid
regions, the low Corr may be due to random errors in SM observations and
simulated values, which would be comparatively large for low SM values. In
urban areas, the low performance may be caused by the radio frequency
interference of satellite observations (Wang et al., 2012) and inadequacies
in the representation of urban areas at the resolution of the source model
products (Tables S2–S4).</p>
      <p id="d1e3212">When evaluated against the global and regional gridded datasets that were
not used in the merging, the merged products showed the highest RMSE in
climatology and lowest Corr in spatial trends (Fig. 3; see the blue
numbers above each panel). Such results were likely because the climatology
SM values had higher magnitudes than the seasonal and interannual anomalies
or trends, and the historical SM trends<?pagebreak page4401?> were highly uncertain. The low
similarity between the SMOS L3–L4 and the synthesized SM products may be
caused by the short overlapping period (2010–2016, Table 1). A previous
study also found systematic differences between the climatology of
satellite-observed and simulated SM (Piles et al., 2019). The high
similarity between the SoMo.ml and GLEAM v3.3a root zone SM datasets and the
merged products may be because the former depend on the same ISMN stations,
from which the OLC ORS product was derived (Sect. 2.2), and the latter were
from the same reanalysis as the GLEAM v3.3a surface dataset used in the
merging (Table S2). In general, because these evaluation datasets are not
ground truths like in situ observations, the identified differences in
evaluation metrics should not be viewed as an absolute indicator of
unreliability in the merged products. Similarly, the benchmarking against
scPDSI (Sect. 3.3) only provided qualitative rather than quantitative
indicators of performance for the merged products because scPDSI is
essentially a different variable from SM.</p>
      <p id="d1e3216">The vertical gradient in bias (Figs. 2, S5), the high uncertainty in
the vertical gradient in the climatology of merged SM (Fig. S15), and the
divergent trends in the 50–100 cm SM in northeastern Asia across the merged
products (Fig. S17) point to the need to reduce uncertainties in the vertical
distribution and dynamics of SM in the merged products. The high SM values
for the HadGEM2-CC and HadGEM-ES datasets may be the reason why the highest
SM occurred in the 30–50 cm layer in the EC CMIP5 product (Fig. S22). All
the source datasets for the EC CMIP6 SM showed negative trends in northeastern
Asia in the 50–100 cm soil layer, but this feature does not exist in the
original ORS or CMIP5 datasets (results not shown). All the source datasets
do not have consistent SM vertical gradients, with the maximum value falling
at either the surface, deepest, or intermediate soil layers (Fig. S22).
Such vertical inconsistencies may be related to inconsistencies in the
vertical discretization of the soil column (Tables S2–S4), soil properties
in each layer, modeling of lateral flow and drainage, or other factors
(e.g., Balsamo et al., 2009; Best et al., 2011; Melton et al., 2019).
Previous regional or global SM evaluations (e.g., Beck et al., 2021;
Karthikeyan et al., 2017; Li et al., 2020b; Wang et al., 2021a; Yuan and
Quiring, 2017) rarely focused on the performance on vertical gradient, and
such a limitation should be better addressed in future analyses and dataset
development.</p>
      <p id="d1e3219">The temporal homogeneity test showed that discontinuity in variance existed
in all the merged products, which may arise from several sources. The
reference datasets, SMERGE v2 and GLEAM v3.3a 0–100 cm, were not perfectly
homogeneous because both datasets assimilate satellite observations made by
different instruments over time (Tobin et al., 2019; Martens et al., 2017).
The observation systems assimilated by the reanalysis datasets of the ORS
(Table S2) also change over time, thereby leading to potential
discontinuities in the ORS-based products. Such limitations cannot be
eliminated considering the paucity of records before the satellite era and
the continuous evolution of observational and reanalysis systems.
Discontinuity in a statistical sense can also be caused by changes in land
use and other types of disturbances between two time periods, but such
apparent discontinuity reflects real-world situations.</p>
      <p id="d1e3222">The SM seasonality in the merged products (Fig. S16) was broadly
consistent with previously reported timing of annual maximum precipitation
(Knoben et al., 2019). Differences at the deeper soil layers between the
CMIP5- and CMIP6-based merged products and the ORS-based products may be
partially caused by the lack of consideration of lagged SM response to
meteorological drivers, especially at the deeper layers, in the EC method
(Sect. 2.8). Uncertainty at the deeper layers would also be high because
fewer source datasets were available than for the shallower layers (Tables S1–S4). The SM trends (Fig. S17) were broadly consistent with previous
reports on historical changes in agricultural droughts (Dai and Zhao, 2017;
Liu et al., 2019; Lu et al., 2019).</p>
      <p id="d1e3225">The primarily positive partial correlations between the SM and
precipitation, and the primarily negative partial correlations between the
SM and air temperature or shortwave radiation, were consistent with
expectations from physical processes. The existence of significant positive
partial correlations between air temperature and SM might be caused by less
precipitation falling as snow at higher temperatures in the cold Tibetan
Plateau and might be caused by stronger land–atmosphere feedbacks at higher
temperatures in the Sahara and central Asia. The weaker relationships
between precipitation and SM in the CMIP5- and CMIP6-based merged products
than those in the ORS-based products (Fig. S15) may be because the EC
method did not fully explain the temporal mismatch between the source
datasets and the real world because the relationships were insignificant in
various grids and time steps (Fig. S4). The stronger relationships between
air temperature and SM of the CMIP5- and CMIP6-based merged products than
those of the ORS-based products (Fig. S16) may be partially caused by
compensation for the weaker relationships between shortwave radiation and SM
(Fig. S17). Because the temperature and shortwave radiation tend to be
highly correlated, shortwave radiation was not considered a predictor for
the EC method in the present research.</p>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Data availability</title>
      <p id="d1e3237">The seven SM products, including the estimated SM values and uncertainty
intervals, are available from
<ext-link xlink:href="https://doi.org/10.6084/m9.figshare.13661312.v1" ext-link-type="DOI">10.6084/m9.figshare.13661312.v1</ext-link> (Wang and Mao, 2021). The
files are in NetCDF4 format.</p>
</sec>
<sec id="Ch1.S6">
  <label>6</label><title>Code availability</title>
      <p id="d1e3251">The source codes that were used to create all the SM datasets are available at <uri>https://bitbucket.org/ywang11/soil_moisture_merge/src/master/</uri> (Wang, 2020).</p>
</sec>
<?pagebreak page4402?><sec id="Ch1.S7" sec-type="conclusions">
  <label>7</label><title>Conclusions</title>
      <p id="d1e3265">This study achieved the goal of creating long-term, gap-free, multilayer SM
products (1970–2016, 0.5<inline-formula><mml:math id="M173" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>, monthly, 0–10, 10–30, 30–50, and
50–100 cm) that displayed realistic temporal evolutions and spatial
patterns and outperformed the source SM datasets in the systematic
evaluations against independent in situ measurements and semi-independent
gridded SM estimates. Three new SM products (Mean ORS, OLC ORS, and EC ORS)
developed from the satellite observations, reanalysis, and offline LSMs were
shown to perform better than those based on the ESMs. Therefore, they are
recommended for future applications, such as the detection and attribution
of historical changes of SM and associated extreme events, providing the
initial and boundary conditions for atmospheric models, benchmarking various
types of models, and managing drought and flood risks. By comparing three
different merging methods (unweighted averaging, OLC, and EC), this study
further denoted that the OLC method may require more in situ observations to
exceed the unweighted averaging, and that linear regression-based EC with a
limited range of un-lagged predictors was inadequate in correcting all the
ESM errors. Future SM developments may aim to assemble more in situ SM
datasets and to implement other advanced fusion algorithms (e.g., extended
collocation, machine learning).</p>
</sec>

      
      </body>
    <back><app-group>
        <supplementary-material position="anchor"><p id="d1e3276">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/essd-13-4385-2021-supplement" xlink:title="pdf">https://doi.org/10.5194/essd-13-4385-2021-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e3287">JM conceived the research; YW and JM performed the analyses and
drafted the figures; YW and JM wrote the first draft of the paper;
MJ, FMH, XS, SDW, and YD reviewed and edited the paper
before submission. All authors made substantial contributions to the
discussion of content.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e3293">The authors declare that they have no conflict of interest.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e3299">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e3305">The Dai self-calibrated Palmer Drought Severity Index data were provided by the
NOAA/OAR/ESRL PSL, Boulder, Colorado, USA, from their website at
<uri>https://psl.noaa.gov/</uri> (last access: 30 July 2021). The GSWP3 meteorological data were downloaded as
prepared historical input datasets for ISIMIP3a. For their roles in
producing, coordinating, and making available the ISIMIP input data and
impact model output, we acknowledge the modeling groups, the ISIMIP sector
coordinators, and the ISIMIP cross-sectoral science team.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e3313">This research has been supported by an Oak Ridge National Laboratory (ORNL) subcontract (4000169153) and the Reducing Uncertainties in Biogeochemical Interactions through Synthesis and Computation Science Focus Area (RUBISCO SFA) project in the Earth and Environmental Systems Sciences Division (EESSD) of the Biological and Environmental Research (BER) office in the US Department of Energy (DOE) Office of Science. ORNL is managed by UT-BATTELLE, LLC, for the DOE under contract no. DE-AC05-00OR22725.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e3320">This paper was edited by Qingxiang Li and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><?label 1?><mixed-citation>
Al Bitar, A., Kerr, Y. H., Merlin, O., Cabot, F., Wigneron, J.-P.,
Froissard, F., Suquia, D., Petitpa, A., Piguet, B., and Martin, E.: Global
drought index from SMOS soil moisture, in: IEEE International Geoscience and
Remote Sensing Symposium, IGARSS, Melbourne, Australia, 2013.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><?label 1?><mixed-citation>Al Bitar, A., Mialon, A., Kerr, Y. H., Cabot, F., Richaume, P., Jacquette, E., Quesney, A., Mahmoodi, A., Tarot, S., Parrens, M., Al-Yaari, A., Pellarin, T., Rodriguez-Fernandez, N., and Wigneron, J.-P.: The global SMOS Level 3 daily soil moisture and brightness temperature maps, Earth Syst. Sci. Data, 9, 293–315, <ext-link xlink:href="https://doi.org/10.5194/essd-9-293-2017" ext-link-type="DOI">10.5194/essd-9-293-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><?label 1?><mixed-citation>Andresen, C. G., Lawrence, D. M., Wilson, C. J., McGuire, A. D., Koven, C., Schaefer, K., Jafarov, E., Peng, S., Chen, X., Gouttevin, I., Burke, E., Chadburn, S., Ji, D., Chen, G., Hayes, D., and Zhang, W.: Soil moisture and hydrology projections of the permafrost region – a model intercomparison, The Cryosphere, 14, 445–459, <ext-link xlink:href="https://doi.org/10.5194/tc-14-445-2020" ext-link-type="DOI">10.5194/tc-14-445-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><?label 1?><mixed-citation>Balsamo, G., Beljaars, A., Scipal, K., Viterbo, P., van den Hurk, B.,
Hirschi, M., and Betts, A. K.: A revised hydrology for the ECMWF model:
Verification from field site to terrestrial water storage and impact in the
integrated forecast system, J. Hydrometeorol., 10, 623–643,
<ext-link xlink:href="https://doi.org/10.1175/2008JHM1068.1" ext-link-type="DOI">10.1175/2008JHM1068.1</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><?label 1?><mixed-citation>Balsamo, G., Albergel, C., Beljaars, A., Boussetta, S., Brun, E., Cloke, H., Dee, D., Dutra, E., Muñoz-Sabater, J., Pappenberger, F., de Rosnay, P., Stockdale, T., and Vitart, F.: ERA-Interim/Land: a global land surface reanalysis data set, Hydrol. Earth Syst. Sci., 19, 389–407, <ext-link xlink:href="https://doi.org/10.5194/hess-19-389-2015" ext-link-type="DOI">10.5194/hess-19-389-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><?label 1?><mixed-citation>
Beaudoing, H., Rodell, M., and NASA/GSFC/HSL: GLDAS Noah Land Surface Model
L4 3 hourly 0.25 x 0.25 degree V2.0, Goddard Earth Sciences Data and
Information Services Center (GES DISC), Greenbelt, MD, USA, 2019.</mixed-citation></ref>
      <?pagebreak page4403?><ref id="bib1.bib7"><label>7</label><?label 1?><mixed-citation>Beck, H. E., Pan, M., Miralles, D. G., Reichle, R. H., Dorigo, W. A., Hahn, S., Sheffield, J., Karthikeyan, L., Balsamo, G., Parinussa, R. M., van Dijk, A. I. J. M., Du, J., Kimball, J. S., Vergopolan, N., and Wood, E. F.: Evaluation of 18 satellite- and model-based soil moisture products using in situ measurements from 826 sensors, Hydrol. Earth Syst. Sci., 25, 17–40, <ext-link xlink:href="https://doi.org/10.5194/hess-25-17-2021" ext-link-type="DOI">10.5194/hess-25-17-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><?label 1?><mixed-citation>Best, M. J., Pryor, M., Clark, D. B., Rooney, G. G., Essery, R. L. H., Ménard, C. B., Edwards, J. M., Hendry, M. A., Porson, A., Gedney, N., Mercado, L. M., Sitch, S., Blyth, E., Boucher, O., Cox, P. M., Grimmond, C. S. B., and Harding, R. J.: The Joint UK Land Environment Simulator (JULES), model description – Part 1: Energy and water fluxes, Geosci. Model Dev., 4, 677–699, <ext-link xlink:href="https://doi.org/10.5194/gmd-4-677-2011" ext-link-type="DOI">10.5194/gmd-4-677-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><?label 1?><mixed-citation>Bishop, C. H. and Abramowitz, G.: Climate model dependence and the replicate
Earth paradigm, Clim. Dynam., 41, 885–900,
<ext-link xlink:href="https://doi.org/10.1007/s00382-012-1610-y" ext-link-type="DOI">10.1007/s00382-012-1610-y</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><?label 1?><mixed-citation>Dai, A. and Zhao, T.: Uncertainties in historical changes and future
projections of drought. Part I: estimates of historical drought changes,
Clim. Change, 144, 519–533, <ext-link xlink:href="https://doi.org/10.1007/s10584-016-1705-2" ext-link-type="DOI">10.1007/s10584-016-1705-2</ext-link>,
2017.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><?label 1?><mixed-citation>Dai, A., Trenberth, K. E., and Qian, T.: A global dataset of Palmer Drought
Severity Index for 1870–2002: Relationship with soil moisture and effects
of surface warming, J. Hydrometeorol., 5, 1117–1130,
<ext-link xlink:href="https://doi.org/10.1175/jhm-386.1" ext-link-type="DOI">10.1175/jhm-386.1</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><?label 1?><mixed-citation>Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P.,
Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P.,
Bechtold, P., Beljaars, A. C. M., van de Berg, I., Biblot, J., Bormann, N.,
Delsol, C., Dragani, R., Fuentes, M., Greer, A. J., Haimberger, L., Healy, S.
B., Hersbach, H., Holm, E. V., Isaksen, L., Kallberg, P., Kohler, M.,
Matricardi, M., McNally, A. P., Mong-Sanz, B. M., Morcette, J.-J., Park,
B.-K., Peubey, C., de Rosnay, P., Tavolato, C., Thepaut, J. N., and Vitart,
F.: The ERA-Interim reanalysis: Configuration and performance of the data
assimilation system, Q. J. Roy. Meteorol. Soc., 137, 553–597, <ext-link xlink:href="https://doi.org/10.1002/qj.828" ext-link-type="DOI">10.1002/qj.828</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><?label 1?><mixed-citation>de Rosnay, P., Drusch, M., Vasiljevic, D., Balsamo, G., Albergel, C., and
Isaksen, L.: A simplified Extended Kalman Filter for the global operational
soil moisture analysis at ECMWF, Q. J. Roy. Meteor. Soc., 139, 1199–1213,
<ext-link xlink:href="https://doi.org/10.1002/qj.2023" ext-link-type="DOI">10.1002/qj.2023</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><?label 1?><mixed-citation>Dirmeyer, P. A., Gao, X., Zhao, M., Guo, Z., Oki, T., and Hanasaki, N.:
GSWP-2: Multimodel analysis and implications for our perception of the land
surface, B. Am. Meteorol. Soc., 87, 1381–1398,
<ext-link xlink:href="https://doi.org/10.1175/bams-87-10-1381" ext-link-type="DOI">10.1175/bams-87-10-1381</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><?label 1?><mixed-citation>Dorigo, W., de Jeu, R., Chung, D., Parinussa, R., Liu, Y., Wagner, W., and
Fernández-Prieto, D.: Evaluating global trends (1988–2010) in harmonized
multi-satellite surface soil moisture, Geophys. Res. Lett., 39, L18405,
<ext-link xlink:href="https://doi.org/10.1029/2012GL052988" ext-link-type="DOI">10.1029/2012GL052988</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><?label 1?><mixed-citation>Dorigo, W., Wagner, W., Albergel, C., Albrecht, F., Balsamo, G., Brocca, L.,
Chung, D., Ertl, M., Forkel, M., Gruber, A., Haas, E., Hamer, P. D.,
Hirschi, M., Ikonen, J., de Jeu, R., Kidd, R., Lahoz, W., Liu, Y. Y.,
Miralles, D., Mistelbauer, T., Nicolai-Shaw, N., Parinussa, R., Pratola, C.,
Reimer, C., van der Schalie, R., Seneviratne, S. I., Smolander, T., and
Lecomte, P.: ESA CCI soil moisture for improved Earth system understanding:
State-of-the art and future directions, Remote Sens. Environ., 203,
185–215, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2017.07.001" ext-link-type="DOI">10.1016/j.rse.2017.07.001</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><?label 1?><mixed-citation>Dorigo, W. A., Wagner, W., Hohensinn, R., Hahn, S., Paulik, C., Xaver, A., Gruber, A., Drusch, M., Mecklenburg, S., van Oevelen, P., Robock, A., and Jackson, T.: The International Soil Moisture Network: a data hosting facility for global in situ soil moisture measurements, Hydrol. Earth Syst. Sci., 15, 1675–1698, <ext-link xlink:href="https://doi.org/10.5194/hess-15-1675-2011" ext-link-type="DOI">10.5194/hess-15-1675-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><?label 1?><mixed-citation>Dorigo, W. A., Xaver, A., Vreugdenhil, M., Gruber, A., Hegyiová, A.,
Sanchis-Dufau, A. D., Zamojski, D., Cordes, C., Wagner, W., and Drusch, M.:
Global automated quality control of in situ soil moisture data from the
International Soil Moisture Network, Vadose Zone J., 12, vzj2012.0097,
<ext-link xlink:href="https://doi.org/10.2136/vzj2012.0097" ext-link-type="DOI">10.2136/vzj2012.0097</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><?label 1?><mixed-citation>EODC (Earth Observation Data Centre for Water Resources Monitoring GmbH):
Algorithm theoretical baseline document (ATBD) supporting product version
06.1 – D2.1 version 2, Earth Observation Data Centre for Water Resources
Monitoring GmbH, available at: <uri>https://climate.esa.int/media/documents/ESA_CCI_SM_RD_D2.1_v1_ATBD_v05.2.pdf</uri>, last access: 30 July 2021.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><?label 1?><mixed-citation>Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, <ext-link xlink:href="https://doi.org/10.5194/gmd-9-1937-2016" ext-link-type="DOI">10.5194/gmd-9-1937-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><?label 1?><mixed-citation>Field, C. B., Barros, V., Stocker, T. F., and Dahe, Q. (Eds.): Managing the
risks of extreme events and disasters to advance climate change adaptation:
Special report of the intergovernmental panel on climate change, Cambridge
University Press, Cambridge, <ext-link xlink:href="https://doi.org/10.1017/CBO9781139177245" ext-link-type="DOI">10.1017/CBO9781139177245</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><?label 1?><mixed-citation>Friedl, M. and Sulla-Menashe, D.: MCD12C1 MODIS/Terra<inline-formula><mml:math id="M174" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>Aqua land cover type
L3 global 0.05Deg CMG V006, LP DAAC [Data set],
<ext-link xlink:href="https://doi.org/10.5067/MODIS/MCD12C1.006" ext-link-type="DOI">10.5067/MODIS/MCD12C1.006</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><?label 1?><mixed-citation>Giorgi, F. and Mearns, L. O.: Calculation of average, uncertainty range, and
reliability of regional climate changes from AOGCM simulations via the
“reliability ensemble averaging” (REA) method, J. Climate, 15, 1141–1158,
<ext-link xlink:href="https://doi.org/10.1175/1520-0442(2002)015&lt;1141:COAURA&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0442(2002)015&lt;1141:COAURA&gt;2.0.CO;2</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><?label 1?><mixed-citation>Green, J. K., Seneviratne, S. I., Berg, A. M., Findell, K. L., Hagemann, S.,
Lawrence, D. M., and Gentine, P.: Large influence of soil moisture on
long-term terrestrial carbon uptake, Nature, 565, 476–479,
<ext-link xlink:href="https://doi.org/10.1038/s41586-018-0848-x" ext-link-type="DOI">10.1038/s41586-018-0848-x</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><?label 1?><mixed-citation>Gruber, A., Su, C.-H., Crow, W. T., Zwieback, S., Dorigo, W. A., and Wagner,
W.: Estimating error cross-correlations in soil moisture data sets using
extended collocation analysis, J. Geophys. Res.-Atmos., 121, 1208–1219,
<ext-link xlink:href="https://doi.org/10.1002/2015JD024027" ext-link-type="DOI">10.1002/2015JD024027</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><?label 1?><mixed-citation>Gruber, A., Crow, W. T., and Dorigo, W. A.: Assimilation of spatially sparse
in situ soil moisture networks into a continuous model domain, Water Resour.
Res., 54, 1353–1367, <ext-link xlink:href="https://doi.org/10.1002/2017WR021277" ext-link-type="DOI">10.1002/2017WR021277</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><?label 1?><mixed-citation>Gruber, A., De Lannoy, G., Albergel, C., Al-Yaari, A., Brocca, L., Calvet,
J.-C., Colliander, A., Cosh, M., Crow, W., Dorigo, W., Draper, C., Hirschi,
M., Kerr, Y., Konings, A., Lahoz, W., McColl, K., Montzka, C.,
Muñoz-Sabater, J., Peng, J., Reichle, R., Richaume, P., Rüdiger, C.,
Scanlon, T., van der Schalie, R., Wigneron, J.-P., and Wagner, W.:
Validation practices for satellite soil moisture retrievals: What are (the)
errors?, Remote Sensing of Environment, 244, 111806, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2020.111806" ext-link-type="DOI">10.1016/j.rse.2020.111806</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><?label 1?><mixed-citation>Gu, X., Li, J., Chen, Y. D., Kong, D., and Liu, J.: Consistency and
discrepancy of global surface soil moisture changes from multiple
model-based data sets agains<?pagebreak page4404?>t satellite observations, J. Geophys. Res.-Atmos., 124, 1474–1495, <ext-link xlink:href="https://doi.org/10.1029/2018JD029304" ext-link-type="DOI">10.1029/2018JD029304</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><?label 1?><mixed-citation>Harris, I., Jones, P. D., Osborn, T. J., and Lister, D. H.: Updated
high-resolution grids of monthly climatic observations – the CRU TS3.10
dataset, Int. J. Climatol., 34, 623–642, <ext-link xlink:href="https://doi.org/10.1002/joc.3711" ext-link-type="DOI">10.1002/joc.3711</ext-link>,
2014.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><?label 1?><mixed-citation>Hauser, M.: Regionmask: plotting and creation of masks of spatial regions,
Zenodo, <ext-link xlink:href="https://doi.org/10.5281/zenodo.3585543" ext-link-type="DOI">10.5281/zenodo.3585543</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><?label 1?><mixed-citation>Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A.,
Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D.,
Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P.,
Biavati, G., Bidlot, J., Bonavita, M., Chiara, G., Dahlgren, P., Dee, D.,
Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer,
A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková,
M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., Rosnay, P.,
Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5 global
reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049,
<ext-link xlink:href="https://doi.org/10.1002/qj.3803" ext-link-type="DOI">10.1002/qj.3803</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><?label 1?><mixed-citation>Hobeichi, S., Abramowitz, G., Evans, J., and Ukkola, A.: Derived Optimal Linear Combination Evapotranspiration (DOLCE): a global gridded synthesis ET estimate, Hydrol. Earth Syst. Sci., 22, 1317–1336, <ext-link xlink:href="https://doi.org/10.5194/hess-22-1317-2018" ext-link-type="DOI">10.5194/hess-22-1317-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><?label 1?><mixed-citation>Hobeichi, S., Abramowitz, G., Evans, J., and Beck, H. E.: Linear Optimal Runoff Aggregate (LORA): a global gridded synthesis runoff product, Hydrol. Earth Syst. Sci., 23, 851–870, <ext-link xlink:href="https://doi.org/10.5194/hess-23-851-2019" ext-link-type="DOI">10.5194/hess-23-851-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><?label 1?><mixed-citation>
Hollander, M., Wolfe, D. A., and Chicken, E.: Nonparametric statistical
methods, 3rd ed., Wiley, 2013.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><?label 1?><mixed-citation>Huntzinger, D. N., Schwalm, C. R., Wei, Y., Cook, R. B., Michalak, a M.,
Schaefer, K., Jacobson, a R., Arain, M. a, Ciais, P., Fisher, J. B., Hayes,
D. J., Huang, M., Huang, S., Ito, A., Jain, a K., Lei, H., Lu, C., Maignan,
F., Mao, J., Parazoo, N. C., Peng, C., Peng, S., Poulter, B., Ricciuto, D.
M., Tian, H., Shi, X., Wang, W., Zeng, N., Zhao, F., Zhu, Q., Yang, J., and
Tao, B.: NACP MsTMIP: Global 0.5-degree model outputs in standard format,
Version 1.0, ORNL Distributed Active Archive Center,
<ext-link xlink:href="https://doi.org/10.3334/ornldaac/1225" ext-link-type="DOI">10.3334/ornldaac/1225</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><?label 1?><mixed-citation>Karthikeyan, L., Pan, M., Wanders, N., Kumar, D. N., and Wood, E. F.: Four
decades of microwave satellite soil moisture observations: Part 2. Product
validation and inter-satellite comparisons, Adv. Water Resour.,
109, 236–252, <ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2017.09.010" ext-link-type="DOI">10.1016/j.advwatres.2017.09.010</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><?label 1?><mixed-citation>Knoben, W. J. M., Woods, R. A., and Freer, J. E.: Global bimodal
precipitation seasonality: A systematic overview, Int. J. Climatol., 39,
558–567, <ext-link xlink:href="https://doi.org/10.1002/joc.5786" ext-link-type="DOI">10.1002/joc.5786</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><?label 1?><mixed-citation>Kobayashi, S., Ota, Y., Harada, Y., Ebita, A., Moriya, M., Onoda, H., Onogi,
K., Kamahori, H., Kobayashi, C., Endo, H., Miyaoka, K., and Takahashi, K.:
The JRA-55 Reanalysis: General specifications and basic characteristics, J.
Meteor. Soc. Jpn., 93, 5–48, <ext-link xlink:href="https://doi.org/10.2151/jmsj.2015-001" ext-link-type="DOI">10.2151/jmsj.2015-001</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><?label 1?><mixed-citation>Kumar, S., Newman, M., Wang, Y., and Livneh, B.: Potential reemergence of
seasonal soil moisture anomalies in North America, J. Climate, 32,
2707–2734, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-18-0540.1" ext-link-type="DOI">10.1175/JCLI-D-18-0540.1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><?label 1?><mixed-citation>Laloyaux, P., de Boisseson, E., and Dahlgren, P.: CERA-20C: An Earth system
approach to climate reanalysis, ECMWF Newsl., 150, 25–30,
<ext-link xlink:href="https://doi.org/10.21957/FFS36BIRJ2" ext-link-type="DOI">10.21957/FFS36BIRJ2</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><?label 1?><mixed-citation>Li, L., Shangguan, W., Deng, Y., Mao, J., Pan, J., Wei, N., Yuan, H., Zhang,
S., Zhang, Y., and Dai, Y.: A causal inference model based on random forests
to identify the effect of soil moisture on precipitation, J. Hydrometeorol.,
21, 1115–1131, <ext-link xlink:href="https://doi.org/10.1175/JHM-D-19-0209.1" ext-link-type="DOI">10.1175/JHM-D-19-0209.1</ext-link>, 2020a.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><?label 1?><mixed-citation>Li, M., Wu, P., and Ma, Z.: A comprehensive evaluation of soil moisture and
soil temperature from third-generation atmospheric and land reanalysis data
sets, Int. J. Climatol., 40, 5744–5766, <ext-link xlink:href="https://doi.org/10.1002/joc.6549" ext-link-type="DOI">10.1002/joc.6549</ext-link>,
2020b.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><?label 1?><mixed-citation>Liu, Y., Liu, Y., and Wang, W.: Inter-comparison of satellite-retrieved and
Global Land Data Assimilation System-simulated soil moisture datasets for
global drought analysis, Remote Sens. Environ., 220, 1–18,
<ext-link xlink:href="https://doi.org/10.1016/j.rse.2018.10.026" ext-link-type="DOI">10.1016/j.rse.2018.10.026</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><?label 1?><mixed-citation>Liu, Y. Y., Parinussa, R. M., Dorigo, W. A., De Jeu, R. A. M., Wagner, W., van Dijk, A. I. J. M., McCabe, M. F., and Evans, J. P.: Developing an improved soil moisture dataset by blending passive and active microwave satellite-based retrievals, Hydrol. Earth Syst. Sci., 15, 425–436, <ext-link xlink:href="https://doi.org/10.5194/hess-15-425-2011" ext-link-type="DOI">10.5194/hess-15-425-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><?label 1?><mixed-citation>Liu, Y. Y., Dorigo, W. A., Parinussa, R. M., de Jeu, R. A. M., Wagner, W.,
McCabe, M. F., Evans, J. P., and van Dijk, A. I. J. M.: Trend-preserving
blending of passive and active microwave soil moisture retrievals, Remote
Sens. Environ., 123, 280–297, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2012.03.014" ext-link-type="DOI">10.1016/j.rse.2012.03.014</ext-link>,
2012.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><?label 1?><mixed-citation>Llamas, R. M., Guevara, M., Rorabaugh, D., Taufer, M., and Vargas, R.:
Spatial gap-filling of ESA-CCI satellite-derived soil moisture based on
geostatistical techniques and multiple regression, Remote Sens., 12, 665,
<ext-link xlink:href="https://doi.org/10.3390/rs12040665" ext-link-type="DOI">10.3390/rs12040665</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><?label 1?><mixed-citation>Loew, A., Stacke, T., Dorigo, W., de Jeu, R., and Hagemann, S.: Potential and limitations of multidecadal satellite soil moisture observations for selected climate model evaluation studies, Hydrol. Earth Syst. Sci., 17, 3523–3542, <ext-link xlink:href="https://doi.org/10.5194/hess-17-3523-2013" ext-link-type="DOI">10.5194/hess-17-3523-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><?label 1?><mixed-citation>Lu, J., Carbone, G. J., and Grego, J. M.: Uncertainty and hotspots in 21st
century projections of agricultural drought from CMIP5 models, Sci. Rep., 9,
4922, <ext-link xlink:href="https://doi.org/10.1038/s41598-019-41196-z" ext-link-type="DOI">10.1038/s41598-019-41196-z</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><?label 1?><mixed-citation>Martens, B., Miralles, D. G., Lievens, H., van der Schalie, R., de Jeu, R. A. M., Fernández-Prieto, D., Beck, H. E., Dorigo, W. A., and Verhoest, N. E. C.: GLEAM v3: satellite-based land evaporation and root-zone soil moisture, Geosci. Model Dev., 10, 1903–1925, <ext-link xlink:href="https://doi.org/10.5194/gmd-10-1903-2017" ext-link-type="DOI">10.5194/gmd-10-1903-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><?label 1?><mixed-citation>
McCarty, W., Coy, L., Gelaro, R., Huang, A., Merkova, D., Smith, E. B.,
Sienkiewicz, M., and Wargan, K.: MERRA-2 input observations: summary and
assessment, Goddard Space Flight Center, Greenbelt, MD, USA, 2016.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><?label 1?><mixed-citation>Melton, J. R., Verseghy, D. L., Sospedra-Alfonso, R., and Gruber, S.: Improving permafrost physics in the coupled Canadian Land Surface Scheme (v.3.6.2) and Canadian Terrestrial Ecosystem Model (v.2.1) (CLASS-CTEM), Geosci. Model Dev., 12, 4443–4467, <ext-link xlink:href="https://doi.org/10.5194/gmd-12-4443-2019" ext-link-type="DOI">10.5194/gmd-12-4443-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib52"><label>52</label><?label 1?><mixed-citation>Molero, B., Leroux, D. J., Richaume, P., Kerr, Y. H., Merlin, O., Cosh, M.
H., and Bindlish, R.: Multi-timescale analysis of the spatial
representativeness of in situ soil moisture data within satellite
footprints: Soil moisture time and spatial scales, J. Geophys. Res.-Atmos.,
123, 3–21, <ext-link xlink:href="https://doi.org/10.1002/2017JD027478" ext-link-type="DOI">10.1002/2017JD027478</ext-link>, 2018.</mixed-citation></ref>
      <?pagebreak page4405?><ref id="bib1.bib53"><label>53</label><?label 1?><mixed-citation>Mystakidis, S., Davin, E. L., Gruber, N., and Seneviratne, S. I.:
Constraining future terrestrial carbon cycle projections using
observation-based water and carbon flux estimates, Glob. Change Biol., 22,
2198–2215, <ext-link xlink:href="https://doi.org/10.1111/gcb.13217" ext-link-type="DOI">10.1111/gcb.13217</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib54"><label>54</label><?label 1?><mixed-citation>O, S. and Orth, R.: Global soil moisture data derived through machine
learning trained with in-situ measurements, Sci. Data, 8, 170,
<ext-link xlink:href="https://doi.org/10.1038/s41597-021-00964-1" ext-link-type="DOI">10.1038/s41597-021-00964-1</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib55"><label>55</label><?label 1?><mixed-citation>Padrón, R. S., Gudmundsson, L., and Seneviratne, S. I.: Observational
constraints reduce likelihood of extreme changes in multidecadal land water
availability, Geophys. Res. Lett., 46, 736–744,
<ext-link xlink:href="https://doi.org/10.1029/2018GL080521" ext-link-type="DOI">10.1029/2018GL080521</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib56"><label>56</label><?label 1?><mixed-citation>Pan, S., Pan, N., Tian, H., Friedlingstein, P., Sitch, S., Shi, H., Arora, V. K., Haverd, V., Jain, A. K., Kato, E., Lienert, S., Lombardozzi, D., Nabel, J. E. M. S., Ottlé, C., Poulter, B., Zaehle, S., and Running, S. W.: Evaluation of global terrestrial evapotranspiration using state-of-the-art approaches in remote sensing, machine learning and land surface modeling, Hydrol. Earth Syst. Sci., 24, 1485–1509, <ext-link xlink:href="https://doi.org/10.5194/hess-24-1485-2020" ext-link-type="DOI">10.5194/hess-24-1485-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib57"><label>57</label><?label 1?><mixed-citation>
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.:
Scikit-learn: machine learning in Python, J. Mach. Learn. Res., 12,
2825–2830, 2011.</mixed-citation></ref>
      <ref id="bib1.bib58"><label>58</label><?label 1?><mixed-citation>Piles, M., Ballabrera-Poy, J., and Muñoz-Sabater, J.: Dominant features
of global surface soil moisture variability observed by the SMOS satellite,
Remote Sens., 11, 95, <ext-link xlink:href="https://doi.org/10.3390/rs11010095" ext-link-type="DOI">10.3390/rs11010095</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib59"><label>59</label><?label 1?><mixed-citation>Poli, P., Hersbach, H., Dee, D. P., Berrisford, P., Simmons, A. J., Vitart,
F., Laloyaux, P., Tan, D. G. H., Peubey, C., Thépaut, J.-N.,
Trémolet, Y., Hólm, E. V., Bonavita, M., Isaksen, L., and Fisher,
M.: ERA-20C: An atmospheric reanalysis of the twentieth century, J. Climate,
29, 4083–4097, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-15-0556.1" ext-link-type="DOI">10.1175/JCLI-D-15-0556.1</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib60"><label>60</label><?label 1?><mixed-citation>Rui, H. and Beaudoing, H.: README document for NASA GLDAS version 2 data
products, NASA Goddard Space Flight Center (GES DISC), Greenbelt, MD, USA,
available at: <uri>https://disc.gsfc.nasa.gov/information/documents?title=Hydrology Documentation</uri> (last access: 30 July 2021), 2020.</mixed-citation></ref>
      <ref id="bib1.bib61"><label>61</label><?label 1?><mixed-citation>Seabold, S. and Perktold, J.: Statsmodels: Econometric and statistical
modeling with Python, in: 9th Python in Science Conference, Austin, Texas, 28 June–3 July 2010
<ext-link xlink:href="https://doi.org/10.25080/majora-92bf1922-011" ext-link-type="DOI">10.25080/majora-92bf1922-011</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib62"><label>62</label><?label 1?><mixed-citation>Sitch, S., Huntingford, C., Gedney, N., Levy, P. E., Lomas, M., Piao, S. L.,
Betts, R., Ciais, P., Cox, P., Friedlingstein, P., Jones, C. D., Prentice,
I. C., and Woodward, F. I.: Evaluation of the terrestrial carbon cycle,
future plant geography and climate-carbon cycle feedbacks using five Dynamic
Global Vegetation Models (DGVMs), Glob. Change Biol., 14, 2015–2039,
<ext-link xlink:href="https://doi.org/10.1111/j.1365-2486.2008.01626.x" ext-link-type="DOI">10.1111/j.1365-2486.2008.01626.x</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib63"><label>63</label><?label 1?><mixed-citation>Sospedra-Alfonso, R. and Merryfield, W. J.: Initialization and potential
predictability of soil moisture in the Canadian Seasonal to Interannual
Prediction System, J. Climate, 31, 5205–5224,
<ext-link xlink:href="https://doi.org/10.1175/JCLI-D-17-0707.1" ext-link-type="DOI">10.1175/JCLI-D-17-0707.1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib64"><label>64</label><?label 1?><mixed-citation>Su, C.-H., Ryu, D., Dorigo, W., Zwieback, S., Gruber, A., Albergel, C.,
Reichle, R. H., and Wagner, W.: Homogeneity of a global multisatellite soil
moisture climate data record, Geophys. Res. Lett., 43, 11245–11252,
<ext-link xlink:href="https://doi.org/10.1002/2016GL070458" ext-link-type="DOI">10.1002/2016GL070458</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib65"><label>65</label><?label 1?><mixed-citation>Spinoni, J., Barbosa, P., De Jager, A., McCormick, N., Naumann, G., Vogt, J.
V., Magni, D., Masante, D., and Mazzeschi, M.: A new global database of
meteorological drought events from 1951 to 2016, J. Hydrol. Reg. Stud., 22,
100593, <ext-link xlink:href="https://doi.org/10.1016/j.ejrh.2019.100593" ext-link-type="DOI">10.1016/j.ejrh.2019.100593</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib66"><label>66</label><?label 1?><mixed-citation>Su, C.-H., Ryu, D., Dorigo, W., Zwieback, S., Gruber, A., Albergel, C.,
Reichle, R. H., and Wagner, W.: Homogeneity of a global multisatellite soil
moisture climate data record, Geophys. Res. Lett., 43, 11245–11252,
<ext-link xlink:href="https://doi.org/10.1002/2016GL070458" ext-link-type="DOI">10.1002/2016GL070458</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib67"><label>67</label><?label 1?><mixed-citation>Taylor, K. E., Stouffer, R. J., and Meehl, G. A.: An overview of CMIP5 and
the experiment design, B. Am. Meteorol. Soc., 93, 485–498,
<ext-link xlink:href="https://doi.org/10.1175/BAMS-D-11-00094.1" ext-link-type="DOI">10.1175/BAMS-D-11-00094.1</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib68"><label>68</label><?label 1?><mixed-citation>Tobin, K. J., Torres, R., Crow, W. T., and Bennett, M. E.: Multi-decadal analysis of root-zone soil moisture applying the exponential filter across CONUS, Hydrol. Earth Syst. Sci., 21, 4403–4417, <ext-link xlink:href="https://doi.org/10.5194/hess-21-4403-2017" ext-link-type="DOI">10.5194/hess-21-4403-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib69"><label>69</label><?label 1?><mixed-citation>UCAR/NCAR/CISL/TDD: The NCAR Comannd Language (Version 6.6.2) [Software],
Boulder, CO, USA, <ext-link xlink:href="https://doi.org/10.5065/D6WD3XH5" ext-link-type="DOI">10.5065/D6WD3XH5</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib70"><label>70</label><?label 1?><mixed-citation>Wang, G., Garcia, D., Liu, Y., de Jeu, R., and Johannes Dolman, A.: A
three-dimensional gap filling method for large geophysical datasets:
Application to global satellite soil moisture observations, Environ. Model.
Softw., 30, 139–142, <ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2011.10.015" ext-link-type="DOI">10.1016/j.envsoft.2011.10.015</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib71"><label>71</label><?label 1?><mixed-citation>Wang, Y.: soil_moisture_merge,  Bitbucket repository [code],
<uri>https://bitbucket.org/ywang11/soil_moisture_merge/src/master/</uri>
(last access: 30 July 2021),   2020.</mixed-citation></ref>
      <ref id="bib1.bib72"><label>72</label><?label 1?><mixed-citation>Wang, Y. and Mao, J.: Global multi-layer soil moisture products,
<ext-link xlink:href="https://doi.org/10.6084/m9.figshare.13661312.v1" ext-link-type="DOI">10.6084/m9.figshare.13661312.v1</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib73"><label>73</label><?label 1?><mixed-citation>Wang, Y., Leng, P., Peng, J., Marzahn, P., and Ludwig, R.: Global
assessments of two blended microwave soil moisture products CCI and SMOPS
with in-situ measurements and reanalysis data, Int. J. Appl. Earth Obs., 94, 102234,
<ext-link xlink:href="https://doi.org/10.1016/j.jag.2020.102234" ext-link-type="DOI">10.1016/j.jag.2020.102234</ext-link>, 2021a.</mixed-citation></ref>
      <ref id="bib1.bib74"><label>74</label><?label 1?><mixed-citation>Wang, Z., Zhan, C., Ning, L., and Guo, H.: Evaluation of global terrestrial
evapotranspiration in CMIP6 models, Theor. Appl. Climatol., 143, 521–531,
<ext-link xlink:href="https://doi.org/10.1007/s00704-020-03437-4" ext-link-type="DOI">10.1007/s00704-020-03437-4</ext-link>, 2021b.</mixed-citation></ref>
      <ref id="bib1.bib75"><label>75</label><?label 1?><mixed-citation>Wu, Z., Zhou, J., He, H., Lin, Q., Wu, X., and Xu, Z.: An advanced error
correction methodology for merging in-situ observed and model-based soil
moisture, J. Hydrol., 566, 150–163,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2018.09.018" ext-link-type="DOI">10.1016/j.jhydrol.2018.09.018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib76"><label>76</label><?label 1?><mixed-citation>Yuan, S. and Quiring, S. M.: Evaluation of soil moisture in CMIP5 simulations over the contiguous United States using in situ and satellite observations, Hydrol. Earth Syst. Sci., 21, 2203–2218, <ext-link xlink:href="https://doi.org/10.5194/hess-21-2203-2017" ext-link-type="DOI">10.5194/hess-21-2203-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib77"><label>77</label><?label 1?><mixed-citation>Zeng, Y., Su, Z., van der Velde, R., Wang, L., Xu, K., Wang, X., and Wen,
J.: Blending satellite observed, model simulated, and in situ measured soil
moisture over Tibetan plateau, Remote Sens., 8, 268,
<ext-link xlink:href="https://doi.org/10.3390/rs8030268" ext-link-type="DOI">10.3390/rs8030268</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib78"><label>78</label><?label 1?><mixed-citation>Zhang, G. and Cook, K. H.: West African monsoon demise: Climatology,
interannual variations, and relationship to seasonal rainfall, J. Geophys.
Res.-Atmos., 119, 10175–10193, <ext-link xlink:href="https://doi.org/10.1002/2014JD022043" ext-link-type="DOI">10.1002/2014JD022043</ext-link>, 2014.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Development of observation-based global multilayer soil moisture products for 1970 to 2016</article-title-html>
<abstract-html><p>Soil moisture (SM) datasets are critical to understanding
the global water, energy, and biogeochemical cycles and benefit extensive
societal applications. However, individual sources of SM data (e.g., in situ
and satellite observations, reanalysis, offline land surface model
simulations, Earth system model – ESM – simulations) have source-specific
limitations and biases related to the spatiotemporal continuity,
resolutions, and modeling and retrieval assumptions. Here, we developed seven
global, gap-free, long-term (1970–2016), multilayer (0–10, 10–30,
30–50, and 50–100&thinsp;cm) SM products at monthly 0.5° resolution
(available at <a href="https://doi.org/10.6084/m9.figshare.13661312.v1" target="_blank">https://doi.org/10.6084/m9.figshare.13661312.v1</a>; Wang and Mao, 2021) by
synthesizing a wide range of SM datasets using three statistical methods
(unweighted averaging, optimal linear combination, and emergent constraint).
The merged products outperformed their source datasets when evaluated with
in situ observations (mean bias from −0.044 to 0.033&thinsp;m<sup>3</sup>&thinsp;m<sup>−3</sup>, root
mean square errors from 0.076 to 0.104&thinsp;m<sup>3</sup>&thinsp;m<sup>−3</sup>, Pearson
correlations from 0.35 to 0.67) and multiple gridded datasets that did not
enter merging because of insufficient spatial, temporal, or soil layer
coverage. Three of the new SM products, which were produced by applying any
of the three merging methods to the source datasets excluding the ESMs,
had lower bias and root mean square errors and higher correlations than the
ESM-dependent merged products. The ESM-independent products also showed a
better ability to capture historical large-scale drought events than the
ESM-dependent products. The merged products generally showed reasonable
temporal homogeneity and physically plausible global sensitivities to
observed meteorological factors, except that the ESM-dependent products
underestimated the low-frequency temporal variability in SM and
overestimated the high-frequency variability for the 50–100&thinsp;cm depth.
Based on these evaluation results, the three ESM-independent products were
finally recommended for future applications because of their better
performances than the ESM-dependent ones. Despite uncertainties in the raw
SM datasets and fusion methods, these hybrid products create added value
over existing SM datasets because of the performance improvement and
harmonized spatial, temporal, and vertical coverages, and they provide a new
foundation for scientific investigation and resource management.</p></abstract-html>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
Al Bitar, A., Kerr, Y. H., Merlin, O., Cabot, F., Wigneron, J.-P.,
Froissard, F., Suquia, D., Petitpa, A., Piguet, B., and Martin, E.: Global
drought index from SMOS soil moisture, in: IEEE International Geoscience and
Remote Sensing Symposium, IGARSS, Melbourne, Australia, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
Al Bitar, A., Mialon, A., Kerr, Y. H., Cabot, F., Richaume, P., Jacquette, E., Quesney, A., Mahmoodi, A., Tarot, S., Parrens, M., Al-Yaari, A., Pellarin, T., Rodriguez-Fernandez, N., and Wigneron, J.-P.: The global SMOS Level 3 daily soil moisture and brightness temperature maps, Earth Syst. Sci. Data, 9, 293–315, <a href="https://doi.org/10.5194/essd-9-293-2017" target="_blank">https://doi.org/10.5194/essd-9-293-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
Andresen, C. G., Lawrence, D. M., Wilson, C. J., McGuire, A. D., Koven, C., Schaefer, K., Jafarov, E., Peng, S., Chen, X., Gouttevin, I., Burke, E., Chadburn, S., Ji, D., Chen, G., Hayes, D., and Zhang, W.: Soil moisture and hydrology projections of the permafrost region – a model intercomparison, The Cryosphere, 14, 445–459, <a href="https://doi.org/10.5194/tc-14-445-2020" target="_blank">https://doi.org/10.5194/tc-14-445-2020</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
Balsamo, G., Beljaars, A., Scipal, K., Viterbo, P., van den Hurk, B.,
Hirschi, M., and Betts, A. K.: A revised hydrology for the ECMWF model:
Verification from field site to terrestrial water storage and impact in the
integrated forecast system, J. Hydrometeorol., 10, 623–643,
<a href="https://doi.org/10.1175/2008JHM1068.1" target="_blank">https://doi.org/10.1175/2008JHM1068.1</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
Balsamo, G., Albergel, C., Beljaars, A., Boussetta, S., Brun, E., Cloke, H., Dee, D., Dutra, E., Muñoz-Sabater, J., Pappenberger, F., de Rosnay, P., Stockdale, T., and Vitart, F.: ERA-Interim/Land: a global land surface reanalysis data set, Hydrol. Earth Syst. Sci., 19, 389–407, <a href="https://doi.org/10.5194/hess-19-389-2015" target="_blank">https://doi.org/10.5194/hess-19-389-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
Beaudoing, H., Rodell, M., and NASA/GSFC/HSL: GLDAS Noah Land Surface Model
L4 3 hourly 0.25 x 0.25 degree V2.0, Goddard Earth Sciences Data and
Information Services Center (GES DISC), Greenbelt, MD, USA, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
Beck, H. E., Pan, M., Miralles, D. G., Reichle, R. H., Dorigo, W. A., Hahn, S., Sheffield, J., Karthikeyan, L., Balsamo, G., Parinussa, R. M., van Dijk, A. I. J. M., Du, J., Kimball, J. S., Vergopolan, N., and Wood, E. F.: Evaluation of 18 satellite- and model-based soil moisture products using in situ measurements from 826 sensors, Hydrol. Earth Syst. Sci., 25, 17–40, <a href="https://doi.org/10.5194/hess-25-17-2021" target="_blank">https://doi.org/10.5194/hess-25-17-2021</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
Best, M. J., Pryor, M., Clark, D. B., Rooney, G. G., Essery, R. L. H., Ménard, C. B., Edwards, J. M., Hendry, M. A., Porson, A., Gedney, N., Mercado, L. M., Sitch, S., Blyth, E., Boucher, O., Cox, P. M., Grimmond, C. S. B., and Harding, R. J.: The Joint UK Land Environment Simulator (JULES), model description – Part 1: Energy and water fluxes, Geosci. Model Dev., 4, 677–699, <a href="https://doi.org/10.5194/gmd-4-677-2011" target="_blank">https://doi.org/10.5194/gmd-4-677-2011</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
Bishop, C. H. and Abramowitz, G.: Climate model dependence and the replicate
Earth paradigm, Clim. Dynam., 41, 885–900,
<a href="https://doi.org/10.1007/s00382-012-1610-y" target="_blank">https://doi.org/10.1007/s00382-012-1610-y</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
Dai, A. and Zhao, T.: Uncertainties in historical changes and future
projections of drought. Part I: estimates of historical drought changes,
Clim. Change, 144, 519–533, <a href="https://doi.org/10.1007/s10584-016-1705-2" target="_blank">https://doi.org/10.1007/s10584-016-1705-2</a>,
2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
Dai, A., Trenberth, K. E., and Qian, T.: A global dataset of Palmer Drought
Severity Index for 1870–2002: Relationship with soil moisture and effects
of surface warming, J. Hydrometeorol., 5, 1117–1130,
<a href="https://doi.org/10.1175/jhm-386.1" target="_blank">https://doi.org/10.1175/jhm-386.1</a>, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P.,
Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P.,
Bechtold, P., Beljaars, A. C. M., van de Berg, I., Biblot, J., Bormann, N.,
Delsol, C., Dragani, R., Fuentes, M., Greer, A. J., Haimberger, L., Healy, S.
B., Hersbach, H., Holm, E. V., Isaksen, L., Kallberg, P., Kohler, M.,
Matricardi, M., McNally, A. P., Mong-Sanz, B. M., Morcette, J.-J., Park,
B.-K., Peubey, C., de Rosnay, P., Tavolato, C., Thepaut, J. N., and Vitart,
F.: The ERA-Interim reanalysis: Configuration and performance of the data
assimilation system, Q. J. Roy. Meteorol. Soc., 137, 553–597, <a href="https://doi.org/10.1002/qj.828" target="_blank">https://doi.org/10.1002/qj.828</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
de Rosnay, P., Drusch, M., Vasiljevic, D., Balsamo, G., Albergel, C., and
Isaksen, L.: A simplified Extended Kalman Filter for the global operational
soil moisture analysis at ECMWF, Q. J. Roy. Meteor. Soc., 139, 1199–1213,
<a href="https://doi.org/10.1002/qj.2023" target="_blank">https://doi.org/10.1002/qj.2023</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
Dirmeyer, P. A., Gao, X., Zhao, M., Guo, Z., Oki, T., and Hanasaki, N.:
GSWP-2: Multimodel analysis and implications for our perception of the land
surface, B. Am. Meteorol. Soc., 87, 1381–1398,
<a href="https://doi.org/10.1175/bams-87-10-1381" target="_blank">https://doi.org/10.1175/bams-87-10-1381</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
Dorigo, W., de Jeu, R., Chung, D., Parinussa, R., Liu, Y., Wagner, W., and
Fernández-Prieto, D.: Evaluating global trends (1988–2010) in harmonized
multi-satellite surface soil moisture, Geophys. Res. Lett., 39, L18405,
<a href="https://doi.org/10.1029/2012GL052988" target="_blank">https://doi.org/10.1029/2012GL052988</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
Dorigo, W., Wagner, W., Albergel, C., Albrecht, F., Balsamo, G., Brocca, L.,
Chung, D., Ertl, M., Forkel, M., Gruber, A., Haas, E., Hamer, P. D.,
Hirschi, M., Ikonen, J., de Jeu, R., Kidd, R., Lahoz, W., Liu, Y. Y.,
Miralles, D., Mistelbauer, T., Nicolai-Shaw, N., Parinussa, R., Pratola, C.,
Reimer, C., van der Schalie, R., Seneviratne, S. I., Smolander, T., and
Lecomte, P.: ESA CCI soil moisture for improved Earth system understanding:
State-of-the art and future directions, Remote Sens. Environ., 203,
185–215, <a href="https://doi.org/10.1016/j.rse.2017.07.001" target="_blank">https://doi.org/10.1016/j.rse.2017.07.001</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
Dorigo, W. A., Wagner, W., Hohensinn, R., Hahn, S., Paulik, C., Xaver, A., Gruber, A., Drusch, M., Mecklenburg, S., van Oevelen, P., Robock, A., and Jackson, T.: The International Soil Moisture Network: a data hosting facility for global in situ soil moisture measurements, Hydrol. Earth Syst. Sci., 15, 1675–1698, <a href="https://doi.org/10.5194/hess-15-1675-2011" target="_blank">https://doi.org/10.5194/hess-15-1675-2011</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
Dorigo, W. A., Xaver, A., Vreugdenhil, M., Gruber, A., Hegyiová, A.,
Sanchis-Dufau, A. D., Zamojski, D., Cordes, C., Wagner, W., and Drusch, M.:
Global automated quality control of in situ soil moisture data from the
International Soil Moisture Network, Vadose Zone J., 12, vzj2012.0097,
<a href="https://doi.org/10.2136/vzj2012.0097" target="_blank">https://doi.org/10.2136/vzj2012.0097</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
EODC (Earth Observation Data Centre for Water Resources Monitoring GmbH):
Algorithm theoretical baseline document (ATBD) supporting product version
06.1 – D2.1 version 2, Earth Observation Data Centre for Water Resources
Monitoring GmbH, available at: <a href="https://climate.esa.int/media/documents/ESA_CCI_SM_RD_D2.1_v1_ATBD_v05.2.pdf" target="_blank"/>, last access: 30 July 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, <a href="https://doi.org/10.5194/gmd-9-1937-2016" target="_blank">https://doi.org/10.5194/gmd-9-1937-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
Field, C. B., Barros, V., Stocker, T. F., and Dahe, Q. (Eds.): Managing the
risks of extreme events and disasters to advance climate change adaptation:
Special report of the intergovernmental panel on climate change, Cambridge
University Press, Cambridge, <a href="https://doi.org/10.1017/CBO9781139177245" target="_blank">https://doi.org/10.1017/CBO9781139177245</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
Friedl, M. and Sulla-Menashe, D.: MCD12C1 MODIS/Terra+Aqua land cover type
L3 global 0.05Deg CMG V006, LP DAAC [Data set],
<a href="https://doi.org/10.5067/MODIS/MCD12C1.006" target="_blank">https://doi.org/10.5067/MODIS/MCD12C1.006</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
Giorgi, F. and Mearns, L. O.: Calculation of average, uncertainty range, and
reliability of regional climate changes from AOGCM simulations via the
“reliability ensemble averaging” (REA) method, J. Climate, 15, 1141–1158,
<a href="https://doi.org/10.1175/1520-0442(2002)015&lt;1141:COAURA&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0442(2002)015&lt;1141:COAURA&gt;2.0.CO;2</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
Green, J. K., Seneviratne, S. I., Berg, A. M., Findell, K. L., Hagemann, S.,
Lawrence, D. M., and Gentine, P.: Large influence of soil moisture on
long-term terrestrial carbon uptake, Nature, 565, 476–479,
<a href="https://doi.org/10.1038/s41586-018-0848-x" target="_blank">https://doi.org/10.1038/s41586-018-0848-x</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
Gruber, A., Su, C.-H., Crow, W. T., Zwieback, S., Dorigo, W. A., and Wagner,
W.: Estimating error cross-correlations in soil moisture data sets using
extended collocation analysis, J. Geophys. Res.-Atmos., 121, 1208–1219,
<a href="https://doi.org/10.1002/2015JD024027" target="_blank">https://doi.org/10.1002/2015JD024027</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
Gruber, A., Crow, W. T., and Dorigo, W. A.: Assimilation of spatially sparse
in situ soil moisture networks into a continuous model domain, Water Resour.
Res., 54, 1353–1367, <a href="https://doi.org/10.1002/2017WR021277" target="_blank">https://doi.org/10.1002/2017WR021277</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
Gruber, A., De Lannoy, G., Albergel, C., Al-Yaari, A., Brocca, L., Calvet,
J.-C., Colliander, A., Cosh, M., Crow, W., Dorigo, W., Draper, C., Hirschi,
M., Kerr, Y., Konings, A., Lahoz, W., McColl, K., Montzka, C.,
Muñoz-Sabater, J., Peng, J., Reichle, R., Richaume, P., Rüdiger, C.,
Scanlon, T., van der Schalie, R., Wigneron, J.-P., and Wagner, W.:
Validation practices for satellite soil moisture retrievals: What are (the)
errors?, Remote Sensing of Environment, 244, 111806, <a href="https://doi.org/10.1016/j.rse.2020.111806" target="_blank">https://doi.org/10.1016/j.rse.2020.111806</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
Gu, X., Li, J., Chen, Y. D., Kong, D., and Liu, J.: Consistency and
discrepancy of global surface soil moisture changes from multiple
model-based data sets against satellite observations, J. Geophys. Res.-Atmos., 124, 1474–1495, <a href="https://doi.org/10.1029/2018JD029304" target="_blank">https://doi.org/10.1029/2018JD029304</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
Harris, I., Jones, P. D., Osborn, T. J., and Lister, D. H.: Updated
high-resolution grids of monthly climatic observations – the CRU TS3.10
dataset, Int. J. Climatol., 34, 623–642, <a href="https://doi.org/10.1002/joc.3711" target="_blank">https://doi.org/10.1002/joc.3711</a>,
2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
Hauser, M.: Regionmask: plotting and creation of masks of spatial regions,
Zenodo, <a href="https://doi.org/10.5281/zenodo.3585543" target="_blank">https://doi.org/10.5281/zenodo.3585543</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A.,
Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D.,
Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P.,
Biavati, G., Bidlot, J., Bonavita, M., Chiara, G., Dahlgren, P., Dee, D.,
Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer,
A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková,
M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., Rosnay, P.,
Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5 global
reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049,
<a href="https://doi.org/10.1002/qj.3803" target="_blank">https://doi.org/10.1002/qj.3803</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
Hobeichi, S., Abramowitz, G., Evans, J., and Ukkola, A.: Derived Optimal Linear Combination Evapotranspiration (DOLCE): a global gridded synthesis ET estimate, Hydrol. Earth Syst. Sci., 22, 1317–1336, <a href="https://doi.org/10.5194/hess-22-1317-2018" target="_blank">https://doi.org/10.5194/hess-22-1317-2018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
Hobeichi, S., Abramowitz, G., Evans, J., and Beck, H. E.: Linear Optimal Runoff Aggregate (LORA): a global gridded synthesis runoff product, Hydrol. Earth Syst. Sci., 23, 851–870, <a href="https://doi.org/10.5194/hess-23-851-2019" target="_blank">https://doi.org/10.5194/hess-23-851-2019</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
Hollander, M., Wolfe, D. A., and Chicken, E.: Nonparametric statistical
methods, 3rd ed., Wiley, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
Huntzinger, D. N., Schwalm, C. R., Wei, Y., Cook, R. B., Michalak, a M.,
Schaefer, K., Jacobson, a R., Arain, M. a, Ciais, P., Fisher, J. B., Hayes,
D. J., Huang, M., Huang, S., Ito, A., Jain, a K., Lei, H., Lu, C., Maignan,
F., Mao, J., Parazoo, N. C., Peng, C., Peng, S., Poulter, B., Ricciuto, D.
M., Tian, H., Shi, X., Wang, W., Zeng, N., Zhao, F., Zhu, Q., Yang, J., and
Tao, B.: NACP MsTMIP: Global 0.5-degree model outputs in standard format,
Version 1.0, ORNL Distributed Active Archive Center,
<a href="https://doi.org/10.3334/ornldaac/1225" target="_blank">https://doi.org/10.3334/ornldaac/1225</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
Karthikeyan, L., Pan, M., Wanders, N., Kumar, D. N., and Wood, E. F.: Four
decades of microwave satellite soil moisture observations: Part 2. Product
validation and inter-satellite comparisons, Adv. Water Resour.,
109, 236–252, <a href="https://doi.org/10.1016/j.advwatres.2017.09.010" target="_blank">https://doi.org/10.1016/j.advwatres.2017.09.010</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
Knoben, W. J. M., Woods, R. A., and Freer, J. E.: Global bimodal
precipitation seasonality: A systematic overview, Int. J. Climatol., 39,
558–567, <a href="https://doi.org/10.1002/joc.5786" target="_blank">https://doi.org/10.1002/joc.5786</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
Kobayashi, S., Ota, Y., Harada, Y., Ebita, A., Moriya, M., Onoda, H., Onogi,
K., Kamahori, H., Kobayashi, C., Endo, H., Miyaoka, K., and Takahashi, K.:
The JRA-55 Reanalysis: General specifications and basic characteristics, J.
Meteor. Soc. Jpn., 93, 5–48, <a href="https://doi.org/10.2151/jmsj.2015-001" target="_blank">https://doi.org/10.2151/jmsj.2015-001</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
Kumar, S., Newman, M., Wang, Y., and Livneh, B.: Potential reemergence of
seasonal soil moisture anomalies in North America, J. Climate, 32,
2707–2734, <a href="https://doi.org/10.1175/JCLI-D-18-0540.1" target="_blank">https://doi.org/10.1175/JCLI-D-18-0540.1</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
Laloyaux, P., de Boisseson, E., and Dahlgren, P.: CERA-20C: An Earth system
approach to climate reanalysis, ECMWF Newsl., 150, 25–30,
<a href="https://doi.org/10.21957/FFS36BIRJ2" target="_blank">https://doi.org/10.21957/FFS36BIRJ2</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
Li, L., Shangguan, W., Deng, Y., Mao, J., Pan, J., Wei, N., Yuan, H., Zhang,
S., Zhang, Y., and Dai, Y.: A causal inference model based on random forests
to identify the effect of soil moisture on precipitation, J. Hydrometeorol.,
21, 1115–1131, <a href="https://doi.org/10.1175/JHM-D-19-0209.1" target="_blank">https://doi.org/10.1175/JHM-D-19-0209.1</a>, 2020a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
Li, M., Wu, P., and Ma, Z.: A comprehensive evaluation of soil moisture and
soil temperature from third-generation atmospheric and land reanalysis data
sets, Int. J. Climatol., 40, 5744–5766, <a href="https://doi.org/10.1002/joc.6549" target="_blank">https://doi.org/10.1002/joc.6549</a>,
2020b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
Liu, Y., Liu, Y., and Wang, W.: Inter-comparison of satellite-retrieved and
Global Land Data Assimilation System-simulated soil moisture datasets for
global drought analysis, Remote Sens. Environ., 220, 1–18,
<a href="https://doi.org/10.1016/j.rse.2018.10.026" target="_blank">https://doi.org/10.1016/j.rse.2018.10.026</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
Liu, Y. Y., Parinussa, R. M., Dorigo, W. A., De Jeu, R. A. M., Wagner, W., van Dijk, A. I. J. M., McCabe, M. F., and Evans, J. P.: Developing an improved soil moisture dataset by blending passive and active microwave satellite-based retrievals, Hydrol. Earth Syst. Sci., 15, 425–436, <a href="https://doi.org/10.5194/hess-15-425-2011" target="_blank">https://doi.org/10.5194/hess-15-425-2011</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
Liu, Y. Y., Dorigo, W. A., Parinussa, R. M., de Jeu, R. A. M., Wagner, W.,
McCabe, M. F., Evans, J. P., and van Dijk, A. I. J. M.: Trend-preserving
blending of passive and active microwave soil moisture retrievals, Remote
Sens. Environ., 123, 280–297, <a href="https://doi.org/10.1016/j.rse.2012.03.014" target="_blank">https://doi.org/10.1016/j.rse.2012.03.014</a>,
2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
Llamas, R. M., Guevara, M., Rorabaugh, D., Taufer, M., and Vargas, R.:
Spatial gap-filling of ESA-CCI satellite-derived soil moisture based on
geostatistical techniques and multiple regression, Remote Sens., 12, 665,
<a href="https://doi.org/10.3390/rs12040665" target="_blank">https://doi.org/10.3390/rs12040665</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
Loew, A., Stacke, T., Dorigo, W., de Jeu, R., and Hagemann, S.: Potential and limitations of multidecadal satellite soil moisture observations for selected climate model evaluation studies, Hydrol. Earth Syst. Sci., 17, 3523–3542, <a href="https://doi.org/10.5194/hess-17-3523-2013" target="_blank">https://doi.org/10.5194/hess-17-3523-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
Lu, J., Carbone, G. J., and Grego, J. M.: Uncertainty and hotspots in 21st
century projections of agricultural drought from CMIP5 models, Sci. Rep., 9,
4922, <a href="https://doi.org/10.1038/s41598-019-41196-z" target="_blank">https://doi.org/10.1038/s41598-019-41196-z</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
Martens, B., Miralles, D. G., Lievens, H., van der Schalie, R., de Jeu, R. A. M., Fernández-Prieto, D., Beck, H. E., Dorigo, W. A., and Verhoest, N. E. C.: GLEAM v3: satellite-based land evaporation and root-zone soil moisture, Geosci. Model Dev., 10, 1903–1925, <a href="https://doi.org/10.5194/gmd-10-1903-2017" target="_blank">https://doi.org/10.5194/gmd-10-1903-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
McCarty, W., Coy, L., Gelaro, R., Huang, A., Merkova, D., Smith, E. B.,
Sienkiewicz, M., and Wargan, K.: MERRA-2 input observations: summary and
assessment, Goddard Space Flight Center, Greenbelt, MD, USA, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
Melton, J. R., Verseghy, D. L., Sospedra-Alfonso, R., and Gruber, S.: Improving permafrost physics in the coupled Canadian Land Surface Scheme (v.3.6.2) and Canadian Terrestrial Ecosystem Model (v.2.1) (CLASS-CTEM), Geosci. Model Dev., 12, 4443–4467, <a href="https://doi.org/10.5194/gmd-12-4443-2019" target="_blank">https://doi.org/10.5194/gmd-12-4443-2019</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>52</label><mixed-citation>
Molero, B., Leroux, D. J., Richaume, P., Kerr, Y. H., Merlin, O., Cosh, M.
H., and Bindlish, R.: Multi-timescale analysis of the spatial
representativeness of in situ soil moisture data within satellite
footprints: Soil moisture time and spatial scales, J. Geophys. Res.-Atmos.,
123, 3–21, <a href="https://doi.org/10.1002/2017JD027478" target="_blank">https://doi.org/10.1002/2017JD027478</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>53</label><mixed-citation>
Mystakidis, S., Davin, E. L., Gruber, N., and Seneviratne, S. I.:
Constraining future terrestrial carbon cycle projections using
observation-based water and carbon flux estimates, Glob. Change Biol., 22,
2198–2215, <a href="https://doi.org/10.1111/gcb.13217" target="_blank">https://doi.org/10.1111/gcb.13217</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>54</label><mixed-citation>
O, S. and Orth, R.: Global soil moisture data derived through machine
learning trained with in-situ measurements, Sci. Data, 8, 170,
<a href="https://doi.org/10.1038/s41597-021-00964-1" target="_blank">https://doi.org/10.1038/s41597-021-00964-1</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>55</label><mixed-citation>
Padrón, R. S., Gudmundsson, L., and Seneviratne, S. I.: Observational
constraints reduce likelihood of extreme changes in multidecadal land water
availability, Geophys. Res. Lett., 46, 736–744,
<a href="https://doi.org/10.1029/2018GL080521" target="_blank">https://doi.org/10.1029/2018GL080521</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>56</label><mixed-citation>
Pan, S., Pan, N., Tian, H., Friedlingstein, P., Sitch, S., Shi, H., Arora, V. K., Haverd, V., Jain, A. K., Kato, E., Lienert, S., Lombardozzi, D., Nabel, J. E. M. S., Ottlé, C., Poulter, B., Zaehle, S., and Running, S. W.: Evaluation of global terrestrial evapotranspiration using state-of-the-art approaches in remote sensing, machine learning and land surface modeling, Hydrol. Earth Syst. Sci., 24, 1485–1509, <a href="https://doi.org/10.5194/hess-24-1485-2020" target="_blank">https://doi.org/10.5194/hess-24-1485-2020</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>57</label><mixed-citation>
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.:
Scikit-learn: machine learning in Python, J. Mach. Learn. Res., 12,
2825–2830, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>58</label><mixed-citation>
Piles, M., Ballabrera-Poy, J., and Muñoz-Sabater, J.: Dominant features
of global surface soil moisture variability observed by the SMOS satellite,
Remote Sens., 11, 95, <a href="https://doi.org/10.3390/rs11010095" target="_blank">https://doi.org/10.3390/rs11010095</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>59</label><mixed-citation>
Poli, P., Hersbach, H., Dee, D. P., Berrisford, P., Simmons, A. J., Vitart,
F., Laloyaux, P., Tan, D. G. H., Peubey, C., Thépaut, J.-N.,
Trémolet, Y., Hólm, E. V., Bonavita, M., Isaksen, L., and Fisher,
M.: ERA-20C: An atmospheric reanalysis of the twentieth century, J. Climate,
29, 4083–4097, <a href="https://doi.org/10.1175/JCLI-D-15-0556.1" target="_blank">https://doi.org/10.1175/JCLI-D-15-0556.1</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>60</label><mixed-citation>
Rui, H. and Beaudoing, H.: README document for NASA GLDAS version 2 data
products, NASA Goddard Space Flight Center (GES DISC), Greenbelt, MD, USA,
available at: <a href="https://disc.gsfc.nasa.gov/information/documents?title=Hydrology Documentation" target="_blank"/> (last access: 30 July 2021), 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>61</label><mixed-citation>
Seabold, S. and Perktold, J.: Statsmodels: Econometric and statistical
modeling with Python, in: 9th Python in Science Conference, Austin, Texas, 28 June–3 July 2010
<a href="https://doi.org/10.25080/majora-92bf1922-011" target="_blank">https://doi.org/10.25080/majora-92bf1922-011</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>62</label><mixed-citation>
Sitch, S., Huntingford, C., Gedney, N., Levy, P. E., Lomas, M., Piao, S. L.,
Betts, R., Ciais, P., Cox, P., Friedlingstein, P., Jones, C. D., Prentice,
I. C., and Woodward, F. I.: Evaluation of the terrestrial carbon cycle,
future plant geography and climate-carbon cycle feedbacks using five Dynamic
Global Vegetation Models (DGVMs), Glob. Change Biol., 14, 2015–2039,
<a href="https://doi.org/10.1111/j.1365-2486.2008.01626.x" target="_blank">https://doi.org/10.1111/j.1365-2486.2008.01626.x</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>63</label><mixed-citation>
Sospedra-Alfonso, R. and Merryfield, W. J.: Initialization and potential
predictability of soil moisture in the Canadian Seasonal to Interannual
Prediction System, J. Climate, 31, 5205–5224,
<a href="https://doi.org/10.1175/JCLI-D-17-0707.1" target="_blank">https://doi.org/10.1175/JCLI-D-17-0707.1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>64</label><mixed-citation>
Su, C.-H., Ryu, D., Dorigo, W., Zwieback, S., Gruber, A., Albergel, C.,
Reichle, R. H., and Wagner, W.: Homogeneity of a global multisatellite soil
moisture climate data record, Geophys. Res. Lett., 43, 11245–11252,
<a href="https://doi.org/10.1002/2016GL070458" target="_blank">https://doi.org/10.1002/2016GL070458</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>65</label><mixed-citation>
Spinoni, J., Barbosa, P., De Jager, A., McCormick, N., Naumann, G., Vogt, J.
V., Magni, D., Masante, D., and Mazzeschi, M.: A new global database of
meteorological drought events from 1951 to 2016, J. Hydrol. Reg. Stud., 22,
100593, <a href="https://doi.org/10.1016/j.ejrh.2019.100593" target="_blank">https://doi.org/10.1016/j.ejrh.2019.100593</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>66</label><mixed-citation>
Su, C.-H., Ryu, D., Dorigo, W., Zwieback, S., Gruber, A., Albergel, C.,
Reichle, R. H., and Wagner, W.: Homogeneity of a global multisatellite soil
moisture climate data record, Geophys. Res. Lett., 43, 11245–11252,
<a href="https://doi.org/10.1002/2016GL070458" target="_blank">https://doi.org/10.1002/2016GL070458</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>67</label><mixed-citation>
Taylor, K. E., Stouffer, R. J., and Meehl, G. A.: An overview of CMIP5 and
the experiment design, B. Am. Meteorol. Soc., 93, 485–498,
<a href="https://doi.org/10.1175/BAMS-D-11-00094.1" target="_blank">https://doi.org/10.1175/BAMS-D-11-00094.1</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>68</label><mixed-citation>
Tobin, K. J., Torres, R., Crow, W. T., and Bennett, M. E.: Multi-decadal analysis of root-zone soil moisture applying the exponential filter across CONUS, Hydrol. Earth Syst. Sci., 21, 4403–4417, <a href="https://doi.org/10.5194/hess-21-4403-2017" target="_blank">https://doi.org/10.5194/hess-21-4403-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>69</label><mixed-citation>
UCAR/NCAR/CISL/TDD: The NCAR Comannd Language (Version 6.6.2) [Software],
Boulder, CO, USA, <a href="https://doi.org/10.5065/D6WD3XH5" target="_blank">https://doi.org/10.5065/D6WD3XH5</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>70</label><mixed-citation>
Wang, G., Garcia, D., Liu, Y., de Jeu, R., and Johannes Dolman, A.: A
three-dimensional gap filling method for large geophysical datasets:
Application to global satellite soil moisture observations, Environ. Model.
Softw., 30, 139–142, <a href="https://doi.org/10.1016/j.envsoft.2011.10.015" target="_blank">https://doi.org/10.1016/j.envsoft.2011.10.015</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>71</label><mixed-citation>
Wang, Y.: soil_moisture_merge,  Bitbucket repository [code],
<a href="https://bitbucket.org/ywang11/soil_moisture_merge/src/master/" target="_blank"/>
(last access: 30 July 2021),   2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>72</label><mixed-citation>
Wang, Y. and Mao, J.: Global multi-layer soil moisture products,
<a href="https://doi.org/10.6084/m9.figshare.13661312.v1" target="_blank">https://doi.org/10.6084/m9.figshare.13661312.v1</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>73</label><mixed-citation>
Wang, Y., Leng, P., Peng, J., Marzahn, P., and Ludwig, R.: Global
assessments of two blended microwave soil moisture products CCI and SMOPS
with in-situ measurements and reanalysis data, Int. J. Appl. Earth Obs., 94, 102234,
<a href="https://doi.org/10.1016/j.jag.2020.102234" target="_blank">https://doi.org/10.1016/j.jag.2020.102234</a>, 2021a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>74</label><mixed-citation>
Wang, Z., Zhan, C., Ning, L., and Guo, H.: Evaluation of global terrestrial
evapotranspiration in CMIP6 models, Theor. Appl. Climatol., 143, 521–531,
<a href="https://doi.org/10.1007/s00704-020-03437-4" target="_blank">https://doi.org/10.1007/s00704-020-03437-4</a>, 2021b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>75</label><mixed-citation>
Wu, Z., Zhou, J., He, H., Lin, Q., Wu, X., and Xu, Z.: An advanced error
correction methodology for merging in-situ observed and model-based soil
moisture, J. Hydrol., 566, 150–163,
<a href="https://doi.org/10.1016/j.jhydrol.2018.09.018" target="_blank">https://doi.org/10.1016/j.jhydrol.2018.09.018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>76</label><mixed-citation>
Yuan, S. and Quiring, S. M.: Evaluation of soil moisture in CMIP5 simulations over the contiguous United States using in situ and satellite observations, Hydrol. Earth Syst. Sci., 21, 2203–2218, <a href="https://doi.org/10.5194/hess-21-2203-2017" target="_blank">https://doi.org/10.5194/hess-21-2203-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>77</label><mixed-citation>
Zeng, Y., Su, Z., van der Velde, R., Wang, L., Xu, K., Wang, X., and Wen,
J.: Blending satellite observed, model simulated, and in situ measured soil
moisture over Tibetan plateau, Remote Sens., 8, 268,
<a href="https://doi.org/10.3390/rs8030268" target="_blank">https://doi.org/10.3390/rs8030268</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>78</label><mixed-citation>
Zhang, G. and Cook, K. H.: West African monsoon demise: Climatology,
interannual variations, and relationship to seasonal rainfall, J. Geophys.
Res.-Atmos., 119, 10175–10193, <a href="https://doi.org/10.1002/2014JD022043" target="_blank">https://doi.org/10.1002/2014JD022043</a>, 2014.
</mixed-citation></ref-html>--></article>
