<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="data-paper">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">ESSD</journal-id><journal-title-group>
    <journal-title>Earth System Science Data</journal-title>
    <abbrev-journal-title abbrev-type="publisher">ESSD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Earth Syst. Sci. Data</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1866-3516</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/essd-18-4451-2026</article-id><title-group><article-title>High-resolution, multi-depth mapping of soil bulk density and pH in China's forests using machine learning</article-title><alt-title>Forest soil bulk density and pH in China</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Chen</surname><given-names>Jizhen</given-names></name>
          
        <ext-link>https://orcid.org/0000-0003-4668-0876</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Zhang</surname><given-names>Xin</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Fan</surname><given-names>Zihao</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Liu</surname><given-names>Tao</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Xiao</surname><given-names>Wenfa</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4">
          <name><surname>Sun</surname><given-names>Qiwu</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff5">
          <name><surname>Sun</surname><given-names>Xiangyang</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2">
          <name><surname>Huang</surname><given-names>Zhilin</given-names></name>
          <email>hzlin66@caf.ac.cn</email>
        </contrib>
        <aff id="aff1"><label>1</label><institution>Key Laboratory of Forest Ecology and Environment of National Forestry and Grassland Administration, Ecology and Nature Conservation Institute, Chinese Academy of Forestry, Beijing 100091, China</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Hubei Zigui Three Gorges Reservoir Forest Ecosystem Observation and Research Station, Zigui 443600, China</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Department of Earth System Science, Ministry of Education Key Laboratory for Earth System Modeling, Institute for Global Change Studies, Tsinghua University, Beijing 100084, China</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Research Institute of Forestry, Chinese Academy of Forestry, Beijing 100091, China</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>College of Forestry, Beijing Forestry University, Beijing 100083, China</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Zhilin Huang (hzlin66@caf.ac.cn)</corresp></author-notes><pub-date><day>30</day><month>June</month><year>2026</year></pub-date>
      
      <volume>18</volume>
      <issue>6</issue>
      <fpage>4451</fpage><lpage>4474</lpage>
      <history>
        <date date-type="received"><day>15</day><month>August</month><year>2025</year></date>
           <date date-type="rev-request"><day>5</day><month>November</month><year>2025</year></date>
           <date date-type="rev-recd"><day>28</day><month>May</month><year>2026</year></date>
           <date date-type="accepted"><day>18</day><month>June</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Jizhen Chen et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026.html">This article is available from https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026.html</self-uri><self-uri xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026.pdf">The full text article is available as a PDF file from https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e171">Precise monitoring of forest soil bulk density (BD) and pH is crucial for addressing global challenges like carbon sequestration and soil acidification. However, existing national soil maps, primarily derived from comprehensive ecosystem samples, inadequately represent the distinct characteristics and high spatial heterogeneity of China's vast and diverse forest ecosystems. To bridge this gap, we present high-resolution (90 m), forest-specific maps of soil BD and pH across China. Leveraging 4356 forest soil profiles collected through extensive field surveys and 41 environmental covariates within an optimized Quantile Regression Forests (QRF) framework incorporating forward recursive feature selection (FRFS), we generated wall-to-wall predictions for five standardized depth intervals (0–5, 5–15, 15–30, 30–60, 60–100 cm). Model performance, assessed through 10-fold cross-validation (CV) and independent validation (IV), achieved model efficiency coefficients (MEC) ranging from 0.78 to 0.89 (CV) and 0.60 to 0.66 (IV) for BD, and from 0.83 to 0.87 (CV) and 0.71 to 0.81 (IV) for pH, indicating the product's strong capability to capture the spatial variability of forest soil properties across China. The 90 m resolution BD and pH maps contribute to the GlobalSoilMap initiative and provide forest-specific inputs for regional Earth system and land surface models. These products advance the quantification of soil acidification processes and provide critical baseline data for estimating forest soil carbon stocks across China. The dataset is available at <ext-link xlink:href="https://doi.org/10.57760/sciencedb.25375" ext-link-type="DOI">10.57760/sciencedb.25375</ext-link> (Chen and Huang, 2025).</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>National Key Research and Development Program of China</funding-source>
<award-id>2021FY100800</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e186">Soil bulk density (BD) and pH are fundamental properties that govern the physical and chemical environment of forest soils. They critically influence key ecosystem processes such as water infiltration, nutrient cycling, and microbial activity, and serve as essential parameters for quantifying forest carbon storage and assessing soil acidification dynamics at regional to global scales (Dai et al., 2019; Kleber et al., 2021). Consequently, accurate, high-resolution spatial information on these properties is indispensable for advancing our understanding of forest ecosystem functions and their responses to environmental change (Zhu et al., 2016; Huang et al., 2022b; Xu et al., 2015).</p>
      <p id="d2e189">Bulk density and pH are foundational descriptors of forest soil condition, reflecting, respectively, the physical structure and chemical environment of the soil system. In forest ecosystems, bulk density reflects the cumulative influence of litter accumulation, root development, organic matter incorporation, and disturbance on soil structure, making it distinct from bulk density patterns observed in other land-use systems (Brady and Weil, 2016; Chen et al., 2024). Soil pH, in turn, is strongly shaped by canopy composition, litter quality, decomposition pathways, and leaching-related acidification, and thus serves as an important indicator of forest-specific nutrient cycling and belowground ecological processes (Abdullah et al., 2025). Spatially explicit information of BD and pH is essential for forest soil studies. Bulk density enters directly into the calculation of area-based soil organic carbon stocks, so spatial variation in BD contributes to uncertainty in regional- and national-scale estimates of forest soil carbon storage (Lal, 2005; Kumar et al., 2022). Soil pH provides complementary information on the chemical environment of forest soils through its influence on nutrient availability, microbial processes, and acidification status, making it central to the assessment of forest soil health and ecosystem functioning (Liu et al., 2024b; Abdullah et al., 2025). Conventional soil survey methods can provide reliable measurements of these properties, but they are labor-intensive, time-consuming, and often insufficient for characterizing their fine-scale variability across large, forested regions (Arrouays et al., 2014). This challenge is particularly acute in China, where forests span pronounced climatic gradients, complex topography, and high vegetation diversity, generating substantial heterogeneity in soil properties across both geographical space and soil depth (Chen et al., 2016; Patton et al., 2019; Zhang et al., 2024b). High-resolution, forest-specific mapping is therefore needed to improve the representation of soil structural and chemical variability across China's forest domain and to provide a spatial basis for forest soil inventories, carbon accounting, and ecological decision-making.</p>
      <p id="d2e192">Digital Soil Mapping (DSM) addresses the need by serving as an innovative alternative to traditional soil surveys, aiming to generate continuous, high-resolution soil property maps through the quantitative modeling of relationships between soil observations and environmental covariates (McBratney et al., 2003; Minasny et al., 2013; Padarian et al., 2019). In recent years, significant progress has been made in developing high-resolution digital soil maps for key properties like BD and pH at national and global scales. Prominent examples include the SoilGrids 2.0 global product (Poggio et al., 2021), ChinaSoilInfoGrids (Liu et al., 2022a), and the China Soil Data Library version 2 (CSDLv2) (Shi et al., 2025). However, these existing datasets, while valuable for broad-scale applications, are primarily derived from soil profiles sampled across mixed land cover types (e.g., cropland, grassland, and forest). For dedicated forest ecosystem applications, these general-purpose products exhibit critical limitations. First, they rely on legacy soil profiles from national surveys (e.g., the Second National Soil Survey of China) and global databases where forest soils constitute only a minor fraction (typically less than 20 %) of the total samples (Zhao et al., 2019; Chen et al., 2022; Liu et al., 2024b). This inadequate representation fails to capture the distinct pedogenic processes, vegetation–soil interactions, and topography-driven heterogeneity inherent to diverse forest ecosystems. Second, the predictive models and environmental covariates within these datasets are optimized for broad-scale mapping across all ecosystems. Consequently, they may overlook forest-specific controlling factors (e.g., forest type) that are crucial for accurately predicting forest soil BD and pH (Zhao et al., 2019; Chen et al., 2022; Liu et al., 2024b).</p>
      <p id="d2e195">China's forest ecosystems cover <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:mn mathvariant="normal">209</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">6</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> ha, spanning diverse climatic zones and complex topographies, and encompass 452 vegetation types, making it one of the most ecologically diverse forest regions on Earth (Chen et al., 2016; Patton et al., 2019; Zhang et al., 2024a). Forest soils exhibit considerable heterogeneity across geographical space, influenced by long-term climatic gradients, vegetation succession, and topographic variation (Zhao et al., 2019; Chen et al., 2022; Liu et al., 2024b). Revealing the spatial distribution of forest BD and pH is therefore essential not only for estimating carbon storage and assessing soil acidification risks, but also for strengthening the scientific basis of national-scale soil assessment and ecological decision-making (Zhu et al., 2016; Huang et al., 2022b; Xu et al., 2015). Spatially explicit information on these properties can improve carbon accounting, support forest soil inventories, and help identify areas vulnerable to soil degradation and acidification. However, to the best of our knowledge, no previous study has estimated these properties at high resolution across China's forests.</p>
      <p id="d2e214">To address this gap, we assembled a comprehensive forest soil database for China, comprising 4356 soil profiles and 11 873 samples across five standardized depth intervals (0–5, 5–15, 15–30, 30–60, and 60–100 cm), thereby capturing the broad ecological variability of China's forest ecosystems. Based on this database, we developed a forest-specific DSM framework to produce high-resolution (90 m) maps of BD and pH across China's forest domain. The key objectives of this study are: The objectives of this study were to: (1) establish a nationwide forest soil profile database; (2) generate high-resolution digital maps of forest soil BD and pH; and (3) provide a spatially explicit dataset to support forest soil inventories, carbon accounting, and assessments of soil acidification risk. The resulting maps provide continuous, wall-to-wall characterization of BD and pH at 90 m resolution, consistent with GlobalSoilMap standards, and offer a baseline for national-scale analyses of forest soil condition and ecosystem functioning.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Materials and Methods</title>
      <p id="d2e225">We developed 90 m resolution forest soil BD and pH grids for China (0–100 cm) using an optimized QRF model, a machine learning algorithm effective for both spatial prediction and uncertainty quantification (Szatmári et al., 2024). This framework integrated 4356 georeferenced forest soil profiles, combining historical inventory data (2018–2023). Sampling efforts were designed to ensure ecological and spatial representativeness across major climatic zones and forest types. Soil profiles were harmonized into standardized depth intervals (0–5, 5–15, 15–30, 30–60, and 60–100 cm) using an adaptive equal-area spline method (Bishop et al., 1999; Liu et al., 2022a) and randomly partitioned into training (80 %) and independent validation (20 %) subsets The spatial distributions of the training and validation samples were examined to ensure that both subsets maintain comparable spatial coverage across regions and soil depths (Fig. S2 and Tables S2–S4 in the Supplement). A set of 41 environmental covariates, aligned with soil-forming factors (Jenny, 1941), were resampled to a 90 m grid via bilinear interpolation. Feature selection and hyperparameter tuning were implemented to optimize model performance. Predictive accuracy was evaluated using 10-fold cross-validation and independent validation based on a withheld dataset. A summary of the modelling framework is shown in Fig. 1.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e230">Workflow diagram for forest soil mapping.</p></caption>
        <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f01.png"/>

      </fig>

<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Data compilation</title>
<sec id="Ch1.S2.SS1.SSS1">
  <label>2.1.1</label><title>Soil database</title>
      <p id="d2e253">We developed a comprehensive forest soil property database for China, representing the most extensive and up-to-date collection of forest soil data to date (Fig. 2). The data were compiled from two major nationwide forest soil surveys conducted in 2018 and 2023, complemented by independently conducted regional forest soil surveys during the intervening years to enhance spatial and ecological representation. Detailed metadata for these regional surveys, including survey year, geographic coverage, and profile numbers, are provided in Table S2. These surveys employed a stratified sampling design to ensure broad representativeness across China's major forest ecosystems, covering diverse climate zones, forest types, and topographic gradients, with detailed profile counts for each category summarized in Tables S3–S6 in the Supplement.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e258">Spatial distribution of soil sampling plots and forest coverage.</p></caption>
            <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f02.png"/>

          </fig>

      <p id="d2e267">Quality control procedures included the removal of duplicated records, verification of geographic coordinates, and screening of potential outliers in soil BD and pH. Potential outliers were identified using the interquartile range (IQR) method, with values below <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:mtext>Q1</mml:mtext><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1.5</mml:mn><mml:mo>×</mml:mo><mml:mtext>IQR</mml:mtext></mml:mrow></mml:math></inline-formula> or above <inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:mtext>Q3</mml:mtext><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1.5</mml:mn><mml:mo>×</mml:mo><mml:mtext>IQR</mml:mtext></mml:mrow></mml:math></inline-formula> flagged for manual inspection. Flagged records were then checked against the original field and laboratory records, and only confirmed abnormal values were removed. Geographic coordinates were checked for missing values, duplicate locations, and obvious spatial inconsistencies. In addition, all sampling points were overlaid with the forest mask and the environmental covariate rasters used for modelling; points falling outside the forest extent or on pixels with missing covariate values were excluded from further analysis.</p>
      <p id="d2e303">Data harmonization involved the standardization of measurement units, alignment of soil depth intervals to the five target layers using mass-preserving spline functions, and the reconciliation of metadata across different survey sources. After rigorous quality control and data harmonization, the final integrated dataset comprises 4356 profiles and 11 873 samples. These profiles were collected between 2018 and 2023, with profile counts of 2045 in 2018, 157 in 2019, 530 in 2021, 621 in 2022, and 1003 in 2023, reflecting the timing and scope of different survey campaigns. The spatial distribution of sampling plots and forest coverage is displayed in Fig. 2.</p>
      <p id="d2e306">To ensure data comparability and minimize measurement errors, all samples were processed under identical conditions. Soil sampling and analysis followed standardized protocols to ensure data consistency. Soil samples were collected using a soil auger, air-dried at room temperature, homogenized, and passed through a 2 mm sieve to remove coarse fragments and roots for physicochemical analyses. Undisturbed soil cores were collected from each horizon using a cutting ring sampler to determine BD. Soil pH was measured using a pH meter following the potentiometric method, with a soil-to-water ratio of <inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">2.5</mml:mn></mml:mrow></mml:math></inline-formula> (<inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:mi>w</mml:mi><mml:mo>/</mml:mo><mml:mi>v</mml:mi></mml:mrow></mml:math></inline-formula>). Reference materials were used throughout the analytical process to ensure measurement accuracy and control data quality.</p>
</sec>
<sec id="Ch1.S2.SS1.SSS2">
  <label>2.1.2</label><title>Standard soil depths</title>
      <p id="d2e341">Following GlobalSoilMap specifications (Arrouays et al., 2014), soil samples are typically standardized to fixed depth intervals of 0–5, 5–15, 15–30, 30–60, 60–100, and 100–200 cm. To model continuous depth functions from soil property measurements recorded by genetic horizons, equal-area quadratic spline interpolation is commonly used (Bishop et al., 1999). However, natural soil profiles often contain abrupt changes in properties between adjacent horizons, leading to inconsistencies with these standardized depth layers. To address this issue and reduce fitting errors, we applied an adaptive equal-area spline method (Liu et al., 2022a). This method detects abrupt transitions by calculating the ratio of property values between adjacent horizons and applying a predefined threshold. When such discontinuities are identified, a 1 cm transitional layer is inserted between the affected horizons before spline fitting. This adjustment ensures improved consistency with the observed morphological structure of each soil profile. While the GlobalSoilMap framework includes the 100–200 cm interval, our study focused on the upper five layers (0–5, 5–15, 15–30, 30–60, and 60–100 cm) due to the limited number of forest soil profiles extending beyond 100 cm in depth. </p>
</sec>
<sec id="Ch1.S2.SS1.SSS3">
  <label>2.1.3</label><title>Environmental Covariates</title>
      <p id="d2e353">Soil formation is governed by the combined effects of climate, parent material, topography, vegetation, and human activities. In this study, 41 environmental covariates were selected based on the soil-forming factor framework (Jenny, 1941; Minasny et al., 2013) and categorized into five groups: parent material, climate, organisms, topography, and intrinsic soil properties (Table S1 and Fig. S1 in the Supplement). To reduce multicollinearity, a variance inflation factor (VIF) threshold of less than 10 was applied through iterative variable exclusion. The VIF values of all retained predictors are reported in Tables S8 and S9 in the Supplement.</p>
      <p id="d2e356">All covariate layers were reprojected to a common Albers Equal Area projection based on the WGS84 datum and resampled to a unified spatial resolution of 90 m. For multi-year variables, long-term annual means and growing season (May–September) averages were calculated from monthly records spanning 2003 to 2023, thereby capturing both historical trends and contemporary environmental conditions relevant to forest soil development.</p>
      <p id="d2e359">Climate-related covariates included temperature and precipitation derived from gridded climate datasets (Peng et al., 2019), potential evapotranspiration from the corresponding evapotranspiration dataset (Ding and Peng, 2021), and solar radiation from the TerraClimate dataset (Abatzoglou et al., 2018). Parent material characteristics were obtained from Sentinel-2 imagery using the shortwave infrared band (B12) and the B8<inline-formula><mml:math id="M6" display="inline"><mml:mo>/</mml:mo></mml:math></inline-formula>B12 band ratio to estimate clay content. Depth to Bedrock (DTB) data were incorporated to represent weathering intensity, and lithological context was supplemented using the Geological Map of China. Topographic attributes were extracted from the NASADEM digital elevation model (Peng et al., 2024) and computed using SAGA GIS (<uri>http://www.saga-gis.org</uri>, last access: 20 January 2025). Organism indicators were sourced from MODIS products, including NDVI, LAI, and NPP, while forest type classifications were based on the National Atlas of Forest Vegetation in China. Detailed source information and references for all environmental covariates are provided in Table S1.</p>
</sec>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Modelling</title>
<sec id="Ch1.S2.SS2.SSS1">
  <label>2.2.1</label><title>Covariate selection</title>
      <p id="d2e389">To balance model parsimony with biogeochemical interpretability, we adapted the Forward Recursive Feature Selection (FRFS) approach proposed by Xiao et al. (2022) into a depth-specific selection framework, applied independently to five standardized soil layers. The procedure comprised three sequential steps. First, the covariate with the highest predictive importance, as assessed by permutation-based Random Forest analysis, was selected to initiate the model. Subsequently, additional variables were iteratively added based on two criteria: a reduction of more than 5 % in five-fold cross-validated root mean square error (RMSE) and a variance inflation factor (VIF) below 10. The selection process was automatically terminated when five consecutive iterations failed to achieve an RMSE improvement of at least 1 %, thereby avoiding model overfitting. This hierarchical strategy ensured effective dimensionality reduction while maintaining predictive performance across all soil depths. The framework was applied across five distinct soil horizons, enabled depth-specific dimensionality reduction, retaining 7 to 16 covariates per soil layer while eliminating 60.98 % to 77.93 % of the initial predictor set (Tables S9 and S10 in the Supplement). </p>
</sec>
<sec id="Ch1.S2.SS2.SSS2">
  <label>2.2.2</label><title>Predictive models</title>
      <p id="d2e401">Quantile Regression Forests (QRF), a nonparametric ensemble learning method extending the Random Forest framework, were used to model the relationships between environmental covariates and soil properties, while explicitly quantifying predictive uncertainty (Meinshausen, 2006). As a state-of-the-art algorithm in DSM (Liu et al., 2022a; Poggio et al., 2021; Pouladi et al., 2019), QRF leverages both bootstrap aggregation of regression trees and randomized feature subset selection at each node, enabling robust handling of high-dimensional, non-stationary data.</p>
      <p id="d2e404">Unlike standard Random Forests, QRF retains the full conditional distribution <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:mi>F</mml:mi><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>∥</mml:mo><mml:mi>X</mml:mi><mml:mo>=</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> At each prediction node, allowing estimation of both point predictions and confidence intervals. This is achieved via kernel-based empirical distribution construction:

              <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M8" display="block"><mml:mrow><mml:mover accent="true"><mml:mi>F</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mi mathvariant="normal">|</mml:mi><mml:mi>X</mml:mi><mml:mo>=</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mi>I</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>≤</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

            where <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> Is the weight assigned to each training observation based on terminal node proximity. The conditional quantile function is derived as:

              <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M10" display="block"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">α</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mo movablelimits="false">inf⁡</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mi>y</mml:mi><mml:mo>:</mml:mo><mml:mover accent="true"><mml:mi>F</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>∣</mml:mo><mml:mi>X</mml:mi><mml:mo>=</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>≥</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></disp-formula>

            for a given quantile level <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. This allows the derivation of the median estimate <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mn mathvariant="normal">0.5</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, prediction intervals <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>a</mml:mi><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>a</mml:mi><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, and the full uncertainty distribution, enhancing both interpretability and decision support in forest soil assessments. Here, <inline-formula><mml:math id="M14" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> denotes the quantile level, with <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn></mml:mrow></mml:math></inline-formula> corresponding to the median prediction. This formulation generates three interconnected outputs: the median predictor as a robust central tendency estimate, prediction intervals for heteroscedastic uncertainty quantification, and the complete conditional distribution through parametric evaluation of <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">α</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> across the <inline-formula><mml:math id="M17" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> continuum.</p>
      <p id="d2e709">To implement QRF across China's forested regions, we adopted a spatially parallel computing framework. The study area was divided into 461 contiguous grid tiles (<inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mn mathvariant="normal">10</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>) using the Albers Equal Area projection. Model execution was carried out using the quantregForest package in R 4.2.1, running on 24 logical cores of a high-performance computing node. Spatial continuity was preserved across grid boundaries using a Gaussian kernel-based edge matching algorithm, enabling seamless 90 m resolution prediction without artifacts.</p>
</sec>
<sec id="Ch1.S2.SS2.SSS3">
  <label>2.2.3</label><title>Hyperparameter tuning</title>
      <p id="d2e740">Hyperparameter optimization was conducted for three parameters critical to model performance: mtry (number of variables randomly sampled at each split), num.trees (number of trees), and min.node.size (minimum samples per terminal node). The randomized search strategy was employed, guided by 10-fold cross-validation and using RMSE as the evaluation metric. Fifty iterations of parameter space sampling were performed to identify the optimal combination. Final hyperparameter values were selected based on configurations that yielded the highest prediction accuracy on the validation dataset. A summary of optimized parameters for each soil property and depth interval is provided in Table S3.</p>
</sec>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Model validation and Interpretability</title>
      <p id="d2e752">To comprehensively evaluate model performance, we applied two complementary validation strategies: 10-fold cross-validation on the training dataset (80 %) and independent validation using a held-out test set (20 %). These schemes were implemented across the entire study region to assess the predictive accuracy of forest soil BD and pH.</p>
      <p id="d2e755">In 10-fold cross-validation, the training set was randomly partitioned into ten equal subsets. In each iteration, nine subsets were used to train the model, and the remaining one was used for validation. This procedure was repeated ten times, ensuring each subset served as validation data exactly once. Model accuracy was assessed by averaging performance metrics across folds, including mean error (ME), root mean square error (RMSE), and the model efficiency coefficient (MEC).</p>
      <p id="d2e758">For independent validation, the reserved test set was excluded entirely from model training and hyperparameter tuning, thereby providing an unbiased evaluation of generalizability. The formulas used for calculating the evaluation metrics are as follows: 

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M19" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E3"><mml:mtd><mml:mtext>3</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtext>ME</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E4"><mml:mtd><mml:mtext>4</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtext>RMSE</mml:mtext><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E5"><mml:mtd><mml:mtext>5</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtext>MEC</mml:mtext><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the observed soil property value, <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the predicted value, and <inline-formula><mml:math id="M22" display="inline"><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:math></inline-formula> is the mean of observed values. ME, also referred to as bias, measures average deviation. RMSE reflects the overall prediction error, with lower values indicating higher accuracy. MEC, equivalent to the coefficient of determination (<inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>), ranges from 0 to 1, with higher values indicating better predictive performance.</p>
      <p id="d2e1001">To enhance model interpretability and quantify the contribution of each environmental covariate to the predictions of soil BD and pH across the five soil layers, we utilized SHAP (SHapley Additive exPlanations) (Lundberg and Lee, 2017), a game-theoretic approach that assigns each feature a Shapley value representing its contribution to model output. SHAP provides a unified framework for evaluating feature importance and the directionality of each feature's effect on model predictions. This allows us to determine not only how each environmental factor influences the model's predictions for soil BD and pH but also revealing the direction (positive or negative) and magnitude of the feature's impact on that specific prediction. SHAP values were computed using TreeSHAP based on the training dataset.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Uncertainty Quantification</title>
      <p id="d2e1012">Quantifying spatial uncertainty is essential in DSM, as prediction errors may arise from input data variability, model structure, and environmental heterogeneity (Arrouays et al., 2014; Poggio et al., 2021; Liu et al., 2022a; Shi et al., 2025). To visualize the spatial distribution of prediction uncertainty, we calculated the Prediction Interval Ratio (PIR), defined as the ratio between the 90 % prediction interval width and the median estimate:

            <disp-formula id="Ch1.E6" content-type="numbered"><label>6</label><mml:math id="M24" display="block"><mml:mrow><mml:mtext>PIR</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">0.95</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">0.05</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">0.50</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">0.95</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">0.05</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> represent the upper and lower bounds of the 90 % prediction interval, respectively, and <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">0.50</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> denotes the median prediction. PIR is a dimensionless metric that quantifies the relative spread of prediction uncertainty around the central estimate. Higher PIR values indicate greater dispersion and, therefore, higher predictive uncertainty.</p>
      <p id="d2e1079">To evaluate the calibration of these uncertainty estimates, we used the Prediction Interval Coverage Probability (PICP), computed from the independent validation dataset (Goovaerts, 2001). PICP measures the proportion of observed values that fall within their corresponding prediction intervals at a specified confidence level (e.g., 90 %). A well-calibrated model should yield a PICP value close to the nominal coverage. For example, a 90 % prediction interval is considered reliable if the empirical PICP also approximates 90 %. Systematic deviation from this benchmark can indicate miscalibration: a PICP above the target level suggests that intervals are too narrow (underestimated uncertainty), while a PICP below the target indicates overly wide intervals (overestimated uncertainty) (Poggio et al., 2021; Liang et al., 2019). This diagnostic approach supports the robust interpretation of uncertainty in DSM outputs.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Forest soil database</title>
      <p id="d2e1098">As shown in Fig. 3 and Table S7 in the Supplement, the dataset comprises 4356 forest soil profiles distributed nationwide. Soil properties were standardized to fixed depth intervals (0–5, 5–15, 15–30, 30–60, and 60–100 cm) using the equal-area spline method, resulting in 15 845 horizons for BD and 15 978 horizons for pH, which were visualized using violin plots to illustrate their distribution across soil depths. Differences among soil depth intervals were assessed using the Kruskal–Wallis test, followed by Dunn's post hoc pairwise comparisons with Bonferroni correction. Statistical significance was determined at <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula> based on adjusted <inline-formula><mml:math id="M29" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>-values.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e1122">Variation of bulk density (BD) and pH across soil depths. Violin plots illustrate the distribution of observations, with boxplots showing medians and interquartile ranges. Horizontal bars indicate significant pairwise differences among soil depths (Dunn's test with Bonferroni correction; <sup>∗</sup>, <sup>∗∗</sup>, <sup>∗∗∗</sup> denote <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>, 0.01, and 0.001, respectively).</p></caption>
          <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f03.png"/>

        </fig>

      <p id="d2e1178">Forest soil BD exhibited a non-linear vertical distribution pattern, characterized by an increase from surface soils to mid-depth layers followed by a decrease in deeper layers (Fig. 3a; Table S7). Specifically, mean BD values were similar in the two surface layers (1.206 <inline-formula><mml:math id="M34" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> at 0–5 cm and 1.209 <inline-formula><mml:math id="M35" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> at 5–15 cm), increased in the 15–30 cm layer (1.287 <inline-formula><mml:math id="M36" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>), and reached the maximum value in the 30–60 cm layer (1.340 <inline-formula><mml:math id="M37" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>). In the deepest layer (60–100 cm), mean BD declined to 1.242 <inline-formula><mml:math id="M38" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. The violin plots revealed a relatively compact distribution of BD across all soil layers, with no pronounced extreme outliers (Fig. 3a). The left and right contours of the violin plots were well balanced, corresponding to low skewness values (0.16–0.42), indicating nearly symmetrical distributions. BD differed significantly among soil depth intervals (Fig. 3a), with statistically significant differences observed between layers (<inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>).</p>
      <p id="d2e1280">Forest soil pH exhibited a monotonic vertical distribution pattern, with values increasing progressively with soil depth (Fig. 3b; Table S7). Mean pH increased from 6.066 in the 0–5 cm layer to 6.131 at 5–15 cm and 6.172 at 15–30 cm, followed by a more pronounced increase in deeper layers, reaching 6.458 at 30–60 cm and 6.466 at 60–100 cm. The violin plots for the upper layers (0–5, 5–15, and 15–30 cm) showed more extended contours, whereas the deeper layers (30–60 and 60–100 cm) exhibited wider upper contours, indicating a higher density of pH values toward the higher end of the scale (Fig. 3b). The violin plot profiles for pH were nearly symmetric, with low skewness values (0.05–0.19), indicating a distribution close to normal. Pairwise comparison results revealed that soil pH formed two distinct depth-related levels, with no significant differences within the 0–30 and 30–100 cm intervals, but significant differences between these two groups (Fig. 3b).</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Model performance</title>
      <p id="d2e1291">Table 1 presents the evaluation results of model predictive performance based on 10-fold cross-validation (CV) and independent validation (IV). Overall, model performance varied between soil properties and validation strategies, yet consistently indicated reliable predictive capability across all depth intervals.</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e1297">Predictive performance of bulk density (BD) and pH predictions.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right" colsep="1"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Validation</oasis:entry>
         <oasis:entry colname="col2">Depth (cm)</oasis:entry>
         <oasis:entry rowsep="1" namest="col3" nameend="col5" align="center" colsep="1">10-fold CV </oasis:entry>
         <oasis:entry rowsep="1" namest="col6" nameend="col8" align="center">IV </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">MEC</oasis:entry>
         <oasis:entry colname="col4">RMSE</oasis:entry>
         <oasis:entry colname="col5">ME</oasis:entry>
         <oasis:entry colname="col6">MEC</oasis:entry>
         <oasis:entry colname="col7">RMSE</oasis:entry>
         <oasis:entry colname="col8">ME</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">BD</oasis:entry>
         <oasis:entry colname="col2">0–5</oasis:entry>
         <oasis:entry colname="col3">0.782</oasis:entry>
         <oasis:entry colname="col4">0.090</oasis:entry>
         <oasis:entry colname="col5">0.000</oasis:entry>
         <oasis:entry colname="col6">0.598</oasis:entry>
         <oasis:entry colname="col7">0.164</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.010</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">5–15</oasis:entry>
         <oasis:entry colname="col3">0.815</oasis:entry>
         <oasis:entry colname="col4">0.084</oasis:entry>
         <oasis:entry colname="col5">0.000</oasis:entry>
         <oasis:entry colname="col6">0.611</oasis:entry>
         <oasis:entry colname="col7">0.181</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.017</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">15–30</oasis:entry>
         <oasis:entry colname="col3">0.828</oasis:entry>
         <oasis:entry colname="col4">0.081</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.657</oasis:entry>
         <oasis:entry colname="col7">0.155</oasis:entry>
         <oasis:entry colname="col8">0.006</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">30–60</oasis:entry>
         <oasis:entry colname="col3">0.874</oasis:entry>
         <oasis:entry colname="col4">0.079</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.614</oasis:entry>
         <oasis:entry colname="col7">0.166</oasis:entry>
         <oasis:entry colname="col8">0.005</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">60–100</oasis:entry>
         <oasis:entry colname="col3">0.889</oasis:entry>
         <oasis:entry colname="col4">0.087</oasis:entry>
         <oasis:entry colname="col5">0.000</oasis:entry>
         <oasis:entry colname="col6">0.656</oasis:entry>
         <oasis:entry colname="col7">0.166</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.019</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">pH</oasis:entry>
         <oasis:entry colname="col2">0–5</oasis:entry>
         <oasis:entry colname="col3">0.844</oasis:entry>
         <oasis:entry colname="col4">0.215</oasis:entry>
         <oasis:entry colname="col5">0.000</oasis:entry>
         <oasis:entry colname="col6">0.705</oasis:entry>
         <oasis:entry colname="col7">0.432</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.003</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">5–15</oasis:entry>
         <oasis:entry colname="col3">0.834</oasis:entry>
         <oasis:entry colname="col4">0.254</oasis:entry>
         <oasis:entry colname="col5">0.000</oasis:entry>
         <oasis:entry colname="col6">0.726</oasis:entry>
         <oasis:entry colname="col7">0.480</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">15–30</oasis:entry>
         <oasis:entry colname="col3">0.854</oasis:entry>
         <oasis:entry colname="col4">0.214</oasis:entry>
         <oasis:entry colname="col5">0.000</oasis:entry>
         <oasis:entry colname="col6">0.742</oasis:entry>
         <oasis:entry colname="col7">0.448</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.007</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">30–60</oasis:entry>
         <oasis:entry colname="col3">0.854</oasis:entry>
         <oasis:entry colname="col4">0.256</oasis:entry>
         <oasis:entry colname="col5">0.001</oasis:entry>
         <oasis:entry colname="col6">0.760</oasis:entry>
         <oasis:entry colname="col7">0.515</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">60–100</oasis:entry>
         <oasis:entry colname="col3">0.868</oasis:entry>
         <oasis:entry colname="col4">0.238</oasis:entry>
         <oasis:entry colname="col5">0.001</oasis:entry>
         <oasis:entry colname="col6">0.812</oasis:entry>
         <oasis:entry colname="col7">0.492</oasis:entry>
         <oasis:entry colname="col8">0.014</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e1714">For soil BD, the MEC values obtained from 10-fold CV range from 0.782 to 0.889, with RMSE values between 0.079 and 0.090 <inline-formula><mml:math id="M49" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, while the mean error (ME) remains close to zero. Under the independent validation scheme, MEC values are slightly lower (0.598–0.657), accompanied by higher RMSE values (0.155–0.181 <inline-formula><mml:math id="M50" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>); however, the predictive performance remains acceptable, and ME values are likewise close to zero, indicating the absence of systematic bias.</p>
      <p id="d2e1752">Model performance for soil pH is consistently higher. This result is consistent with previous studies indicating pH as a more stable and predictable property at regional scales (Abdullah et al., 2025). The MEC values derived from 10-fold CV range from 0.834 to 0.868, with RMSE values of 0.214–0.256, whereas the independent validation yields MEC values of 0.705–0.812 and RMSE values of 0.432–0.515. Across all soil depths, ME values are consistently close to zero, further confirming the unbiased nature of the predictions. The comparable performance between the CV and IV results suggests good model generalizability across soil depths.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Spatial patterns</title>
      <p id="d2e1763">We applied the QRF model to generate spatially explicit maps of soil BD and pH across five depth intervals in forested regions of China. To characterize their spatial patterns, we calculated depth-specific mean values of BD and pH along both latitudinal and longitudinal gradients. In addition, the prediction results were statistically summarized and visualized using boxplots to illustrate regional differences (Figs. 4 and 5).</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e1768">Spatially explicit maps of predicted bulk density (BD) at depths of 0–5 cm <bold>(a)</bold>, 5–15 cm <bold>(b)</bold>, 15–30 cm <bold>(c)</bold>, 30–60 cm <bold>(d)</bold>, and 60–100 cm <bold>(e)</bold>, and their differences across China's seven geographic regions. Subfigure <bold>(A)</bold> illustrates the spatial distribution (prediction maps) of forest soil BD across the five soil layers, together with the latitudinal and longitudinal profile-averaged BD values for each layer. Subfigure <bold>(B)</bold> presents boxplots of BD for China as a whole and for each of the seven geographic regions under different soil layers.</p></caption>
          <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f04.png"/>

        </fig>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e1801">Spatially explicit maps of predicted soil pH at depths of 0–5 cm <bold>(a)</bold>, 5–15 cm <bold>(b)</bold>, 15–30 cm <bold>(c)</bold>, 30–60 cm <bold>(d)</bold>, and 60–100 cm <bold>(e)</bold>, and their differences across China's seven geographic regions. Subfigure <bold>(A)</bold> shows the spatial distribution (prediction maps) of forest soil pH across the five soil layers, together with the latitudinal and longitudinal profile-averaged pH values for each layer. Subfigure <bold>(B)</bold> presents boxplots of soil pH for China as a whole and for each of the seven geographic regions across different soil layers.</p></caption>
          <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f05.png"/>

        </fig>

<sec id="Ch1.S3.SS3.SSS1">
  <label>3.3.1</label><title>Spatial patterns of BD</title>
      <p id="d2e1840">As shown in Fig. 4, the predicted spatial distribution of BD closely matches the statistical characteristics of the observations, indicating that the model effectively captures the underlying spatial patterns of soil BD. The mapping results show that the mean BD values across soil layers range from 1.16 to 1.34 <inline-formula><mml:math id="M51" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, with standard deviations between 0.15 and 0.21 <inline-formula><mml:math id="M52" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. Along the vertical profile, forest soil BD exhibits an increasing trend with depth followed by stabilization: values increase from 1.16 <inline-formula><mml:math id="M53" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> in the surface layer (0–5 cm) to a maximum of 1.34 <inline-formula><mml:math id="M54" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> at 30–60 cm, before slightly decreasing to 1.25 <inline-formula><mml:math id="M55" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> in the 60–100 cm layer. This vertical pattern is consistent with that observed in the field measurements (Fig. 3a), further supporting the model's ability to reproduce the vertical gradient of BD.</p>
      <p id="d2e1928">As shown in Fig. 4A, forest soil BD in China exhibits a clear decreasing trend along the latitudinal gradient and a unimodal pattern along the longitudinal gradient. Specifically, mean BD values decrease systematically from south to north with increasing latitude across all soil layers. Along the longitudinal gradient, BD follows a unimodal pattern, increasing from west to east, peaking at approximately 105° E, and subsequently decreasing. This peak approximately corresponds to the transition zone between China's second and first topographical steps, suggesting a potential influence of terrain–climate interactions on soil BD. At the pixel scale, the simulations reveal pronounced regional heterogeneity in forest soil BD. Together, the latitudinal and longitudinal gradients define a broad spatial pattern characterized by higher BD values in southwestern regions and lower values in northeastern regions (Fig. 4). Southwestern China consistently exhibits higher BD values across all soil layers (mean of 2.03 <inline-formula><mml:math id="M56" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>), whereas northeastern regions show comparatively lower values (mean of 1.36 <inline-formula><mml:math id="M57" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>). Notably, the magnitude of vertical BD variation differs substantially among regions. Specifically, northeastern regions show the largest increase in BD from surface to deeper layers (from 1.16 to 1.34 <inline-formula><mml:math id="M58" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>), whereas northwestern regions exhibit relatively stable BD values across depths (from 1.95 to 1.96 <inline-formula><mml:math id="M59" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>). Overall, the model successfully reproduces both the spatial and vertical variations of forest soil BD, offering a comprehensive depiction of its distribution across China.</p>
</sec>
<sec id="Ch1.S3.SS3.SSS2">
  <label>3.3.2</label><title>Spatial patterns of pH</title>
      <p id="d2e2007">We next examine the spatial patterns of soil pH, which are also well reproduced by the model (Fig. 5c and d). The resulting forest soil pH maps show mean values ranging from 5.70 to 6.06 across all soil layers, with standard deviations between 0.65 and 0.81 (Fig. 5A), indicating moderate spatial variability at the national scale. Specifically, soil pH exhibited a generally increasing trend with depth. Mean pH values changed from 5.99 at 0–5 cm to 5.70 at 15–30 cm, increased to 6.06 at 30–60 cm, and then slightly decreased to 6.01 at 60–100 cm. The pattern of pH variation across soil layers observed in the prediction map is consistent with the statistical results from field data (Fig. 3b), indicating the model's strong performance.</p>
      <p id="d2e2010">As shown in Fig. 5B, the spatial distribution of soil pH in Chinese forests exhibits a distinct latitudinal unimodal pattern, combined with an overall decreasing trend from west to east and localized sharp declines. Specifically, along the latitudinal gradient, soil pH first increases and then decreases with latitude, peaking at approximately 39° N, which corresponds to the transition between the warm temperate zone and semi-arid regions of China. Along the longitudinal gradient, pH values generally decrease with increasing longitude, with a pronounced decline around 77° E, which may reflect the influence of the major topographic barrier associated with the eastern Tibetan Plateau on soil processes. As a result, forest soil pH exhibits an overall increasing gradient from southwestern China toward northern regions. Northwestern China exhibits the highest pH values (mean range from 6.75 to 7.31), whereas the lowest values occur in southern regions (mean range from 5.01 to 5.35). Regional differences were also evident in the magnitude of depth-related pH variation. Northwestern China exhibits the largest vertical variation in mean pH across soil depths, with a range of 0.14, whereas the smallest variation is observed in northeastern regions, with a range of only 0.01. In summary, the predicted maps successfully reproduce the pronounced spatial patterns of forest soil pH across China, characterized by higher values in the northwest and lower values in southern regions.</p>
</sec>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Prediction uncertainty</title>
      <p id="d2e2022">Visualization of prediction uncertainty using the PIR highlighted clear regional variations in uncertainty for BD and pH across China. Higher uncertainty for BD was concentrated in the northeastern and southwestern regions, while lower uncertainty characterized the southeastern coastal areas (Fig. S3 in the Supplement). Conversely, pH uncertainty was more pronounced in northern China and parts of the southwest, with relatively lower levels observed in the northeast and the central-eastern coastal zone (Fig. S4 in the Supplement). Overall, areas of elevated uncertainty predominantly coincided with southwestern China, where complex soil-landscape interactions likely contribute to increased model uncertainty. Additionally, regions with sparse data coverage, such as high-altitude areas, exhibited amplified extrapolation uncertainty due to limited representation in the training dataset, further challenging model reliability in these environments. For both BD and pH, prediction uncertainty generally increased with soil depth, a pattern potentially attributable to the reduced availability of soil observations at deeper intervals.</p>
      <p id="d2e2025">To ensure that biased uncertainty estimates do not compromise practical applications of the model, we further employed the PICP to perform this critical validation step. Five predictive accuracy plots were generated to evaluate the alignment of predicted intervals with actual observations for BD and pH (Fig. 6). The QRF-based digital soil mapping model showed close adherence to the <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> reference line across both properties, indicating strong consistency in local uncertainty estimation. However, for pH, a slight overestimation of uncertainty was detected at intermediate probability levels (60 %–90 %) within subsurface layers (0–60 cm), suggesting minor deviations from optimal calibration. In contrast, uncertainty quantification for BD remained well-calibrated across all depth intervals and probability thresholds.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e2042">Validation of uncertainty quantifications. Prediction interval coverage probability (PICP) plots derived from model predictions for bulk density <bold>(</bold>BD; <bold>a–e)</bold> and soil pH <bold>(g–j)</bold>.</p></caption>
          <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f06.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS5">
  <label>3.5</label><title>SHAP Analysis of Predictive Factors</title>
      <p id="d2e2069">SHAP analysis revealed that climate-related covariates accounted for most of the model-attributed spatial variation in forest soil BD and pH across China. The relative importance of individual predictors differed markedly between BD and pH, as well as among soil depths. For BD, annual precipitation (PRE_A), annual potential evapotranspiration (PET_A), and growing-season solar radiation (SR_GS) consistently ranked among the most influential predictors. In contrast, PRE_A dominated pH predictions across most soil layers, whereas the importance of the normalized difference water index (NDWI_A) increased with soil depth. These depth-stratified patterns are evident in both the SHAP importance rankings and the summary plots (Figs. 7 and 8).</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e2074">Interpreting the effects of driving factors on forest soil bulk density (BD) across different soil depths: SHAP-based feature importance and covariate contribution patterns. Subfigure <bold>(A)</bold> shows SHAP-based analysis of feature importance for soil BD driving factors. Subfigure <bold>(B)</bold> shows quantification of contribution rates of covariate categories to soil BD. Variable importance for model training at different soil depths. The abbreviations of the predictors are defined in Table S1.</p></caption>
          <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f07.png"/>

        </fig>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e2091">Interpreting the effects of driving factors on forest soil pH across different soil depths: SHAP-based feature importance and covariate contribution patterns. Subfigure <bold>(A)</bold> shows SHAP-based analysis of feature importance for soil pH driving factors. Subfigure <bold>(B)</bold> shows quantification of contribution rates of covariate categories to soil pH. Variable importance for model training at different soil depths. The abbreviations of the predictors are defined in Table S1.</p></caption>
          <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f08.png"/>

        </fig>

<sec id="Ch1.S3.SS5.SSS1">
  <label>3.5.1</label><title>SHAP Analysis for BD</title>
      <p id="d2e2114">SHAP analysis reveals that the spatial variation of soil BD in Chinese forests is predominantly explained in the model by climate-related factors (Fig. 7). Climate-related factors collectively contribute approximately 71.09 %–77.80 % of the total mean absolute SHAP attribution across soil layers. Specifically, PRE_A, PET_A, and SR_GS consistently rank as the top three influential factors across all layers, with their combined mean absolute SHAP values ranging from 0.55 to 0.67 and representing the majority of the model-attributed importance (approximately 63.72 %–72.82 %). Other contributing factors include Elevation (mean absolute SHAP value 0.08–0.12), suggesting that variables related to evaporation–precipitation balance and terrain play an important role in the model-predicted variation of forest soil BD. This finding aligns with previous studies on the factors affecting BD (Liu et al., 2022a).</p>
      <p id="d2e2117">The dominance of climate-driven patterns varies with soil depth. In the surface layer (0–30 cm), PET_A shows the highest model-attributed importance, whereas at deeper layers (<inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> cm), PRE_A becomes the most influential predictor in the model. At the 0–30 cm depth, PET_A exhibits the largest mean absolute SHAP values (0.21–0.25), with an importance approximately 1.31 times that of PRE_A (0.15–0.21). At the 30–100 cm depth, the mean absolute SHAP values of PRE_A (0.20–0.42) increase substantially, reaching up to 2.58 times those of PET_A (0.06–0.18), and representing a large share of the model-attributed importance in the 60–100 cm layer (approximately 23.94 %–45.45 %). In comparison, the SHAP value of SR_GS consistently ranks among the top three factors, with values ranging from 0.17 to 0.19. This suggests that the relative importance of individual climate-related covariates differs across soil depths, with surface layers showing stronger attribution to PET_A and deeper layers to PRE_A.</p>
      <p id="d2e2130">The SHAP summary plot further clarifies the relationships between BD and key covariates. Both PET_A and PRE_A show substantial SHAP contributions to BD predictions across all soil layers. Higher values of these factors are generally associated with more positive SHAP contributions, indicating that higher PET_A and PRE_A tend to increase the model-predicted BD. Notably, the absolute magnitude of PET_A's SHAP contribution is lower in deeper soil layers (from 0.25 in surface soil to 0.06 in deeper layers), whereas the absolute magnitude of PRE_A's negative SHAP contribution is higher in deeper soil layers (from 0.15 to 0.42). Potential evapotranspiration influences surface soil BD through seasonal moisture fluctuations, whereas, in deeper soils, precipitation-related processes such as leaching may contribute to higher BD (Xu et al., 2022; Liu et al., 2024a), which is consistent with the observed SHAP contribution patterns. In contrast, SR_GS exhibits predominantly negative SHAP contributions, suggesting that increased growing-season solar radiation is associated with lower predicted BD. The magnitude of SR_GS's negative SHAP contribution remains relatively stable across soil layers (0.15–0.19). To summarize, the dominance of climate factors in influencing BD varies notably across different soil depths, with surface soil BD being more strongly influenced by growing season potential evapotranspiration, while deeper soil BD is predominantly associated with precipitation-related predictors within the model framework.</p>
</sec>
<sec id="Ch1.S3.SS5.SSS2">
  <label>3.5.2</label><title>SHAP Analysis for pH</title>
      <p id="d2e2141">As shown in Fig. 8, the spatial variation of soil pH in Chinese forests is predominantly explained by climate-related factors in the model. Climate-related factors collectively account for approximately 46.06 % to 66.25 % of the total mean absolute SHAP attribution across soil layers (Fig. 8B). Specifically, PRE_A exhibits the highest mean absolute SHAP values for pH, with values ranging from 0.18 to 0.52 across soil layers and representing the largest share of model-attributed importance (approximately 17.40 %–46.32 %). SR_GS ranks second, with mean absolute SHAP values ranging from 0.04 to 0.12, contributing a moderate proportion of the model-attributed importance (approximately 4.18 %–11.89 %). This result is consistent with previous studies, which have identified precipitation as the dominant factor controlling soil pH at large spatial scales (Huang et al., 2025). Less important but still informative covariates include Elevation, suggesting that climate-related variables dominate the model-attributed spatial patterns of pH at the national scale, while terrain-related effects may play a greater role locally.</p>
      <p id="d2e2144">The SHAP values of factors influencing forest soil pH vary considerably with soil depth. In the surface layer (0–60 cm), PRE_A exhibits the highest model-attributed importance, whereas in deeper layers (60–100 cm), NDWI_A becomes the leading predictor. At the 0–60 cm depth, PRE_A is the most important factor (SHAP values 0.32–0.52), with its importance being 2.91 times that of NDWI_A (SHAP values 0.05–0.16). At the 60–100 cm depth, the mean absolute SHAP value of PRE_A decreases to 0.04, whereas NDWI_A becomes the most influential predictor (SHAP value 0.20), representing the largest share of model-attributed importance in this layer (19.51 %). Additionally, SR_GS ranks among the top three influential factors for the 0–60 cm layer (SHAP values 0.09–0.12), but its importance considerably decreases in the 60–100 cm layer (SHAP value 0.04). Overall, climate-related covariates tend to exhibit stronger model-attributed importance in shallower soil layers.</p>
      <p id="d2e2147">As illustrated in Fig. 8, the relationship between pH and key covariates shows considerable variation with soil depth. Both PRE_A and SR_GS show substantial SHAP contributions to soil pH predictions across all soil layers. Higher values of these factors are generally associated with negative SHAP contributions, indicating that increased PRE_A and SR_GS tend to decrease the model-predicted soil pH. Specifically, the absolute magnitude of the negative SHAP contributions of PRE_A and SR_GS is lower in deeper soil layers, from 0.34 to 0.18 for PRE_A and from 0.12 to 0.04 for SR_GS. Forest surface soil pH tends to decrease with increased precipitation, which may be attributed to enhanced leaching of base cations under wetter conditions. Increased rainfall accelerates the downward transport and loss of buffering cations, thereby potentially weakening the soil's acid-neutralizing capacity and promoting soil acidification (Huang et al., 2022b). Similarly, NDWI_A shows a negative influence, with its impact varying considerably across soil layers. At 0–30 cm, the influence is stronger (SHAP values 0.14–0.15), but it decreases between 15–30 cm (SHAP value 0.06) and increases again to the highest level at 30–100 cm (SHAP values 0.16–0.20). NDWI_A has been shown to correlate with soil water content. A possible explanation for this pattern is that NDWI_A may capture hydrological variability that is associated with the redistribution of soluble ions, which in turn may influence soil pH at different depths (Serrano et al., 2019). In general, the findings emphasize the depth-dependent effects of climate factors on forest soil pH, with precipitation emerging as a key predictor associated with soil acidification patterns in the model.</p>
</sec>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Discussion</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Comparison with previous products</title>
      <p id="d2e2167">While national datasets such as CSDLv2 (Shi et al., 2025) and ChinaSoilInfoGrids (Liu et al., 2022a) provide extensive information on soil properties, and global products such as SoilGrids 2.0 (Poggio et al., 2021) offer valuable worldwide references, their application to forest ecosystem studies remains constrained by the limited representation of forest-specific soil-forming environments. To address this data gap, our study developed a comprehensive nationwide forest soil dataset for China and integrated it with the FRFS-QRF model to simulate the spatial distribution of BD and pH in Chinese forests. By comparing our results with existing national and global products, we aim to quantify the potential enhancements achievable through the use of a forest ecosystem data foundation and the FRFS-QRF methodology we propose for DSM in forest ecosystems.</p>
      <p id="d2e2170">The proposed model exhibits enhanced predictive accuracy and reliability for forest soil BD and pH when compared to existing datasets, as illustrated in Table 2. The model achieved reductions in RMSE of 27.41 %, 37.26 %, and 49.46 % relative to CSDLv2, ChinaSoilInfoGrids, and SoilGrids 2.0, respectively. For pH, RMSE reductions were 65.98 %, 68.43 %, and 73.69 % against the same benchmarks. The model was calibrated exclusively on forest soils, thereby reducing the environmental and sampling heterogeneity introduced when cropland, grassland, and forest observations are combined in a single training dataset. Although mixed land use datasets remain valuable for broad scale mapping, the relationships they capture may be influenced by anthropogenic processes that are less prevalent in forests, including soil compaction associated with tillage and traffic and pH modification caused by fertilization or liming (Hamza and Anderson, 2005; Guo et al., 2022).</p>

<table-wrap id="T2" specific-use="star"><label>Table 2</label><caption><p id="d2e2176">Predictive performance of soil property predictions, CSDLv2, ChinaSoilInfoGrids and SoilGrids 2.0.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="13">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right" colsep="1"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right" colsep="1"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:colspec colnum="10" colname="col10" align="right" colsep="1"/>
     <oasis:colspec colnum="11" colname="col11" align="right"/>
     <oasis:colspec colnum="12" colname="col12" align="right"/>
     <oasis:colspec colnum="13" colname="col13" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Depth (cm)</oasis:entry>
         <oasis:entry rowsep="1" namest="col2" nameend="col4" align="center" colsep="1">Our predictions </oasis:entry>
         <oasis:entry rowsep="1" namest="col5" nameend="col7" align="center" colsep="1">CSDLv2 </oasis:entry>
         <oasis:entry rowsep="1" namest="col8" nameend="col10" align="center" colsep="1">ChinaSoilInfoGrids </oasis:entry>
         <oasis:entry rowsep="1" namest="col11" nameend="col13" align="center">SoilGrids 2.0 </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MEC</oasis:entry>
         <oasis:entry colname="col3">RMSE</oasis:entry>
         <oasis:entry colname="col4">ME</oasis:entry>
         <oasis:entry colname="col5">MEC</oasis:entry>
         <oasis:entry colname="col6">RMSE</oasis:entry>
         <oasis:entry colname="col7">ME</oasis:entry>
         <oasis:entry colname="col8">MEC</oasis:entry>
         <oasis:entry colname="col9">RMSE</oasis:entry>
         <oasis:entry colname="col10">ME</oasis:entry>
         <oasis:entry colname="col11">MEC</oasis:entry>
         <oasis:entry colname="col12">RMSE</oasis:entry>
         <oasis:entry colname="col13">ME</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col13">BD </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">0–5</oasis:entry>
         <oasis:entry colname="col2">0.782</oasis:entry>
         <oasis:entry colname="col3">0.090</oasis:entry>
         <oasis:entry colname="col4">0.000</oasis:entry>
         <oasis:entry colname="col5">0.620</oasis:entry>
         <oasis:entry colname="col6">0.120</oasis:entry>
         <oasis:entry colname="col7">0.000</oasis:entry>
         <oasis:entry colname="col8">0.483</oasis:entry>
         <oasis:entry colname="col9">0.147</oasis:entry>
         <oasis:entry colname="col10"><inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">0.268</oasis:entry>
         <oasis:entry colname="col12">0.185</oasis:entry>
         <oasis:entry colname="col13">0.025</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5–15</oasis:entry>
         <oasis:entry colname="col2">0.815</oasis:entry>
         <oasis:entry colname="col3">0.084</oasis:entry>
         <oasis:entry colname="col4">0.000</oasis:entry>
         <oasis:entry colname="col5">0.630</oasis:entry>
         <oasis:entry colname="col6">0.110</oasis:entry>
         <oasis:entry colname="col7">0.000</oasis:entry>
         <oasis:entry colname="col8">0.479</oasis:entry>
         <oasis:entry colname="col9">0.138</oasis:entry>
         <oasis:entry colname="col10"><inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">0.279</oasis:entry>
         <oasis:entry colname="col12">0.172</oasis:entry>
         <oasis:entry colname="col13">0.053</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">15–30</oasis:entry>
         <oasis:entry colname="col2">0.828</oasis:entry>
         <oasis:entry colname="col3">0.081</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">0.600</oasis:entry>
         <oasis:entry colname="col6">0.110</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8">0.457</oasis:entry>
         <oasis:entry colname="col9">0.132</oasis:entry>
         <oasis:entry colname="col10"><inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">0.263</oasis:entry>
         <oasis:entry colname="col12">0.16</oasis:entry>
         <oasis:entry colname="col13">0.043</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">30–60</oasis:entry>
         <oasis:entry colname="col2">0.874</oasis:entry>
         <oasis:entry colname="col3">0.079</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">0.550</oasis:entry>
         <oasis:entry colname="col6">0.120</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8">0.403</oasis:entry>
         <oasis:entry colname="col9">0.128</oasis:entry>
         <oasis:entry colname="col10"><inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">0.193</oasis:entry>
         <oasis:entry colname="col12">0.158</oasis:entry>
         <oasis:entry colname="col13">0.050</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">60–100</oasis:entry>
         <oasis:entry colname="col2">0.889</oasis:entry>
         <oasis:entry colname="col3">0.087</oasis:entry>
         <oasis:entry colname="col4">0.000</oasis:entry>
         <oasis:entry colname="col5">0.570</oasis:entry>
         <oasis:entry colname="col6">0.120</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8">0.303</oasis:entry>
         <oasis:entry colname="col9">0.126</oasis:entry>
         <oasis:entry colname="col10"><inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.003</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">0.098</oasis:entry>
         <oasis:entry colname="col12">0.158</oasis:entry>
         <oasis:entry colname="col13">0.065</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col13">pH </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">0–5</oasis:entry>
         <oasis:entry colname="col2">0.844</oasis:entry>
         <oasis:entry colname="col3">0.215</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">0.690</oasis:entry>
         <oasis:entry colname="col6">0.700</oasis:entry>
         <oasis:entry colname="col7">0.000</oasis:entry>
         <oasis:entry colname="col8">0.711</oasis:entry>
         <oasis:entry colname="col9">0.791</oasis:entry>
         <oasis:entry colname="col10">0.001</oasis:entry>
         <oasis:entry colname="col11">0.635</oasis:entry>
         <oasis:entry colname="col12">0.916</oasis:entry>
         <oasis:entry colname="col13"><inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.038</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5–15</oasis:entry>
         <oasis:entry colname="col2">0.834</oasis:entry>
         <oasis:entry colname="col3">0.254</oasis:entry>
         <oasis:entry colname="col4">0.000</oasis:entry>
         <oasis:entry colname="col5">0.700</oasis:entry>
         <oasis:entry colname="col6">0.680</oasis:entry>
         <oasis:entry colname="col7">0.000</oasis:entry>
         <oasis:entry colname="col8">0.724</oasis:entry>
         <oasis:entry colname="col9">0.724</oasis:entry>
         <oasis:entry colname="col10">0.003</oasis:entry>
         <oasis:entry colname="col11">0.652</oasis:entry>
         <oasis:entry colname="col12">0.893</oasis:entry>
         <oasis:entry colname="col13"><inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.066</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">15–30</oasis:entry>
         <oasis:entry colname="col2">0.854</oasis:entry>
         <oasis:entry colname="col3">0.214</oasis:entry>
         <oasis:entry colname="col4">0.000</oasis:entry>
         <oasis:entry colname="col5">0.700</oasis:entry>
         <oasis:entry colname="col6">0.680</oasis:entry>
         <oasis:entry colname="col7">0.000</oasis:entry>
         <oasis:entry colname="col8">0.74</oasis:entry>
         <oasis:entry colname="col9">0.74</oasis:entry>
         <oasis:entry colname="col10">0.002</oasis:entry>
         <oasis:entry colname="col11">0.659</oasis:entry>
         <oasis:entry colname="col12">0.878</oasis:entry>
         <oasis:entry colname="col13"><inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.141</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">30–60</oasis:entry>
         <oasis:entry colname="col2">0.854</oasis:entry>
         <oasis:entry colname="col3">0.256</oasis:entry>
         <oasis:entry colname="col4">0.001</oasis:entry>
         <oasis:entry colname="col5">0.680</oasis:entry>
         <oasis:entry colname="col6">0.700</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.000</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8">0.737</oasis:entry>
         <oasis:entry colname="col9">0.737</oasis:entry>
         <oasis:entry colname="col10">0.004</oasis:entry>
         <oasis:entry colname="col11">0.647</oasis:entry>
         <oasis:entry colname="col12">0.898</oasis:entry>
         <oasis:entry colname="col13"><inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.194</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">60–100</oasis:entry>
         <oasis:entry colname="col2">0.868</oasis:entry>
         <oasis:entry colname="col3">0.238</oasis:entry>
         <oasis:entry colname="col4">0.001</oasis:entry>
         <oasis:entry colname="col5">0.680</oasis:entry>
         <oasis:entry colname="col6">0.700</oasis:entry>
         <oasis:entry colname="col7">0.000</oasis:entry>
         <oasis:entry colname="col8">0.736</oasis:entry>
         <oasis:entry colname="col9">0.736</oasis:entry>
         <oasis:entry colname="col10">0.006</oasis:entry>
         <oasis:entry colname="col11">0.641</oasis:entry>
         <oasis:entry colname="col12">0.888</oasis:entry>
         <oasis:entry colname="col13"><inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.184</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e2870">The SHAP analysis suggests that BD and pH in forest soils are controlled primarily by climatic and hydrological factors. For BD, the dominant predictors were annual precipitation, annual potential evapotranspiration, and growing-season solar radiation, with surface layers showing stronger sensitivity to litter input, organic matter dynamics, drying–rewetting, and root activity, and deeper layers reflecting longer-term water redistribution and profile differentiation (Jenny, 1941; Brady and Weil, 2016). For pH, climatic and hydrological controls also dominated, consistent with sustained leaching of base cations and continuous acid inputs from litter decomposition and root processes in forest ecosystems (Abdullah et al., 2025; Huang et al., 2022b; Liu et al., 2024b). Compared with broader mixed-land-use studies, this pattern suggests a stronger role of climate- and vegetation-related controls in forest ecosystems. In forests, soil properties are more tightly coupled to canopy-mediated water balance, litter inputs, root activity, and biologically mediated acidification, so climate, hydrology, and vegetation can become more influential than the broader cross-ecosystem relationships captured in general-purpose products (Augusto et al., 2022; Huang et al., 2022b). Visual comparison of Figs. 6 and S5 in the Supplement further suggests that divergence from existing datasets may be more pronounced in humid southern forests and mountainous regions of southwestern China, where strong precipitation gradients, steep terrain, and heterogeneous vegetation structure intensify local soil variability.</p>
      <p id="d2e2873">Our results indicate a non-linear vertical pattern of forest soil BD, with BD peaking in the 30–60 cm layer and declining in the 60–100 cm layer (Fig. 9). This profile differs from the commonly reported monotonic increase in BD with depth in many ecosystems, including forest systems where BD generally increases with soil depth (Shen et al., 2022). However, non-monotonic vertical BD distributions have also been documented in forest soils, particularly in relation to long-term stand development and soil profile differentiation (Tang et al., 2025), suggesting that forest soil BD may not universally follow a simple monotonic depth trend. This depth profile is plausibly linked to clay illuviation and deep-root processes. Downward translocation and mid-profile accumulation of fine clay can promote denser packing and reduce pore space, leading to higher BD in intermediate layers, as commonly observed in soils affected by argilluviation (Jenny, 1941; Brady and Weil, 2016). In deeper horizons, lower clay contents together with more pronounced root activity may enhance biopore formation and maintain a looser structure, thereby contributing to lower BD (Jobbágy and Jackson, 2000; Angers et al., 2011). Comparison with existing datasets further suggests that currently available products do not fully represent this complex depth-dependent variation in forest BD. In particular, for the 60–100 cm layer, our model predicts a mean BD of 1.25 <inline-formula><mml:math id="M79" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, lower than the 1.37–1.38 <inline-formula><mml:math id="M80" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> reported by existing datasets (Fig. S5). This discrepancy implies that extrapolations based solely on those datasets may overestimate forest soil organic carbon stocks at this depth.</p>

      <fig id="F9" specific-use="star"><label>Figure 9</label><caption><p id="d2e2912">Bulk density prediction (<inline-formula><mml:math id="M81" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) maps: rows show five soil depths (0–5, 5–15, 15–30, 30–60, and 60–100 cm) and columns show four datasets (Our prediction, CSDLv2, ChinaSoilInfoGrids, and SoilGrids2.0).</p></caption>
          <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f09.png"/>

        </fig>

      <p id="d2e2938">Assuming other terms remain constant, using the SoilGrids estimate (1.37 <inline-formula><mml:math id="M82" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) instead of our forest-specific estimate (1.25 <inline-formula><mml:math id="M83" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) would increase the estimated soil mass, and thus the SOC stock, of the standard 60–100 cm layer by approximately 9.6 %. However, the magnitude and ecological meaning of this bias should be interpreted cautiously because the standardized 60–100 cm interval does not always correspond to a fully developed and ecologically equivalent soil body in forest ecosystems. In this study, all profiles were harmonized to the GlobalSoilMap standard depth intervals to improve comparability among heterogeneous datasets (Bishop et al., 1999; Arrouays et al., 2014). Yet in many forested landscapes, especially mountainous regions, effective soil depth is constrained by topography, parent material, erosion, and shallow bedrock. Under such conditions, the nominal 60–100 cm layer may partly approach saprolite or the bedrock interface, and in some cases may reflect the standardized extension of limited deep-horizon information rather than a complete and ecologically uniform mineral-soil unit. Consequently, uncertainty in deep-soil carbon estimates arises from both BD and the actual soil volume represented by the standard layer. Generalized products may therefore overestimate subsoil carbon storage not only by assigning a higher BD, but also by implicitly assuming a uniformly developed 40 cm soil layer where the effective soil depth is smaller. Although DTB was included as a predictor in this study, future work should more explicitly couple standard-depth soil products with effective soil thickness or depth-to-bedrock constraints to improve the ecological realism and reliability of national-scale deep-soil carbon assessments (Adhikari et al., 2013; Chen et al., 2019).</p>
      <p id="d2e2975">Forest soil pH profiles are primarily governed by chemical processes and generally exhibit an increasing trend with depth. Model comparison results indicate that forest soil pH in China displays a consistent spatial pattern characterized by lower values in southern regions and higher values in northern regions, while vertically increasing with soil depth (Yu et al., 2020). This depth-dependent pattern is in line with the commonly observed pH profiles of forest soils (Fig. 10). However, spatial statistical analyses reveal that, across all soil layers, the mean forest soil pH predicted in this study (5.70–6.01) is consistently lower than the corresponding values reported by existing datasets (5.93–6.23) (Fig. S6 in the Supplement). This systematic difference likely reflects ecosystem-specific acidification processes in forest soils. Long-term leaching under relatively high precipitation regimes promotes the removal of base cations, while the continuous accumulation of acidic inputs derived from organic matter decomposition and root exudation further contributes to soil acidification (Farooq et al., 2022a; Abdullah et al., 2025). As a result, forest soils typically exhibit lower pH values than soils in other ecosystems. These findings suggest that existing datasets may underestimate the degree of soil acidification in Chinese forests, particularly when generalized across ecosystems or derived from limited forest-specific observations.</p>

      <fig id="F10" specific-use="star"><label>Figure 10</label><caption><p id="d2e2981">pH prediction maps: rows show five soil depths (0–5, 5–15, 15–30, 30–60, and 60–100 cm) and columns show four datasets (Our prediction, CSDLv2, ChinaSoilInfoGrids, and SoilGrids2.0).</p></caption>
          <graphic xlink:href="https://essd.copernicus.org/articles/18/4451/2026/essd-18-4451-2026-f10.png"/>

        </fig>

      <p id="d2e2990">Overall, our results indicate that the use of forest-specific datasets and modeling strategies can substantially improve the representation of soil BD and pH in forest ecosystems. The discrepancies identified relative to existing products suggest that generalized soil datasets may overestimate forest soil carbon stocks and underestimate soil acidification, underscoring the necessity of ecosystem-specific digital soil mapping for reliable soil assessments and forest management.</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Potential applications</title>
      <p id="d2e3001">The high-resolution spatial dataset of forest soil BD and pH developed in this study represents a national-scale digital soil mapping product that captures the spatial variability of key soil physical and chemical properties across forested regions in China. Accurate knowledge of BD and pH is fundamental for estimating soil carbon stocks (Batjes, 2016), and monitoring forest ecosystem responses to land-use and climate change (Pan et al., 2011). Bulk density is critical for accurate estimation of soil carbon stocks, while pH governs nutrient availability, microbial activity, and forest productivity (Liu et al., 2024b), and is significant for understanding forest soil acidification (Farooq et al., 2022b). The dataset fills longstanding gaps in forest soil data coverage in China, and supports applications in ecosystem assessment and long-term soil monitoring. Beyond its scientific value, this product contributes to national strategies on carbon neutrality and ecological restoration and aligns with international environmental commitments including the UN Decade on Ecosystem Restoration and the Sustainable Development Goals (UNEP, 2021; IPCC, 2022).</p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Limitations and outlook</title>
      <p id="d2e3012">Our study advances high-resolution DSM in forest ecosystems; however, several methodological limitations remain and merit further investigation, particularly regarding the predictive reliability of machine learning approaches. Machine learning techniques, while significantly enhancing DSM by capturing nonlinear soil–environment relationships, are constrained by limitations in spatial coverage and feature-space representativeness (Yang et al., 2013; Chen et al., 2019). Forest ecosystems exhibit pronounced landscape heterogeneity, complicating sampling design and frequently resulting in imbalanced training datasets (Huang et al., 2022a; Liu et al., 2022b; Shao et al., 2022). As demonstrated by Westhuizen et al. (2024), models trained on such datasets may yield biased predictions in undersampled regions. Although ensemble methods can manage uncertainty in sparse data settings, they may prioritize statistical regularities over mechanistic soil formation processes (Sylvain et al., 2021; Liu et al., 2022b). Consistent with these broader concerns, the discrepancy observed between cross-validation and independent validation in our study suggests that random cross-validation may have benefited from residual spatial autocorrelation among nearby samples, whereas the independent validation results likely offer a more realistic assessment of model transferability.</p>
      <p id="d2e3015">In addition to data-related limitations, uncertainty may also arise from harmonizing environmental covariates originating from heterogeneous native spatial resolutions into a common fine grid. The integration of multi-resolution covariates through resampling is a standard practice in national-scale digital soil mapping to ensure spatially consistent modeling (Shi et al., 2025). However, this process inherently introduces scale-related uncertainty and potential smoothing effects in the resulting predictions, particularly for covariates derived from coarser-resolution sources (Cavazzi et al., 2013; Piedallu et al., 2022). The influence of covariate resolution on prediction accuracy is a recognized challenge in DSM, as the representativeness of environmental predictors can vary substantially with spatial scale (Guo et al., 2019; Adhikari et al., 2020). Importantly, this limitation is not unique to the present study but is widely acknowledged in large-scale mapping initiatives, including established global products such as SoilGrids (Poggio et al., 2021). While predictive models can quantify uncertainty associated with sampling density and algorithmic structure, explicit quantification of uncertainty arising solely from the resampling of multi-resolution covariates remains methodologically challenging and is rarely addressed in existing studies (Wadoux et al., 2020; Sylvain et al., 2021). Consequently, the resulting soil property maps should be interpreted as conditional estimates, representing the most probable spatial patterns given the selected covariates and their effective spatial resolution, rather than exact representations of fine-scale soil heterogeneity (Robb et al., 2025).</p>
      <p id="d2e3018">Finally, incomplete environmental data coverage restricted soil property prediction to areas where all covariates were consistently available. This decision reflects a fundamental trade-off in DSM between spatial completeness and predictive reliability. Omission-based approaches, such as those used in SoLIM (Nussbaum et al., 2018; Zhu et al., 2018), prioritize accuracy by excluding data-deficient regions but result in spatial gaps that limit practical applications. In contrast, imputation techniques such as <inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> window smoothing (Padarian et al., 2019; Fan et al., 2020) preserve continuity but may introduce significant errors, particularly in ecotones where environmental gradients are steep and covariate relationships become unstable (Hengl et al., 2015). In our study, such limitations were most evident along elevational boundaries and remote sensing index thresholds, where fragmented gaps produced compounding spatial uncertainty. To mitigate these effects, predictions were restricted to the maximal common spatial domain defined by all covariates. This conservative approach was based on three considerations: (1) the statistical instability of edge extrapolations, (2) elevated error propagation risks in transitional environments (Dong et al., 2023), and (3) a practical emphasis on accuracy over exhaustive coverage. While this approach slightly reduced the total mappable forest area, it improved the overall reliability of spatial outputs by minimizing uncertainty in marginal regions (Smith, 2001; Huettmann and Gottschalk, 2011). The resulting trade-off underscores the persistent tension between spatial coverage and predictive confidence in large-scale DSM. Future advances should explore adaptive frameworks that integrate uncertainty-aware imputation with hybrid models, using localized gap geometry to guide prediction boundaries (Arrouays et al., 2014). Such innovations may help reconcile spatial completeness with predictive precision in next-generation soil mapping.</p>
      <p id="d2e3033">Looking forward, emerging hybrid frameworks that integrate environmental similarity metrics with pedological expertise show promise for addressing both data imbalance and scale-related challenges (Zhao et al., 2024), although their scalability and operational feasibility at national extents require further validation (Miranda et al., 2023; Potash et al., 2023; Rodrigues et al., 2025). Specifically, strategic sampling designs incorporating stratified and adaptive approaches across diverse forest landscapes and soil types are crucial for mitigating dataset imbalance and capturing underlying heterogeneity (Brus et al., 2011). Concurrently, exploring novel covariates derived from multi-source remote sensing (e.g., hyperspectral, LiDAR, and radar) and proximal sensing (Xue et al., 2025), alongside improved representations of depth-dependent properties and long-term environmental legacies. These data could enrich the feature space, improve the characterization of deep-soil processes, and partly alleviate the current over-reliance on climatic predictors in deeper soil layers (Vaysse and Lagacherie, 2017; Wadoux et al., 2020). Integrating such refined datasets within hybrid modeling frameworks holds considerable potential for improving the accuracy and reliability of forest DSM predictions.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Code and data availability</title>
      <p id="d2e3046">All resources for the ensemble machine learning model, including training and testing code, are publicly available at <ext-link xlink:href="https://doi.org/10.5281/zenodo.20422721" ext-link-type="DOI">10.5281/zenodo.20422721</ext-link> (Chen, 2026). The soil property maps generated in this study include soil pH and BD for five depth intervals (0–5, 5–15, 15–30, 30–60, and 60–100 cm), with a spatial resolution of 90 m. These maps are openly accessible via the platform link: <ext-link xlink:href="https://doi.org/10.57760/sciencedb.25375" ext-link-type="DOI">10.57760/sciencedb.25375</ext-link> (Chen and Huang, 2025). Users can download the datasets efficiently using the provided FTP credentials and any standard FTP client.</p>
</sec>
<sec id="Ch1.S6">
  <label>6</label><title>Usage note</title>
      <p id="d2e3063">It is important to highlight that uncertainties associated with the spatial predictions of soil pH and BD have been not only quantified but also explicitly embedded in the corresponding maps. These uncertainty estimates offer critical insights into the reliability of predictions. Users are strongly encouraged to interpret the pH and BD maps alongside their respective uncertainty layers to ensure scientific rigor in downstream analyses and to support evidence-based decision-making and policy formulation. The inclusion of uncertainty information should not be regarded as a drawback. In fact, the adoption of standardized protocols for uncertainty quantification and reporting, which are now commonly used in DSM, enhances the transparency and applicability of the dataset. Users should also be aware that no spatial map represents a perfect depiction of reality. Interpreting these predictions without considering uncertainty introduces scientific and practical risks. The uncertainty layers serve as a guide for context-sensitive interpretation.</p>
</sec>
<sec id="Ch1.S7" sec-type="conclusions">
  <label>7</label><title>Conclusions</title>
      <p id="d2e3074">Our study developed a high-resolution, forest-specific mapping of BD and pH across China, leveraging forest soil profiles from the latest national forest soil survey. We achieved this detailed characterization across complex forest soil landscapes by integrating the predictive soil mapping paradigm with FRFS, QRF, and a detailed suite of forest-specific soil-forming environmental factors within a high-performance parallel computing environment. This integrated approach not only effectively reduced errors and training time but also enhanced the performance of the final predictive models. The resultant multilayer maps delineate pronounced regional gradients and fine-scale forest soil heterogeneity across depths, outperforming existing products in accuracy, spatial detail, and provision of local uncertainty metrics. These high-resolution forest soil property maps represent a contribution to the GlobalSoilMap.net project and provide critical baseline data for China's forest carbon accounting and understanding of soil acidification processes.</p>
</sec>

      
      </body>
    <back><app-group>
        <supplementary-material position="anchor"><p id="d2e3076">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/essd-18-4451-2026-supplement" xlink:title="pdf">https://doi.org/10.5194/essd-18-4451-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e3087">Conceptualization: JZC; Data curation: QWS, XYS, JZC, ZHF, XZ, and ZLH; Formal analysis: JZC; Funding acquisition: ZLH and WFX; Methodology: JZC and XZ; Supervision: JZC, ZLH, and WFX; Validation: JZC; Writing – original draft preparation: JZC, ZLH, and TL; Writing – review and editing: JZC, ZLH, and TL.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e3093">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e3099">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e3105">We would like to express our gratitude to  Feng Liu at the Institute of Soil Science, Chinese Academy of Sciences (Nanjing, China), for his valuable suggestions that contributed to this study.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e3110">This research has been supported by the National Key Research and Development Program of China, Chinese Polar Environment Comprehensive Investigation and Assessment Programmes (grant no. 2021FY100800).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e3117">This paper was edited by Zhen Yu and reviewed by three anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>Abatzoglou, J. T., Dobrowski, S. Z., Parks, S. A., and Hegewisch, K. C.: TerraClimate, a high-resolution global dataset of monthly climate and climatic water balance from 1958–2015, Sci. Data, 5, 170191, <ext-link xlink:href="https://doi.org/10.1038/sdata.2017.191" ext-link-type="DOI">10.1038/sdata.2017.191</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation>Abdullah, H., Skidmore, A. K., Siegenthaler, A., and Neinavaz, E.: High-resolution prediction of soil pH in european temperate forests using sentinel-2 and ancillary environmental data, Sci. Rep.-UK, 15, 28509, <ext-link xlink:href="https://doi.org/10.1038/s41598-025-03942-4" ext-link-type="DOI">10.1038/s41598-025-03942-4</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>Adhikari, K., Kheir, R. B., Greve, M. B., Bøcher, P. K., Malone, B. P., Minasny, B., McBratney, A. B., and Greve, M. H.: High-resolution 3-D mapping of soil texture in Denmark, Soil Sci. Soc. Am. J., 77, 860–876, <ext-link xlink:href="https://doi.org/10.2136/sssaj2012.0275" ext-link-type="DOI">10.2136/sssaj2012.0275</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>Adhikari, K., Mishra, U., Owens, P., Libohova, Z., Wills, S., Riley, W., Hoffman, F., and Smith, D.: Importance and strength of environmental controllers of soil organic carbon changes with scale, Geoderma, 375, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2020.114472" ext-link-type="DOI">10.1016/j.geoderma.2020.114472</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Angers, D. A., Arrouays, D., Saby, N. P. A., and Walter, C.: Estimating and mapping the carbon saturation deficit of French agricultural topsoils, Soil Use Manage., 27, 448–452, <ext-link xlink:href="https://doi.org/10.1111/j.1475-2743.2011.00366.x" ext-link-type="DOI">10.1111/j.1475-2743.2011.00366.x</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>Arrouays, D., Grundy, M. G., Hartemink, A. E., Hempel, J. W., Heuvelink, G. B. M., Hong, S. Y., Lagacherie, P., Lelyk, G., McBratney, A. B., McKenzie, N. J., Mendonca-Santos, M. d. L., Minasny, B., Montanarella, L., Odeh, I. O. A., Sanchez, P. A., Thompson, J. A., and Zhang, G.-L.: GlobalSoilMap, in: Advances in Agronomy,  Elsevier, 125, 93–134, <ext-link xlink:href="https://doi.org/10.1016/B978-0-12-800137-0.00003-0" ext-link-type="DOI">10.1016/B978-0-12-800137-0.00003-0</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation>Augusto, L., Beaumont, F., Nguyen, C., Fraysse, J., Trichet, P., Meredieu, C., Vidal, D., and Sappin-Didier, V.: Response of soil and vegetation in a warm-temperate pine forest to intensive biomass harvests, phosphorus fertilisation, and wood ash application, Sci. Total Environ., 157907, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2022.157907" ext-link-type="DOI">10.1016/j.scitotenv.2022.157907</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Batjes, N. H.: Harmonized soil property values for broad-scale modelling (WISE30sec) with estimates of global soil carbon stocks, Geoderma, 269, 61–68, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2016.01.034" ext-link-type="DOI">10.1016/j.geoderma.2016.01.034</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Bishop, T. F. A., McBratney, A. B., and Laslett, G. M.: Modelling soil attribute depth functions with equal-area quadratic smoothing splines, Geoderma, 91, 27–45, <ext-link xlink:href="https://doi.org/10.1016/S0016-7061(99)00003-8" ext-link-type="DOI">10.1016/S0016-7061(99)00003-8</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation> Brady, N. C. and Weil, R. R.: The nature and properties of soils, 15th Edn., Global Edition, Pearson, Harlow, England, 1104 pp., ISBN 978-1-292-16224-9, 2016.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>Brus, D. J., Kempen, B., and Heuvelink, G. B. M.: Sampling for validation of digital soil maps, Eur. J. Soil Sci., 62, 394–407, <ext-link xlink:href="https://doi.org/10.1111/j.1365-2389.2011.01364.x" ext-link-type="DOI">10.1111/j.1365-2389.2011.01364.x</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation>Cavazzi, S., Corstanje, R., Mayr, T., Hannam, J., and Fealy, R.: Are fine resolution digital elevation models always the best choice in digital soil mapping?, Geoderma, 195–196, 111–121, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2012.11.020" ext-link-type="DOI">10.1016/j.geoderma.2012.11.020</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>Chen, J.: cjz-ux/ChinaForestSoil-DSM: ChinaForestSoil-DSM (digital_soil_mapping), Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.20422721" ext-link-type="DOI">10.5281/zenodo.20422721</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>Chen, J., Deng, Z., Jiang, Z., Sun, J., Meng, F., Zuo, X., Wu, L., Cao, G., and Cao, S.: Variations of rhizosphere and bulk soil microbial community in successive planting of chinese fir (cunninghamia lanceolata), Front. Plant Sci., 13, 954777, <ext-link xlink:href="https://doi.org/10.3389/fpls.2022.954777" ext-link-type="DOI">10.3389/fpls.2022.954777</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>Chen, J. Z. and Huang, Z. L.: High-resolution maps of forest soil bulk density and pH across China at 90 m resolution [DS/OL], V1, Science Data Bank [data set], <ext-link xlink:href="https://doi.org/10.57760/sciencedb.25375" ext-link-type="DOI">10.57760/sciencedb.25375</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>Chen, L. F., He, Z. B., Du, J., Yang, J. J., and Zhu, X.: Patterns and environmental controls of soil organic carbon and total nitrogen in alpine ecosystems of northwestern China, Catena, 137, 37–43, <ext-link xlink:href="https://doi.org/10.1016/j.catena.2015.08.017" ext-link-type="DOI">10.1016/j.catena.2015.08.017</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>Chen, S., Mulder, V. L., Martin, M. P., Walter, C., Lacoste, M., Richer-de-Forges, A. C., Saby, N. P. A., Loiseau, T., Hu, B., and Arrouays, D.: Probability mapping of soil thickness by random survival forest at a national scale, Geoderma, 344, 184–194, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2019.03.016" ext-link-type="DOI">10.1016/j.geoderma.2019.03.016</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>Chen, S., Chen, Z., Zhang, X., Luo, Z., Schillaci, C., Arrouays, D., Richer-de-Forges, A. C., and Shi, Z.: European topsoil bulk density and organic carbon stock database (0–20 cm) using machine-learning-based pedotransfer functions, Earth Syst. Sci. Data, 16, 2367–2383, <ext-link xlink:href="https://doi.org/10.5194/essd-16-2367-2024" ext-link-type="DOI">10.5194/essd-16-2367-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><mixed-citation>Dai, Y., Shangguan, W., Wei, N., Xin, Q., Yuan, H., Zhang, S., Liu, S., Lu, X., Wang, D., and Yan, F.: A review of the global soil property maps for Earth system models, SOIL, 5, 137–158, <ext-link xlink:href="https://doi.org/10.5194/soil-5-137-2019" ext-link-type="DOI">10.5194/soil-5-137-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><mixed-citation>Ding, Y. and Peng, S.: Spatiotemporal change and attribution of potential evapotranspiration over china from 1901 to 2100, Theor. Appl. Climatol., 145, 79–94, <ext-link xlink:href="https://doi.org/10.1007/s00704-021-03625-w" ext-link-type="DOI">10.1007/s00704-021-03625-w</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><mixed-citation>Dong, H., Huang, S., Wang, H., Huang, Q., Leng, G., Li, Z., Li, L., and Peng, J.: Identifying non-stationarity in the dependence structures of meteorological factors within and across seasons and exploring possible causes, Stoch. Env. Res. Risk A., 37, 4071–4089, <ext-link xlink:href="https://doi.org/10.1007/s00477-023-02496-z" ext-link-type="DOI">10.1007/s00477-023-02496-z</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><mixed-citation>Fan, N. Q., Zhu, A. X., Qin, C. Z., and Liang, P.: Digital soil mapping over large areas with invalid environmental covariate data, ISPRS Int. J. Geo-Inf., 9, 102, <ext-link xlink:href="https://doi.org/10.3390/ijgi9020102" ext-link-type="DOI">10.3390/ijgi9020102</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><mixed-citation>Farooq, T. H., Chen, X., Shakoor, A., Rashid, M. H. U., Kumar, U., Alhomrani, M., Alamri, A. S., Ravindran, B., and Yan, W.: Unraveling the importance of forest structure and composition driving soil microbial and enzymatic responses in the subtropical forest soils, Forests, 13, 1535, <ext-link xlink:href="https://doi.org/10.3390/f13101535" ext-link-type="DOI">10.3390/f13101535</ext-link>, 2022a.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><mixed-citation>Farooq, T. H., Li, Z., Yan, W., Shakoor, A., Kumar, U., Shabbir, R., Peng, Y., Gayathiri, E., Alotaibi, S. S., Wróbel, J., Kalaji, H. M., and Chen, X.: Variations in litterfall dynamics, C:N:P stoichiometry and associated nutrient return in pure and mixed stands of camphor tree and masson pine forests, Front. Environ. Sci., 10, 903039, <ext-link xlink:href="https://doi.org/10.3389/fenvs.2022.903039" ext-link-type="DOI">10.3389/fenvs.2022.903039</ext-link>, 2022b.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><mixed-citation>Goovaerts, P.: Geostatistical modelling of uncertainty in soil science, Geoderma, 103, 3–26, <ext-link xlink:href="https://doi.org/10.1016/S0016-7061(01)00067-2" ext-link-type="DOI">10.1016/S0016-7061(01)00067-2</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><mixed-citation>Guo, C., Wu, Y., Ni, J., and Guo, Y.: Forest carbon storage in guizhou province based on field measurement dataset, Acta Geochim., 38, 8–21, <ext-link xlink:href="https://doi.org/10.1007/s11631-018-0306-3" ext-link-type="DOI">10.1007/s11631-018-0306-3</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><mixed-citation>Guo, J., Feng, H., McNie, P., Wang, W., Peng, C., Feng, L., Sun, J., Pan, C., and Yu, Y.: The effect of the conversion from natural broadleaved forests into Chinese fir (cunninghamia lanceolata (lamb.) hook.) plantations on soil microbial communities and nitrogen functional genes, Forests, 13, 158, <ext-link xlink:href="https://doi.org/10.3390/f13020158" ext-link-type="DOI">10.3390/f13020158</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><mixed-citation>Hamza, M. A. and Anderson, W. K.: Soil compaction in cropping systems, Soil Till. Res., 82, 121–145, <ext-link xlink:href="https://doi.org/10.1016/j.still.2004.08.009" ext-link-type="DOI">10.1016/j.still.2004.08.009</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><mixed-citation>Hengl, T., Heuvelink, G. B. M., Kempen, B., Leenaars, J. G. B., Walsh, M. G., Shepherd, K. D., Sila, A., MacMillan, R. A., Mendes De Jesus, J., Tamene, L., and Tondoh, J. E.: Mapping soil properties of Africa at 250 m resolution: random forests significantly improve current predictions, PLOS One, 10, e0125814, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0125814" ext-link-type="DOI">10.1371/journal.pone.0125814</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><mixed-citation>Huang, H., Yang, L., Zhang, L., Pu, Y., Yang, C., Wu, Q., Cai, Y., Shen, F., and Zhou, C.: A review on digital mapping of soil carbon in cropland: progress, challenge, and prospect, Environ. Res. Lett., 17, 123004, <ext-link xlink:href="https://doi.org/10.1088/1748-9326/aca41e" ext-link-type="DOI">10.1088/1748-9326/aca41e</ext-link>, 2022a.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><mixed-citation>Huang, H., Liu, Y., Liu, Y., Tong, Z., Ren, Z., Xie, Y., Huang, H., Liu, Y., Liu, Y., Tong, Z., Ren, Z., and Xie, Y.: Digital mapping of soil pH and driving factor analysis based on environmental variable screening, Sustainability-Basel, 17, <ext-link xlink:href="https://doi.org/10.3390/su17073173" ext-link-type="DOI">10.3390/su17073173</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><mixed-citation>Huang, X., Cui, C., Hou, E., Li, F., Liu, W., Jiang, L., Luo, Y., and Xu, X.: Acidification of soil due to forestation at the global scale, Forest Ecol. Manag., 505, 119951, <ext-link xlink:href="https://doi.org/10.1016/j.foreco.2021.119951" ext-link-type="DOI">10.1016/j.foreco.2021.119951</ext-link>, 2022b.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><mixed-citation>Huettmann, F. and Gottschalk, T.: Simplicity, model fit, complexity and uncertainty in spatial prediction models applied over time: we are quite sure, aren't we?, in: Predictive Species and Habitat Modeling in Landscape Ecology, edited by: Drew, C. A., Wiersma, Y. F., and Huettmann, F., Springer New York, New York, NY, 189–208, <ext-link xlink:href="https://doi.org/10.1007/978-1-4419-7390-0_10" ext-link-type="DOI">10.1007/978-1-4419-7390-0_10</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><mixed-citation>IPCC: Climate Change 2022: Mitigation of Climate Change. Contribution of Working Group III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, <ext-link xlink:href="https://doi.org/10.1017/9781009157926" ext-link-type="DOI">10.1017/9781009157926</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><mixed-citation> Jenny, H.: Factors of soil formation: a system of quantitative pedology,  OCLC no. 905466, McGraw-Hill, New York, 1941.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><mixed-citation>Jobbágy, E. G. and Jackson, R. B.: The vertical distribution of soil organic carbon and its relation to climate and vegetation, Ecol. Appl., 10, 423–436, <ext-link xlink:href="https://doi.org/10.1890/1051-0761(2000)010[0423:TVDOSO]2.0.CO;2" ext-link-type="DOI">10.1890/1051-0761(2000)010[0423:TVDOSO]2.0.CO;2</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><mixed-citation>Kleber, M., Bourg, I. C., Coward, E. K., Hansel, C. M., Myneni, S. C. B., and Nunan, N.: Dynamic interactions at the mineral–organic matter interface, Nat. Rev. Earth Environ., 2, 402–421, <ext-link xlink:href="https://doi.org/10.1038/s43017-021-00162-y" ext-link-type="DOI">10.1038/s43017-021-00162-y</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><mixed-citation>Kumar, P. S., Shukla, G., Nath, A. J., and Chakravarty, S.: Soil properties, litter dynamics and biomass carbon storage in three-bamboo species of sub-himalayan region of eastern India, Water Air Soil Poll., 233, <ext-link xlink:href="https://doi.org/10.1007/s11270-021-05477-6" ext-link-type="DOI">10.1007/s11270-021-05477-6</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><mixed-citation>Lal, R.: Forest soils and carbon sequestration, Forest Ecol. Manag., 220, 242–258, <ext-link xlink:href="https://doi.org/10.1016/j.foreco.2005.08.015" ext-link-type="DOI">10.1016/j.foreco.2005.08.015</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><mixed-citation>Liang, Z., Chen, S., Yang, Y., Zhao, R., Shi, Z., and Viscarra Rossel, R. A.: National digital soil map of organic matter in topsoil and its associated uncertainty in 1980's China, Geoderma, 335, 47–56, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2018.08.011" ext-link-type="DOI">10.1016/j.geoderma.2018.08.011</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><mixed-citation>Liu, B., Zhao, P., Wu, Y., and Wang, X.: Clarifying the effects of potential evapotranspiration and soil moisture on transpiration in secondary forests of birch in semi-arid regions of China, J. Plant Interact., 19, 2389048, <ext-link xlink:href="https://doi.org/10.1080/17429145.2024.2389048" ext-link-type="DOI">10.1080/17429145.2024.2389048</ext-link>, 2024a.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><mixed-citation>Liu, F., Wu, H., Zhao, Y., Li, D., Yang, J., Song, X., Shi, Z., Zhu, A., and Zhang, G.: Mapping high resolution national soil information grids of China, Sci. Bull., 67, 328–340, <ext-link xlink:href="https://doi.org/10.1016/j.scib.2021.10.013" ext-link-type="DOI">10.1016/j.scib.2021.10.013</ext-link>, 2022a.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><mixed-citation>Liu, F., Yang, F., Zhao, Y., Zhang, G., and Li, D.: Predicting soil depth in a large and complex area using machine learning and environmental correlations, J. Integr. Agr., 21, 2422–2434, <ext-link xlink:href="https://doi.org/10.1016/S2095-3119(21)63692-4" ext-link-type="DOI">10.1016/S2095-3119(21)63692-4</ext-link>, 2022b.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><mixed-citation>Liu, Z., Gu, H., Yao, Q., Jiao, F., Hu, X., Liu, J., Jin, J., Liu, X., and Wang, G.: Soil pH and carbon quality index regulate the biogeochemical cycle couplings of carbon, nitrogen and phosphorus in the profiles of isohumosols, Sci. Total Environ., 922, 171269, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2024.171269" ext-link-type="DOI">10.1016/j.scitotenv.2024.171269</ext-link>, 2024b.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><mixed-citation>Lundberg, S. and Lee, S.-I.: A unified approach to interpreting model predictions, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1705.07874" ext-link-type="DOI">10.48550/arXiv.1705.07874</ext-link>, 25 November 2017.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><mixed-citation>McBratney, A. B., Mendonça Santos, M. L., and Minasny, B.: On digital soil mapping, Geoderma, 117, 3–52, <ext-link xlink:href="https://doi.org/10.1016/S0016-7061(03)00223-4" ext-link-type="DOI">10.1016/S0016-7061(03)00223-4</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><mixed-citation> Meinshausen, N.: Quantile regression forests, J. Mach. Learn. Res., 7, 983–999, 2006.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><mixed-citation>Minasny, B., McBratney, A. B., Malone, B. P., and Wheeler, I.: Digital mapping of soil carbon, in: Advances in Agronomy, Elsevier, 118,1–47, <ext-link xlink:href="https://doi.org/10.1016/B978-0-12-405942-9.00001-3" ext-link-type="DOI">10.1016/B978-0-12-405942-9.00001-3</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><mixed-citation>Miranda, R., Nobrega, R., Silva, E., Silva, J., Araújo Filho, J., Moura, M., Barros, A., Souza, A., Verhoef, A., Yang, W., Shao, H., Srinivasan, R., Ziadat, F., Montenegro, S., Araújo, M., and Galvíncio, J.: Hybrid machine learning for integrating pedological knowledge into digital soil mapping to advance next-generation earth system models, Earth ArXiv [preprint], <ext-link xlink:href="https://doi.org/10.31223/X57P9W" ext-link-type="DOI">10.31223/X57P9W</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><mixed-citation>Nussbaum, M., Spiess, K., Baltensweiler, A., Grob, U., Keller, A., Greiner, L., Schaepman, M. E., and Papritz, A.: Evaluation of digital soil mapping approaches with large sets of environmental covariates, SOIL, 4, 1–22, <ext-link xlink:href="https://doi.org/10.5194/soil-4-1-2018" ext-link-type="DOI">10.5194/soil-4-1-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><mixed-citation>Padarian, J., Minasny, B., and McBratney, A. B.: Using deep learning for digital soil mapping, SOIL, 5, 79–89, <ext-link xlink:href="https://doi.org/10.5194/soil-5-79-2019" ext-link-type="DOI">10.5194/soil-5-79-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib52"><label>52</label><mixed-citation>Pan, Y., Birdsey, R. A., Fang, J., Houghton, R., Kauppi, P. E., Kurz, W. A., Phillips, O. L., Shvidenko, A., Lewis, S. L., Canadell, J. G., Ciais, P., Jackson, R. B., Pacala, S. W., McGuire, A. D., Piao, S., Rautiainen, A., Sitch, S., and Hayes, D.: A large and persistent carbon sink in the world's forests, Science, 333, 988–993, <ext-link xlink:href="https://doi.org/10.1126/science.1201609" ext-link-type="DOI">10.1126/science.1201609</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib53"><label>53</label><mixed-citation>Patton, N. R., Lohse, K. A., Seyfried, M. S., Godsey, S. E., and Parsons, S. B.: Topographic controls of soil organic carbon on soil-mantled landscapes, Sci. Rep.-UK, 9, 6390, <ext-link xlink:href="https://doi.org/10.1038/s41598-019-42556-5" ext-link-type="DOI">10.1038/s41598-019-42556-5</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib54"><label>54</label><mixed-citation>Peng, S., Ding, Y., Liu, W., and Li, Z.: 1 km monthly temperature and precipitation dataset for China from 1901 to 2017, Earth Syst. Sci. Data, 11, 1931–1946, <ext-link xlink:href="https://doi.org/10.5194/essd-11-1931-2019" ext-link-type="DOI">10.5194/essd-11-1931-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib55"><label>55</label><mixed-citation>Peng, X., He, G., Wang, G., Yin, R., Yang, R., Peng, Y., Long, T., Zhang, Z., Chen, Y., and Wang, J.: GF-1 WFV satellite images based forest cover mapping in China supported by open land use/cover datasets, Sci. Data, 11, 1355, <ext-link xlink:href="https://doi.org/10.1038/s41597-024-04202-2" ext-link-type="DOI">10.1038/s41597-024-04202-2</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib56"><label>56</label><mixed-citation>Piedallu, C., Pedersoli, E., Chaste, E., Morneau, F., Seynave, I., and Gégout, J.-C.: Optimal resolution of soil properties maps varies according to their geographical extent and location, Geoderma, 412, 115723, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2022.115723" ext-link-type="DOI">10.1016/j.geoderma.2022.115723</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib57"><label>57</label><mixed-citation>Poggio, L., de Sousa, L. M., Batjes, N. H., Heuvelink, G. B. M., Kempen, B., Ribeiro, E., and Rossiter, D.: SoilGrids 2.0: producing soil information for the globe with quantified spatial uncertainty, SOIL, 7, 217–240, <ext-link xlink:href="https://doi.org/10.5194/soil-7-217-2021" ext-link-type="DOI">10.5194/soil-7-217-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib58"><label>58</label><mixed-citation>Potash, E., Guan, K., Margenot, A. J., Lee, D., Boe, A., Douglass, M., Heaton, E., Jang, C., Jin, V., Li, N., Mitchell, R., Namoi, N., Schmer, M., Wang, S., and Zumpf, C.: Multi-site evaluation of stratified and balanced sampling of soil organic carbon stocks in agricultural fields, Geoderma, 438, 116587, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2023.116587" ext-link-type="DOI">10.1016/j.geoderma.2023.116587</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib59"><label>59</label><mixed-citation>Pouladi, N., Møller, A. B., Tabatabai, S., and Greve, M. H.: Mapping soil organic matter contents at field level with cubist, random forest and kriging, Geoderma, 342, 85–92, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2019.02.019" ext-link-type="DOI">10.1016/j.geoderma.2019.02.019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib60"><label>60</label><mixed-citation>Robb, C., Aitkenhead, M., Coull, M., Macfarlane, F., and Matthews, K.: Soil property, carbon stock and peat extent mapping at 10 m resolution in Scotland using digital soil mapping techniques, Eur. J. Soil Sci., 76, <ext-link xlink:href="https://doi.org/10.1111/ejss.70123" ext-link-type="DOI">10.1111/ejss.70123</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib61"><label>61</label><mixed-citation>Rodrigues, H., Ceddia, M. B., Vasques, G. M., Grunwald, S., and Babaeian, E.: AutoRA: an innovative algorithm for automatic delineation of reference areas in support of smart soil sampling and digital soil twins, Front. Soil Sci., 5, 1557566, <ext-link xlink:href="https://doi.org/10.3389/fsoil.2025.1557566" ext-link-type="DOI">10.3389/fsoil.2025.1557566</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib62"><label>62</label><mixed-citation>Serrano, J., Shahidian, S., and Silva, J. M. da: Evaluation of Normalized Difference Water Index as a Tool for Monitoring Pasture Seasonal and Inter-Annual Variability in a Mediterranean Agro-Silvo-Pastoral System, Water, 11, <ext-link xlink:href="https://doi.org/10.3390/w11010062" ext-link-type="DOI">10.3390/w11010062</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib63"><label>63</label><mixed-citation>Shao, S., Su, B., Zhang, Y., Gao, C., Zhang, M., Zhang, H., and Yang, L.: Sample design optimization for soil mapping using improved artificial neural networks and simulated annealing, Geoderma, 413, 115749, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2022.115749" ext-link-type="DOI">10.1016/j.geoderma.2022.115749</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib64"><label>64</label><mixed-citation>Shen, Y., Li, J., Chen, F., Cheng, R., Xiao, W., Wu, L., and Zeng, L.: Correlations between forest soil quality and aboveground vegetation characteristics in hunan province, china, Front. Plant Sci., 13, 1009109, <ext-link xlink:href="https://doi.org/10.3389/fpls.2022.1009109" ext-link-type="DOI">10.3389/fpls.2022.1009109</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib65"><label>65</label><mixed-citation>Shi, G., Sun, W., Shangguan, W., Wei, Z., Yuan, H., Li, L., Sun, X., Zhang, Y., Liang, H., Li, D., Huang, F., Li, Q., and Dai, Y.: A China dataset of soil properties for land surface modelling (version 2, CSDLv2), Earth Syst. Sci. Data, 17, 517–543, <ext-link xlink:href="https://doi.org/10.5194/essd-17-517-2025" ext-link-type="DOI">10.5194/essd-17-517-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib66"><label>66</label><mixed-citation>Smith, P. L.: Geostatistical error management: quantifying uncertainty for environmental sampling and mapping, Technometrics, 43, 238–239, <ext-link xlink:href="https://doi.org/10.1198/tech.2001.s593" ext-link-type="DOI">10.1198/tech.2001.s593</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bib67"><label>67</label><mixed-citation>Sylvain, J. D., Anctil, F., and Thiffault, É.: Using bias correction and ensemble modelling for predictive mapping and related uncertainty: a case study in digital soil mapping, Geoderma, 403, 115153, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2021.115153" ext-link-type="DOI">10.1016/j.geoderma.2021.115153</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib68"><label>68</label><mixed-citation>Szatmári, G., Laborczi, A., Mészáros, J., Takács, K., Benő, A., Koós, S., Bakacsi, Z., and Pásztor, L.: Gridded, temporally referenced spatial information on soil organic carbon for Hungary, Sci. Data, 11, 1312, <ext-link xlink:href="https://doi.org/10.1038/s41597-024-04158-3" ext-link-type="DOI">10.1038/s41597-024-04158-3</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib69"><label>69</label><mixed-citation>Tang, S., Xu, X., Wu, Y., Meng, L., Tawaraya, K., and Cheng, W.: Long-term afforestation of black pine over two centuries asymptotically enhanced SOC and TN stocks in a typical coastal sand dune of Japan, Catena, 249, 108697, <ext-link xlink:href="https://doi.org/10.1016/j.catena.2024.108697" ext-link-type="DOI">10.1016/j.catena.2024.108697</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib70"><label>70</label><mixed-citation> UNEP: Becoming #GenerationRestoration: Ecosystem Restoration for People, Nature and Climate, United Nations Environment Programme, ISBN 978-92-807-3864-3, 2021.</mixed-citation></ref>
      <ref id="bib1.bib71"><label>71</label><mixed-citation>Vaysse, K. and Lagacherie, P.: Using quantile regression forest to estimate uncertainty of digital soil mapping products, Geoderma, 291, 55–64, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2016.12.017" ext-link-type="DOI">10.1016/j.geoderma.2016.12.017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib72"><label>72</label><mixed-citation>Wadoux, A., Minasny, B., and McBratney, A.: Machine learning for digital soil mapping: applications, challenges and suggested solutions, EarthArXiv [preprint], <ext-link xlink:href="https://doi.org/10.31223/OSF.IO/8EQ6S" ext-link-type="DOI">10.31223/OSF.IO/8EQ6S</ext-link>, 6 February 2020.</mixed-citation></ref>
      <ref id="bib1.bib73"><label>73</label><mixed-citation>Westhuizen, S. V. D., Heuvelink, G. B. M., Hofmeyr, D. P., Poggio, L., Nussbaum, M., and Brungard, C.: Mapping soil thickness by accounting for right-censored data with survival probabilities and machine learning, Eur. J. Soil Sci., 75, e13589, <ext-link xlink:href="https://doi.org/10.1111/ejss.13589" ext-link-type="DOI">10.1111/ejss.13589</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib74"><label>74</label><mixed-citation>Xiao, Y., Xue, J., Zhang, X., Wang, N., Hong, Y., Jiang, Y., Zhou, Y., Teng, H., Hu, B., Lugato, E., Richer-de-Forges, A. C., Arrouays, D., Shi, Z., and Chen, S.: Improving pedotransfer functions for predicting soil mineral associated organic carbon by ensemble machine learning, Geoderma, 428, 116208, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2022.116208" ext-link-type="DOI">10.1016/j.geoderma.2022.116208</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib75"><label>75</label><mixed-citation>Xu, L., He, N. P., Yu, G. R., Wen, D., Gao, Y., and He, H. L.: Differences in pedotransfer functions of bulk density lead to high uncertainty in soil organic carbon estimation at regional scales: evidence from chinese terrestrial ecosystems, J. Geophys. Res.-Biogeo., 120, 1567–1575, <ext-link xlink:href="https://doi.org/10.1002/2015JG002929" ext-link-type="DOI">10.1002/2015JG002929</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib76"><label>76</label><mixed-citation>Xu, Z., Man, X., Cai, T., and Shang, Y.: How potential evapotranspiration regulates the response of canopy transpiration to soil moisture and leaf area index of the boreal larch forest in China, Forests, 13, <ext-link xlink:href="https://doi.org/10.3390/f13040571" ext-link-type="DOI">10.3390/f13040571</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib77"><label>77</label><mixed-citation>Xue, J., Zhang, X., Chen, S., Chen, Z., Lu, R., Liu, F., Van Wesemael, B., and Shi, Z.: National-scale mapping topsoil organic carbon of cropland in China using multitemporal sentinel-2 images, Geoderma, 456, 117272, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2025.117272" ext-link-type="DOI">10.1016/j.geoderma.2025.117272</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib78"><label>78</label><mixed-citation>Yang, L., Zhu, A. X., Qi, F., Qin, C. Z., Li, B., and Pei, T.: An integrative hierarchical stepwise sampling strategy for spatial sampling and its application in digital soil mapping, Int. J. Geogr. Inf. Sci., 27, 1–23, <ext-link xlink:href="https://doi.org/10.1080/13658816.2012.658053" ext-link-type="DOI">10.1080/13658816.2012.658053</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib79"><label>79</label><mixed-citation>Yu, Z., Chen, H. Y. H., Searle, E. B., Sardans, J., Ciais, P., Peñuelas, J., and Huang, Z.: Whole soil acidification and base cation reduction across subtropical china, Geoderma, 361, 114107, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2019.114107" ext-link-type="DOI">10.1016/j.geoderma.2019.114107</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib80"><label>80</label><mixed-citation>Zhang, S., Zhou, X., Chen, Y., Du, F., and Zhu, B.: Soil organic carbon fractions in China: spatial distribution, drivers, and future changes, Sci. Total Environ., 919, 170890, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2024.170890" ext-link-type="DOI">10.1016/j.scitotenv.2024.170890</ext-link>, 2024a.</mixed-citation></ref>
      <ref id="bib1.bib81"><label>81</label><mixed-citation>Zhang, Y., Wang, X., Zheng, Y., and Duan, L.: Characteristics of soil organic carbon fractions and influencing factors in different understory mosses in karst urban parks, Sci. Rep.-UK, 14, 27691, <ext-link xlink:href="https://doi.org/10.1038/s41598-024-77735-6" ext-link-type="DOI">10.1038/s41598-024-77735-6</ext-link>, 2024b.</mixed-citation></ref>
      <ref id="bib1.bib82"><label>82</label><mixed-citation>Zhao, C., Long, J., Liao, H., Zheng, C., Li, J., Liu, L., and Zhang, M.: Dynamics of soil microbial communities following vegetation succession in a karst mountain ecosystem, southwest China, Sci. Rep.-UK, 9, 2160, <ext-link xlink:href="https://doi.org/10.1038/s41598-018-36886-z" ext-link-type="DOI">10.1038/s41598-018-36886-z</ext-link>, 2019. </mixed-citation></ref>
      <ref id="bib1.bib83"><label>83</label><mixed-citation>Zhao, F., Zhu, A., Zhu, L., and Qin, C.: iSoLIM: a similarity-based spatial prediction software for the big data era, Ann. Gis, 30, 535–549, <ext-link xlink:href="https://doi.org/10.1080/19475683.2024.2324381" ext-link-type="DOI">10.1080/19475683.2024.2324381</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib84"><label>84</label><mixed-citation>Zhu, A. X., Qin, C. Z., Liang, P., and Du, F.: Digital soil mapping for smart agriculture: the solim method and software platforms, Vestn. Ross. univ. družby nar., Ser. Agron. životnovod., 13, 317–335, <ext-link xlink:href="https://doi.org/10.22363/2312-797X-2018-13-4-317-335" ext-link-type="DOI">10.22363/2312-797X-2018-13-4-317-335</ext-link>, 2018. </mixed-citation></ref>
      <ref id="bib1.bib85"><label>85</label><mixed-citation>Zhu, Q., De Vries, W., Liu, X., Zeng, M., Hao, T., Du, E., Zhang, F., and Shen, J.: The contribution of atmospheric deposition and forest harvesting to forest soil acidification in China since 1980, Atmos. Environ., 146, 215–222, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2016.04.023" ext-link-type="DOI">10.1016/j.atmosenv.2016.04.023</ext-link>, 2016.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>High-resolution, multi-depth mapping of soil bulk density and pH in China's forests using machine learning</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
       Abatzoglou, J. T., Dobrowski, S. Z., Parks, S. A., and Hegewisch, K. C.:
TerraClimate, a high-resolution global dataset of monthly climate and climatic water balance from 1958–2015, Sci. Data, 5, 170191, <a href="https://doi.org/10.1038/sdata.2017.191" target="_blank">https://doi.org/10.1038/sdata.2017.191</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
       Abdullah, H., Skidmore, A. K., Siegenthaler, A., and Neinavaz, E.:
High-resolution prediction of soil pH in european temperate forests using sentinel-2 and ancillary environmental data, Sci. Rep.-UK, 15, 28509, <a href="https://doi.org/10.1038/s41598-025-03942-4" target="_blank">https://doi.org/10.1038/s41598-025-03942-4</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
       Adhikari, K., Kheir, R. B., Greve, M. B., Bøcher, P. K., Malone, B. P., Minasny, B., McBratney, A. B., and Greve, M. H.:
High-resolution 3-D mapping of soil texture in Denmark, Soil Sci. Soc. Am. J., 77, 860–876, <a href="https://doi.org/10.2136/sssaj2012.0275" target="_blank">https://doi.org/10.2136/sssaj2012.0275</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
       Adhikari, K., Mishra, U., Owens, P., Libohova, Z., Wills, S., Riley, W., Hoffman, F., and Smith, D.:
Importance and strength of environmental controllers of soil organic carbon changes with scale, Geoderma, 375, <a href="https://doi.org/10.1016/j.geoderma.2020.114472" target="_blank">https://doi.org/10.1016/j.geoderma.2020.114472</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
       Angers, D. A., Arrouays, D., Saby, N. P. A., and Walter, C.:
Estimating and mapping the carbon saturation deficit of French agricultural topsoils, Soil Use Manage., 27, 448–452, <a href="https://doi.org/10.1111/j.1475-2743.2011.00366.x" target="_blank">https://doi.org/10.1111/j.1475-2743.2011.00366.x</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
       Arrouays, D., Grundy, M. G., Hartemink, A. E., Hempel, J. W., Heuvelink, G. B. M., Hong, S. Y., Lagacherie, P., Lelyk, G., McBratney, A. B., McKenzie, N. J., Mendonca-Santos, M. d. L., Minasny, B., Montanarella, L., Odeh, I. O. A., Sanchez, P. A., Thompson, J. A., and Zhang, G.-L.:
GlobalSoilMap, in: Advances in Agronomy,  Elsevier, 125, 93–134, <a href="https://doi.org/10.1016/B978-0-12-800137-0.00003-0" target="_blank">https://doi.org/10.1016/B978-0-12-800137-0.00003-0</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
       Augusto, L., Beaumont, F., Nguyen, C., Fraysse, J., Trichet, P., Meredieu, C., Vidal, D., and Sappin-Didier, V.:
Response of soil and vegetation in a warm-temperate pine forest to intensive biomass harvests, phosphorus fertilisation, and wood ash application, Sci. Total Environ., 157907, <a href="https://doi.org/10.1016/j.scitotenv.2022.157907" target="_blank">https://doi.org/10.1016/j.scitotenv.2022.157907</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
       Batjes, N. H.:
Harmonized soil property values for broad-scale modelling (WISE30sec) with estimates of global soil carbon stocks, Geoderma, 269, 61–68, <a href="https://doi.org/10.1016/j.geoderma.2016.01.034" target="_blank">https://doi.org/10.1016/j.geoderma.2016.01.034</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
       Bishop, T. F. A., McBratney, A. B., and Laslett, G. M.:
Modelling soil attribute depth functions with equal-area quadratic smoothing splines, Geoderma, 91, 27–45, <a href="https://doi.org/10.1016/S0016-7061(99)00003-8" target="_blank">https://doi.org/10.1016/S0016-7061(99)00003-8</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
      
Brady, N. C. and Weil, R. R.: The nature and properties of soils, 15th Edn., Global Edition, Pearson, Harlow, England, 1104 pp., ISBN 978-1-292-16224-9, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
       Brus, D. J., Kempen, B., and Heuvelink, G. B. M.:
Sampling for validation of digital soil maps, Eur. J. Soil Sci., 62, 394–407, <a href="https://doi.org/10.1111/j.1365-2389.2011.01364.x" target="_blank">https://doi.org/10.1111/j.1365-2389.2011.01364.x</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
       Cavazzi, S., Corstanje, R., Mayr, T., Hannam, J., and Fealy, R.:
Are fine resolution digital elevation models always the best choice in digital soil mapping?, Geoderma, 195–196, 111–121, <a href="https://doi.org/10.1016/j.geoderma.2012.11.020" target="_blank">https://doi.org/10.1016/j.geoderma.2012.11.020</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
      
Chen, J.: cjz-ux/ChinaForestSoil-DSM: ChinaForestSoil-DSM (digital_soil_mapping), Zenodo [code], <a href="https://doi.org/10.5281/zenodo.20422721" target="_blank">https://doi.org/10.5281/zenodo.20422721</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
       Chen, J., Deng, Z., Jiang, Z., Sun, J., Meng, F., Zuo, X., Wu, L., Cao, G., and Cao, S.:
Variations of rhizosphere and bulk soil microbial community in successive planting of chinese fir (cunninghamia lanceolata), Front. Plant Sci., 13, 954777, <a href="https://doi.org/10.3389/fpls.2022.954777" target="_blank">https://doi.org/10.3389/fpls.2022.954777</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
       Chen, J. Z. and Huang, Z. L.:
High-resolution maps of forest soil bulk density and pH across China at 90&thinsp;m resolution [DS/OL], V1, Science Data Bank [data set], <a href="https://doi.org/10.57760/sciencedb.25375" target="_blank">https://doi.org/10.57760/sciencedb.25375</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
       Chen, L. F., He, Z. B., Du, J., Yang, J. J., and Zhu, X.:
Patterns and environmental controls of soil organic carbon and total nitrogen in alpine ecosystems of northwestern China, Catena, 137, 37–43, <a href="https://doi.org/10.1016/j.catena.2015.08.017" target="_blank">https://doi.org/10.1016/j.catena.2015.08.017</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
       Chen, S., Mulder, V. L., Martin, M. P., Walter, C., Lacoste, M., Richer-de-Forges, A. C., Saby, N. P. A., Loiseau, T., Hu, B., and Arrouays, D.:
Probability mapping of soil thickness by random survival forest at a national scale, Geoderma, 344, 184–194, <a href="https://doi.org/10.1016/j.geoderma.2019.03.016" target="_blank">https://doi.org/10.1016/j.geoderma.2019.03.016</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
       Chen, S., Chen, Z., Zhang, X., Luo, Z., Schillaci, C., Arrouays, D., Richer-de-Forges, A. C., and Shi, Z.:
European topsoil bulk density and organic carbon stock database (0–20&thinsp;cm) using machine-learning-based pedotransfer functions, Earth Syst. Sci. Data, 16, 2367–2383, <a href="https://doi.org/10.5194/essd-16-2367-2024" target="_blank">https://doi.org/10.5194/essd-16-2367-2024</a>, 2024. 
    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
       Dai, Y., Shangguan, W., Wei, N., Xin, Q., Yuan, H., Zhang, S., Liu, S., Lu, X., Wang, D., and Yan, F.:
A review of the global soil property maps for Earth system models, SOIL, 5, 137–158, <a href="https://doi.org/10.5194/soil-5-137-2019" target="_blank">https://doi.org/10.5194/soil-5-137-2019</a>, 2019. 
    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
       Ding, Y. and Peng, S.:
Spatiotemporal change and attribution of potential evapotranspiration over china from 1901 to 2100, Theor. Appl. Climatol., 145, 79–94, <a href="https://doi.org/10.1007/s00704-021-03625-w" target="_blank">https://doi.org/10.1007/s00704-021-03625-w</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
       Dong, H., Huang, S., Wang, H., Huang, Q., Leng, G., Li, Z., Li, L., and Peng, J.:
Identifying non-stationarity in the dependence structures of meteorological factors within and across seasons and exploring possible causes, Stoch. Env. Res. Risk A., 37, 4071–4089, <a href="https://doi.org/10.1007/s00477-023-02496-z" target="_blank">https://doi.org/10.1007/s00477-023-02496-z</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
       Fan, N. Q., Zhu, A. X., Qin, C. Z., and Liang, P.:
Digital soil mapping over large areas with invalid environmental covariate data, ISPRS Int. J. Geo-Inf., 9, 102, <a href="https://doi.org/10.3390/ijgi9020102" target="_blank">https://doi.org/10.3390/ijgi9020102</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
       Farooq, T. H., Chen, X., Shakoor, A., Rashid, M. H. U., Kumar, U., Alhomrani, M., Alamri, A. S., Ravindran, B., and Yan, W.:
Unraveling the importance of forest structure and composition driving soil microbial and enzymatic responses in the subtropical forest soils, Forests, 13, 1535, <a href="https://doi.org/10.3390/f13101535" target="_blank">https://doi.org/10.3390/f13101535</a>, 2022a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
       Farooq, T. H., Li, Z., Yan, W., Shakoor, A., Kumar, U., Shabbir, R., Peng, Y., Gayathiri, E., Alotaibi, S. S., Wróbel, J., Kalaji, H. M., and Chen, X.:
Variations in litterfall dynamics, C:N:P stoichiometry and associated nutrient return in pure and mixed stands of camphor tree and masson pine forests, Front. Environ. Sci., 10, 903039, <a href="https://doi.org/10.3389/fenvs.2022.903039" target="_blank">https://doi.org/10.3389/fenvs.2022.903039</a>, 2022b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
       Goovaerts, P.:
Geostatistical modelling of uncertainty in soil science, Geoderma, 103, 3–26, <a href="https://doi.org/10.1016/S0016-7061(01)00067-2" target="_blank">https://doi.org/10.1016/S0016-7061(01)00067-2</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
       Guo, C., Wu, Y., Ni, J., and Guo, Y.:
Forest carbon storage in guizhou province based on field measurement dataset, Acta Geochim., 38, 8–21, <a href="https://doi.org/10.1007/s11631-018-0306-3" target="_blank">https://doi.org/10.1007/s11631-018-0306-3</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
       Guo, J., Feng, H., McNie, P., Wang, W., Peng, C., Feng, L., Sun, J., Pan, C., and Yu, Y.:
The effect of the conversion from natural broadleaved forests into Chinese fir (cunninghamia lanceolata (lamb.) hook.) plantations on soil microbial communities and nitrogen functional genes, Forests, 13, 158, <a href="https://doi.org/10.3390/f13020158" target="_blank">https://doi.org/10.3390/f13020158</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
       Hamza, M. A. and Anderson, W. K.:
Soil compaction in cropping systems, Soil Till. Res., 82, 121–145, <a href="https://doi.org/10.1016/j.still.2004.08.009" target="_blank">https://doi.org/10.1016/j.still.2004.08.009</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
       Hengl, T., Heuvelink, G. B. M., Kempen, B., Leenaars, J. G. B., Walsh, M. G., Shepherd, K. D., Sila, A., MacMillan, R. A., Mendes De Jesus, J., Tamene, L., and Tondoh, J. E.:
Mapping soil properties of Africa at 250&thinsp;m resolution: random forests significantly improve current predictions, PLOS One, 10, e0125814, <a href="https://doi.org/10.1371/journal.pone.0125814" target="_blank">https://doi.org/10.1371/journal.pone.0125814</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
       Huang, H., Yang, L., Zhang, L., Pu, Y., Yang, C., Wu, Q., Cai, Y., Shen, F., and Zhou, C.:
A review on digital mapping of soil carbon in cropland: progress, challenge, and prospect, Environ. Res. Lett., 17, 123004, <a href="https://doi.org/10.1088/1748-9326/aca41e" target="_blank">https://doi.org/10.1088/1748-9326/aca41e</a>, 2022a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
       Huang, H., Liu, Y., Liu, Y., Tong, Z., Ren, Z., Xie, Y., Huang, H., Liu, Y., Liu, Y., Tong, Z., Ren, Z., and Xie, Y.:
Digital mapping of soil pH and driving factor analysis based on environmental variable screening, Sustainability-Basel, 17, <a href="https://doi.org/10.3390/su17073173" target="_blank">https://doi.org/10.3390/su17073173</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
       Huang, X., Cui, C., Hou, E., Li, F., Liu, W., Jiang, L., Luo, Y., and Xu, X.:
Acidification of soil due to forestation at the global scale, Forest Ecol. Manag., 505, 119951, <a href="https://doi.org/10.1016/j.foreco.2021.119951" target="_blank">https://doi.org/10.1016/j.foreco.2021.119951</a>, 2022b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
      
Huettmann, F. and Gottschalk, T.: Simplicity, model fit, complexity and uncertainty in spatial prediction models applied over time: we are quite sure, aren't we?, in: Predictive Species and Habitat Modeling in Landscape Ecology, edited by: Drew, C. A., Wiersma, Y. F., and Huettmann, F., Springer New York, New York, NY, 189–208, <a href="https://doi.org/10.1007/978-1-4419-7390-0_10" target="_blank">https://doi.org/10.1007/978-1-4419-7390-0_10</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
      
IPCC: Climate Change 2022: Mitigation of Climate Change. Contribution of Working Group III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, <a href="https://doi.org/10.1017/9781009157926" target="_blank">https://doi.org/10.1017/9781009157926</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
       Jenny, H.:
Factors of soil formation: a system of quantitative pedology,  OCLC no. 905466, McGraw-Hill, New York, 1941.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
       Jobbágy, E. G. and Jackson, R. B.:
The vertical distribution of soil organic carbon and its relation to climate and vegetation, Ecol. Appl., 10, 423–436, <a href="https://doi.org/10.1890/1051-0761(2000)010[0423:TVDOSO]2.0.CO;2" target="_blank">https://doi.org/10.1890/1051-0761(2000)010[0423:TVDOSO]2.0.CO;2</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
       Kleber, M., Bourg, I. C., Coward, E. K., Hansel, C. M., Myneni, S. C. B., and Nunan, N.:
Dynamic interactions at the mineral–organic matter interface, Nat. Rev. Earth Environ., 2, 402–421, <a href="https://doi.org/10.1038/s43017-021-00162-y" target="_blank">https://doi.org/10.1038/s43017-021-00162-y</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
       Kumar, P. S., Shukla, G., Nath, A. J., and Chakravarty, S.:
Soil properties, litter dynamics and biomass carbon storage in three-bamboo species of sub-himalayan region of eastern India, Water Air Soil Poll., 233, <a href="https://doi.org/10.1007/s11270-021-05477-6" target="_blank">https://doi.org/10.1007/s11270-021-05477-6</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
       Lal, R.:
Forest soils and carbon sequestration, Forest Ecol. Manag., 220, 242–258, <a href="https://doi.org/10.1016/j.foreco.2005.08.015" target="_blank">https://doi.org/10.1016/j.foreco.2005.08.015</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
       Liang, Z., Chen, S., Yang, Y., Zhao, R., Shi, Z., and Viscarra Rossel, R. A.:
National digital soil map of organic matter in topsoil and its associated uncertainty in 1980's China, Geoderma, 335, 47–56, <a href="https://doi.org/10.1016/j.geoderma.2018.08.011" target="_blank">https://doi.org/10.1016/j.geoderma.2018.08.011</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
       Liu, B., Zhao, P., Wu, Y., and Wang, X.:
Clarifying the effects of potential evapotranspiration and soil moisture on transpiration in secondary forests of birch in semi-arid regions of China, J. Plant Interact., 19, 2389048, <a href="https://doi.org/10.1080/17429145.2024.2389048" target="_blank">https://doi.org/10.1080/17429145.2024.2389048</a>, 2024a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
       Liu, F., Wu, H., Zhao, Y., Li, D., Yang, J., Song, X., Shi, Z., Zhu, A., and Zhang, G.:
Mapping high resolution national soil information grids of China, Sci. Bull., 67, 328–340, <a href="https://doi.org/10.1016/j.scib.2021.10.013" target="_blank">https://doi.org/10.1016/j.scib.2021.10.013</a>, 2022a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
       Liu, F., Yang, F., Zhao, Y., Zhang, G., and Li, D.:
Predicting soil depth in a large and complex area using machine learning and environmental correlations, J. Integr. Agr., 21, 2422–2434, <a href="https://doi.org/10.1016/S2095-3119(21)63692-4" target="_blank">https://doi.org/10.1016/S2095-3119(21)63692-4</a>, 2022b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
       Liu, Z., Gu, H., Yao, Q., Jiao, F., Hu, X., Liu, J., Jin, J., Liu, X., and Wang, G.:
Soil pH and carbon quality index regulate the biogeochemical cycle couplings of carbon, nitrogen and phosphorus in the profiles of isohumosols, Sci. Total Environ., 922, 171269, <a href="https://doi.org/10.1016/j.scitotenv.2024.171269" target="_blank">https://doi.org/10.1016/j.scitotenv.2024.171269</a>, 2024b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
       Lundberg, S. and Lee, S.-I.:
A unified approach to interpreting model predictions, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.1705.07874" target="_blank">https://doi.org/10.48550/arXiv.1705.07874</a>, 25 November 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
       McBratney, A. B., Mendonça Santos, M. L., and Minasny, B.:
On digital soil mapping, Geoderma, 117, 3–52, <a href="https://doi.org/10.1016/S0016-7061(03)00223-4" target="_blank">https://doi.org/10.1016/S0016-7061(03)00223-4</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
       Meinshausen, N.:
Quantile regression forests, J. Mach. Learn. Res., 7, 983–999, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
       Minasny, B., McBratney, A. B., Malone, B. P., and Wheeler, I.:
Digital mapping of soil carbon, in: Advances in Agronomy, Elsevier, 118,1–47, <a href="https://doi.org/10.1016/B978-0-12-405942-9.00001-3" target="_blank">https://doi.org/10.1016/B978-0-12-405942-9.00001-3</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
       Miranda, R., Nobrega, R., Silva, E., Silva, J., Araújo Filho, J., Moura, M., Barros, A., Souza, A., Verhoef, A., Yang, W., Shao, H., Srinivasan, R., Ziadat, F., Montenegro, S., Araújo, M., and Galvíncio, J.:
Hybrid machine learning for integrating pedological knowledge into digital soil mapping to advance next-generation earth system models, Earth ArXiv [preprint], <a href="https://doi.org/10.31223/X57P9W" target="_blank">https://doi.org/10.31223/X57P9W</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
       Nussbaum, M., Spiess, K., Baltensweiler, A., Grob, U., Keller, A., Greiner, L., Schaepman, M. E., and Papritz, A.:
Evaluation of digital soil mapping approaches with large sets of environmental covariates, SOIL, 4, 1–22, <a href="https://doi.org/10.5194/soil-4-1-2018" target="_blank">https://doi.org/10.5194/soil-4-1-2018</a>, 2018. 
    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
       Padarian, J., Minasny, B., and McBratney, A. B.:
Using deep learning for digital soil mapping, SOIL, 5, 79–89, <a href="https://doi.org/10.5194/soil-5-79-2019" target="_blank">https://doi.org/10.5194/soil-5-79-2019</a>, 2019. 
    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>52</label><mixed-citation>
       Pan, Y., Birdsey, R. A., Fang, J., Houghton, R., Kauppi, P. E., Kurz, W. A., Phillips, O. L., Shvidenko, A., Lewis, S. L., Canadell, J. G., Ciais, P., Jackson, R. B., Pacala, S. W., McGuire, A. D., Piao, S., Rautiainen, A., Sitch, S., and Hayes, D.:
A large and persistent carbon sink in the world's forests, Science, 333, 988–993, <a href="https://doi.org/10.1126/science.1201609" target="_blank">https://doi.org/10.1126/science.1201609</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>53</label><mixed-citation>
       Patton, N. R., Lohse, K. A., Seyfried, M. S., Godsey, S. E., and Parsons, S. B.:
Topographic controls of soil organic carbon on soil-mantled landscapes, Sci. Rep.-UK, 9, 6390, <a href="https://doi.org/10.1038/s41598-019-42556-5" target="_blank">https://doi.org/10.1038/s41598-019-42556-5</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>54</label><mixed-citation>
       Peng, S., Ding, Y., Liu, W., and Li, Z.:
1&thinsp;km monthly temperature and precipitation dataset for China from 1901 to 2017, Earth Syst. Sci. Data, 11, 1931–1946, <a href="https://doi.org/10.5194/essd-11-1931-2019" target="_blank">https://doi.org/10.5194/essd-11-1931-2019</a>, 2019. 
    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>55</label><mixed-citation>
       Peng, X., He, G., Wang, G., Yin, R., Yang, R., Peng, Y., Long, T., Zhang, Z., Chen, Y., and Wang, J.:
GF-1 WFV satellite images based forest cover mapping in China supported by open land use/cover datasets, Sci. Data, 11, 1355, <a href="https://doi.org/10.1038/s41597-024-04202-2" target="_blank">https://doi.org/10.1038/s41597-024-04202-2</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>56</label><mixed-citation>
       Piedallu, C., Pedersoli, E., Chaste, E., Morneau, F., Seynave, I., and Gégout, J.-C.:
Optimal resolution of soil properties maps varies according to their geographical extent and location, Geoderma, 412, 115723, <a href="https://doi.org/10.1016/j.geoderma.2022.115723" target="_blank">https://doi.org/10.1016/j.geoderma.2022.115723</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>57</label><mixed-citation>
       Poggio, L., de Sousa, L. M., Batjes, N. H., Heuvelink, G. B. M., Kempen, B., Ribeiro, E., and Rossiter, D.:
SoilGrids 2.0: producing soil information for the globe with quantified spatial uncertainty, SOIL, 7, 217–240, <a href="https://doi.org/10.5194/soil-7-217-2021" target="_blank">https://doi.org/10.5194/soil-7-217-2021</a>, 2021. 
    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>58</label><mixed-citation>
       Potash, E., Guan, K., Margenot, A. J., Lee, D., Boe, A., Douglass, M., Heaton, E., Jang, C., Jin, V., Li, N., Mitchell, R., Namoi, N., Schmer, M., Wang, S., and Zumpf, C.:
Multi-site evaluation of stratified and balanced sampling of soil organic carbon stocks in agricultural fields, Geoderma, 438, 116587, <a href="https://doi.org/10.1016/j.geoderma.2023.116587" target="_blank">https://doi.org/10.1016/j.geoderma.2023.116587</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>59</label><mixed-citation>
       Pouladi, N., Møller, A. B., Tabatabai, S., and Greve, M. H.:
Mapping soil organic matter contents at field level with cubist, random forest and kriging, Geoderma, 342, 85–92, <a href="https://doi.org/10.1016/j.geoderma.2019.02.019" target="_blank">https://doi.org/10.1016/j.geoderma.2019.02.019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>60</label><mixed-citation>
       Robb, C., Aitkenhead, M., Coull, M., Macfarlane, F., and Matthews, K.:
Soil property, carbon stock and peat extent mapping at 10&thinsp;m resolution in Scotland using digital soil mapping techniques, Eur. J. Soil Sci., 76, <a href="https://doi.org/10.1111/ejss.70123" target="_blank">https://doi.org/10.1111/ejss.70123</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>61</label><mixed-citation>
       Rodrigues, H., Ceddia, M. B., Vasques, G. M., Grunwald, S., and Babaeian, E.:
AutoRA: an innovative algorithm for automatic delineation of reference areas in support of smart soil sampling and digital soil twins, Front. Soil Sci., 5, 1557566, <a href="https://doi.org/10.3389/fsoil.2025.1557566" target="_blank">https://doi.org/10.3389/fsoil.2025.1557566</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>62</label><mixed-citation>
      
Serrano, J., Shahidian, S., and Silva, J. M. da: Evaluation of Normalized Difference Water Index as a Tool for Monitoring Pasture Seasonal and Inter-Annual Variability in a Mediterranean Agro-Silvo-Pastoral System, Water, 11, <a href="https://doi.org/10.3390/w11010062" target="_blank">https://doi.org/10.3390/w11010062</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>63</label><mixed-citation>
       Shao, S., Su, B., Zhang, Y., Gao, C., Zhang, M., Zhang, H., and Yang, L.:
Sample design optimization for soil mapping using improved artificial neural networks and simulated annealing, Geoderma, 413, 115749, <a href="https://doi.org/10.1016/j.geoderma.2022.115749" target="_blank">https://doi.org/10.1016/j.geoderma.2022.115749</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>64</label><mixed-citation>
       Shen, Y., Li, J., Chen, F., Cheng, R., Xiao, W., Wu, L., and Zeng, L.:
Correlations between forest soil quality and aboveground vegetation characteristics in hunan province, china, Front. Plant Sci., 13, 1009109, <a href="https://doi.org/10.3389/fpls.2022.1009109" target="_blank">https://doi.org/10.3389/fpls.2022.1009109</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>65</label><mixed-citation>
       Shi, G., Sun, W., Shangguan, W., Wei, Z., Yuan, H., Li, L., Sun, X., Zhang, Y., Liang, H., Li, D., Huang, F., Li, Q., and Dai, Y.:
A China dataset of soil properties for land surface modelling (version 2, CSDLv2), Earth Syst. Sci. Data, 17, 517–543, <a href="https://doi.org/10.5194/essd-17-517-2025" target="_blank">https://doi.org/10.5194/essd-17-517-2025</a>, 2025. 
    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>66</label><mixed-citation>
       Smith, P. L.:
Geostatistical error management: quantifying uncertainty for environmental sampling and mapping, Technometrics, 43, 238–239, <a href="https://doi.org/10.1198/tech.2001.s593" target="_blank">https://doi.org/10.1198/tech.2001.s593</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>67</label><mixed-citation>
       Sylvain, J. D., Anctil, F., and Thiffault, É.:
Using bias correction and ensemble modelling for predictive mapping and related uncertainty: a case study in digital soil mapping, Geoderma, 403, 115153, <a href="https://doi.org/10.1016/j.geoderma.2021.115153" target="_blank">https://doi.org/10.1016/j.geoderma.2021.115153</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>68</label><mixed-citation>
       Szatmári, G., Laborczi, A., Mészáros, J., Takács, K., Benő, A., Koós, S., Bakacsi, Z., and Pásztor, L.:
Gridded, temporally referenced spatial information on soil organic carbon for Hungary, Sci. Data, 11, 1312, <a href="https://doi.org/10.1038/s41597-024-04158-3" target="_blank">https://doi.org/10.1038/s41597-024-04158-3</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>69</label><mixed-citation>
       Tang, S., Xu, X., Wu, Y., Meng, L., Tawaraya, K., and Cheng, W.:
Long-term afforestation of black pine over two centuries asymptotically enhanced SOC and TN stocks in a typical coastal sand dune of Japan, Catena, 249, 108697, <a href="https://doi.org/10.1016/j.catena.2024.108697" target="_blank">https://doi.org/10.1016/j.catena.2024.108697</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>70</label><mixed-citation>
      
UNEP: Becoming #GenerationRestoration: Ecosystem Restoration for People, Nature and Climate, United Nations Environment Programme, ISBN 978-92-807-3864-3, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>71</label><mixed-citation>
       Vaysse, K. and Lagacherie, P.:
Using quantile regression forest to estimate uncertainty of digital soil mapping products, Geoderma, 291, 55–64, <a href="https://doi.org/10.1016/j.geoderma.2016.12.017" target="_blank">https://doi.org/10.1016/j.geoderma.2016.12.017</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>72</label><mixed-citation>
       Wadoux, A., Minasny, B., and McBratney, A.:
Machine learning for digital soil mapping: applications, challenges and suggested solutions, EarthArXiv [preprint], <a href="https://doi.org/10.31223/OSF.IO/8EQ6S" target="_blank">https://doi.org/10.31223/OSF.IO/8EQ6S</a>, 6 February 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>73</label><mixed-citation>
       Westhuizen, S. V. D., Heuvelink, G. B. M., Hofmeyr, D. P., Poggio, L., Nussbaum, M., and Brungard, C.:
Mapping soil thickness by accounting for right-censored data with survival probabilities and machine learning, Eur. J. Soil Sci., 75, e13589, <a href="https://doi.org/10.1111/ejss.13589" target="_blank">https://doi.org/10.1111/ejss.13589</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>74</label><mixed-citation>
       Xiao, Y., Xue, J., Zhang, X., Wang, N., Hong, Y., Jiang, Y., Zhou, Y., Teng, H., Hu, B., Lugato, E., Richer-de-Forges, A. C., Arrouays, D., Shi, Z., and Chen, S.:
Improving pedotransfer functions for predicting soil mineral associated organic carbon by ensemble machine learning, Geoderma, 428, 116208, <a href="https://doi.org/10.1016/j.geoderma.2022.116208" target="_blank">https://doi.org/10.1016/j.geoderma.2022.116208</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>75</label><mixed-citation>
       Xu, L., He, N. P., Yu, G. R., Wen, D., Gao, Y., and He, H. L.:
Differences in pedotransfer functions of bulk density lead to high uncertainty in soil organic carbon estimation at regional scales: evidence from chinese terrestrial ecosystems, J. Geophys. Res.-Biogeo., 120, 1567–1575, <a href="https://doi.org/10.1002/2015JG002929" target="_blank">https://doi.org/10.1002/2015JG002929</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>76</label><mixed-citation>
       Xu, Z., Man, X., Cai, T., and Shang, Y.:
How potential evapotranspiration regulates the response of canopy transpiration to soil moisture and leaf area index of the boreal larch forest in China, Forests, 13, <a href="https://doi.org/10.3390/f13040571" target="_blank">https://doi.org/10.3390/f13040571</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>77</label><mixed-citation>
       Xue, J., Zhang, X., Chen, S., Chen, Z., Lu, R., Liu, F., Van Wesemael, B., and Shi, Z.:
National-scale mapping topsoil organic carbon of cropland in China using multitemporal sentinel-2 images, Geoderma, 456, 117272, <a href="https://doi.org/10.1016/j.geoderma.2025.117272" target="_blank">https://doi.org/10.1016/j.geoderma.2025.117272</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>78</label><mixed-citation>
       Yang, L., Zhu, A. X., Qi, F., Qin, C. Z., Li, B., and Pei, T.:
An integrative hierarchical stepwise sampling strategy for spatial sampling and its application in digital soil mapping, Int. J. Geogr. Inf. Sci., 27, 1–23, <a href="https://doi.org/10.1080/13658816.2012.658053" target="_blank">https://doi.org/10.1080/13658816.2012.658053</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>79</label><mixed-citation>
       Yu, Z., Chen, H. Y. H., Searle, E. B., Sardans, J., Ciais, P., Peñuelas, J., and Huang, Z.:
Whole soil acidification and base cation reduction across subtropical china, Geoderma, 361, 114107, <a href="https://doi.org/10.1016/j.geoderma.2019.114107" target="_blank">https://doi.org/10.1016/j.geoderma.2019.114107</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib80"><label>80</label><mixed-citation>
       Zhang, S., Zhou, X., Chen, Y., Du, F., and Zhu, B.:
Soil organic carbon fractions in China: spatial distribution, drivers, and future changes, Sci. Total Environ., 919, 170890, <a href="https://doi.org/10.1016/j.scitotenv.2024.170890" target="_blank">https://doi.org/10.1016/j.scitotenv.2024.170890</a>, 2024a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib81"><label>81</label><mixed-citation>
       Zhang, Y., Wang, X., Zheng, Y., and Duan, L.:
Characteristics of soil organic carbon fractions and influencing factors in different understory mosses in karst urban parks, Sci. Rep.-UK, 14, 27691, <a href="https://doi.org/10.1038/s41598-024-77735-6" target="_blank">https://doi.org/10.1038/s41598-024-77735-6</a>, 2024b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib82"><label>82</label><mixed-citation>
       Zhao, C., Long, J., Liao, H., Zheng, C., Li, J., Liu, L., and Zhang, M.:
Dynamics of soil microbial communities following vegetation succession in a karst mountain ecosystem, southwest China, Sci. Rep.-UK, 9, 2160, <a href="https://doi.org/10.1038/s41598-018-36886-z" target="_blank">https://doi.org/10.1038/s41598-018-36886-z</a>, 2019.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib83"><label>83</label><mixed-citation>
       Zhao, F., Zhu, A., Zhu, L., and Qin, C.:
iSoLIM: a similarity-based spatial prediction software for the big data era, Ann. Gis, 30, 535–549, <a href="https://doi.org/10.1080/19475683.2024.2324381" target="_blank">https://doi.org/10.1080/19475683.2024.2324381</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib84"><label>84</label><mixed-citation>
       Zhu, A. X., Qin, C. Z., Liang, P., and Du, F.:
Digital soil mapping for smart agriculture: the solim method and software platforms, Vestn. Ross. univ. družby nar., Ser. Agron. životnovod., 13, 317–335, <a href="https://doi.org/10.22363/2312-797X-2018-13-4-317-335" target="_blank">https://doi.org/10.22363/2312-797X-2018-13-4-317-335</a>, 2018.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib85"><label>85</label><mixed-citation>
       Zhu, Q., De Vries, W., Liu, X., Zeng, M., Hao, T., Du, E., Zhang, F., and Shen, J.:
The contribution of atmospheric deposition and forest harvesting to forest soil acidification in China since 1980, Atmos. Environ., 146, 215–222, <a href="https://doi.org/10.1016/j.atmosenv.2016.04.023" target="_blank">https://doi.org/10.1016/j.atmosenv.2016.04.023</a>, 2016.

    </mixed-citation></ref-html>--></article>
