Articles | Volume 13, issue 2
Earth Syst. Sci. Data, 13, 255–267, 2021
https://doi.org/10.5194/essd-13-255-2021
Earth Syst. Sci. Data, 13, 255–267, 2021
https://doi.org/10.5194/essd-13-255-2021

Data description paper 03 Feb 2021

Data description paper | 03 Feb 2021

A restructured and updated global soil respiration database (SRDB-V5)

A restructured and updated global soil respiration database (SRDB-V5)
Jinshi Jian1, Rodrigo Vargas2, Kristina Anderson-Teixeira3,4, Emma Stell2, Valentine Herrmann3, Mercedes Horn5, Nazar Kholod1, Jason Manzon6, Rebecca Marchesi3, Darlin Paredes7, and Ben Bond-Lamberty1 Jinshi Jian et al.
  • 1Pacific Northwest National Laboratory, Joint Global Change Research Institute at the University of Maryland–College Park, 5825 University Research Court, Suite 3500, College Park, MD 20740, USA
  • 2Department of Plant and Soil Sciences, University of Delaware, Newark, DE 19716, USA
  • 3Conservation Ecology Center, Smithsonian Conservation Biology Institute, Front Royal, VA 22630, USA
  • 4Center for Tropical Forest Science-Forest Global Earth Observatory, Smithsonian Tropical Research Institute, Panama City, 0801, Republic of Panama
  • 5University of Vermont, Rubenstein School of Environment and Natural Resources, Burlington, VT 05405, USA
  • 6University of Maryland, College Park, MD 20740, USA
  • 7Georgetown University, School of Foreign Service, Washington, DC 20057, USA

Correspondence: Jinshi Jian (jinshi@vt.edu) and Ben Bond-Lamberty (bondlamberty@pnnl.gov)

Abstract

Field-measured soil respiration (RS, the soil-to-atmosphere CO2 flux) observations were compiled into a global soil respiration database (SRDB) a decade ago, a resource that has been widely used by the biogeochemistry community to advance our understanding of RS dynamics. Novel carbon cycle science questions require updated and augmented global information with better interoperability among datasets. Here, we restructured and updated the global RS database to version SRDB-V5. The updated version has all previous fields revised for consistency and simplicity, and it has several new fields to include ancillary information (e.g., RS measurement time, collar insertion depth, collar area). The new SRDB-V5 includes published papers through 2017 (800 independent studies), where total observations increased from 6633 in SRDB-V4 to 10 366 in SRDB-V5. The SRDB-V5 features more RS data published in the Russian and Chinese scientific literature and has an improved global spatio-temporal coverage and improved global climate space representation. We also restructured the database so that it has stronger interoperability with other datasets related to carbon cycle science. For instance, linking SRDB-V5 with an hourly timescale global soil respiration database (HGRsD) and a community database for continuous soil respiration (COSORE) enables researchers to explore new questions. The updated SRDB-V5 aims to be a data framework for the scientific community to share seasonal to annual field RS measurements, and it provides opportunities for the biogeochemistry community to better understand the spatial and temporal variability in RS, its components, and the overall carbon cycle.

The database can be downloaded at https://github.com/bpbond/srdb and will be made available in the Oak Ridge National Laboratory's Distributed Active Archive Center (ORNL DAAC).

All data and code to reproduce the results in this study can be found at https://doi.org/10.5281/zenodo.3876443 (Jian and Bond-Lamberty, 2020).

1 Introduction

Soil respiration (RS), the soil-surface-to-atmosphere CO2 flux, is one of the largest carbon fluxes between the terrestrial land surface and atmosphere (Luo and Zhou, 2010). The majority of RS is released by soil microbial/fauna (heterotrophic respiration) and plant root respiration (autotrophic respiration). Soils hold a large amount (> 2000 Pg C to 1 m depth) of carbon, more than the total carbon stock in the atmosphere and aboveground plants (Batjes, 2016; Tarnocai et al., 2009). Thus, its C efflux to the atmosphere has major implications for our understanding of ecosystem- to global-scale biogeochemical cycling. For better monitoring of soil carbon dynamics as well as to investigate how soil carbon responds to global climate change, it is important to measure RS across different vegetation types and climate conditions.

Many field experiments have been conducted in recent decades to measure RS in different climate conditions and vegetation types (Bond-Lamberty and Thomson, 2010b; Davidson et al., 1998; Raich and Potter, 1995). However, the resulting estimates of seasonal to annual RS fluxes are scattered throughout the scientific literature in a variety of formats. Therefore, compiling past RS measurements together into a standardized data framework to support synthesis analysis is very important to advance carbon cycle science.

https://essd.copernicus.org/articles/13/255/2021/essd-13-255-2021-f01

Figure 1Summary of studies citing the global soil respiration database (SRDB) between 2010 and 2019. More and more studies are using SRDB since the first version (SRDB-V1) was published (Bond-Lamberty and Thomson, 2010a).

Download

Published site-scale RS measurements across the globe have been compiled and standardized into global soil respiration databases to support synthesis studies, macro-to-global-scale RS estimates, and soil carbon response to climate change investigation (Bond-Lamberty and Thomson, 2010a; Raich and Schlesinger, 1992). Schlesinger (1977) compiled one of the earliest listings of RS estimates from diverse ecosystems. Raich and Schlesinger (1992) subsequently integrated RS from published papers which covered 13 ecosystems and developed a simple linear model between RS and climate factors (i.e., temperature and precipitation), estimating global RS to be 68 ± 4 Pg C yr−1. Later, more RS measurements (especially measured using the infrared gas analyzer, IRGA) were added, and the global RS was updated to 76–81 Pg C yr−1 (Raich et al., 2002; Raich and Potter, 1995). In 2010, Bond-Lamberty and Thomson (2010a) compiled a comprehensive global soil respiration database (SRDB), and this database was released for public usage. The SRDB contains annual and seasonal RS measurements, ancillary carbon pools and fluxes (e.g., gross primary production, net primary production, ecosystem respiration), response of RS to temperature and moisture (i.e., model parameters to describe the relationship between RS and temperature and moisture), and sites' background information (e.g., latitude, longitude, elevation, mean annual temperature, mean annual precipitation) (Bond-Lamberty and Thomson, 2018, 2010a). With more IRGA-based RS measurements added and alkaline-based measurements excluded, Bond-Lamberty and Thomson (2010b) estimated the global RS to be 98 ± 12 Pg C yr−1 and estimated that global RS was increasing at a rate of 0.1 Pg C yr−2. The SRDB has been widely used in the past decade since the first version was published (Bond-Lamberty and Thomson, 2010a), and to date it has been cited 359 times (searched in Google Scholar on 20 May 2020), but its use continues to increase each year (Fig. 1).

The SRDB of Bond-Lamberty and Thomson (2010a) however only recorded seasonal to annual RS fluxes, hindering analyses at finer temporal resolutions. Based on the SRDB, Jian et al. (2018c) collected SRDB studies reporting diurnal RS and compiled these into a global hourly soil respiration database (HGRsD). Similarly, Jian et al. (2018a) further collected detailed monthly and daily timescale RS measurements into a global monthly and daily soil respiration database (MGRsD). More recently, Bond-Lamberty et al. (2020) have built a database (COSORE) of continuous (typically half-hourly or hourly) datasets from globally distributed sites. With these different-timescale databases, RS temporal variability and its time-related driving processes and uncertainties can be analyzed (Jian et al., 2018a, b, c). There is still a need to improve interoperability among RS databases to expand available information, improve database usage, and advance our understanding of RS dynamics across multiple spatial and temporal scales.

In approaching a decadal reworking of the SRDB, we envisioned that it required improvements to increase its usage across different disciplines. Some important information (e.g., collar area, collar insertion depth, RS measure time, soil temperature, soil moisture, soil temperature measure depth, and soil moisture measure depth) was not included in the older versions (hereafter named SRDB-V1 to SRDB-V4), and thus important questions such as whether RS survey time (Cueva et al., 2017), collar insertion depth (Heinemeyer et al., 2011), and/or how collar cover area affected RS measurement accuracy could not be addressed. In addition, SRDB-V4 included data mainly published in English ( 98 %), while data published in other languages ( 2 %) were rarely included (Epule, 2015). Some metadata such as manipulation and measurement method were not standardized and thus were difficult to use in subsequent meta-analyses. For instance, the attempt to link SRDB to the Forest Carbon Database (ForC) showed that the old SRDB structure required modification before it could be linked with ForC (Anderson-Teixeira et al., 2018). Finally, information about how heterotrophic (RH) and autotrophic respiration (RA) respond to environmental conditions (i.e., temperature and soil moisture) was not included.

The older SRDB followed certain data integration principles, including inclusion criteria, database structure design, and quality control (Bond-Lamberty and Thomson, 2010a), but improvements could be made. We have updated it to a new version (hereafter named SRDB-V5) following FAIR protocols (i.e., findable, accessible, interoperable, and reusable) (Wilkinson et al., 2016). This has been accomplished by (1) restructuring SRDB and improving its interoperability so that data from SRDB-V5 can more easily be linked to external datasets; (2) separating the RS, RH, and RA responses to temperature and soil moisture functions into a separate file to simplify the database and improve its reusability; (3)  adding collar area, collar insertion depth, and RS measurement time information to SRDB-V5; (4) collecting more RS data published in the Russian and Chinese scientific literature; (5) updating RS records available throughout the world from recently published literature (until 2017); and (6) improving the metadata description. We hope that these efforts will significantly improve the future interoperability and reusability of SRDB-V5.

2 Methods

2.1 Soil respiration database restructuring

We restructured the SRDB for easier data collection and quality control. The previous global RS database versions (SRDB-V1 to SRDB-V4) mainly included two files: a “studies” file, which recorded the detailed metadata for all published papers examined by the SRDB, and a “data” file, which stores all the RS data; a variety of ancillary site, soil, and carbon cycle data (e.g., gross primary production, GPP; net primary production, NPP; ecosystem respiration); and related background information such as site location, ecosystem type, and management (Bond-Lamberty and Thomson, 2010a). In SRDB-V5 the “studies” file remains unchanged, but the “data” file is now separated into two files: “srdb-data” and “srdb-equations”. This simplifies the structure of the former while moving all the “Response of RS to temperature and moisture” columns in the SRDB to the latter. Note that the SRDB-V5 file format remains the same as the older versions as comma-separated value data are easy to work with and universally readable by software.

Table 1Summary of metadata updates in SRDB-V5 compared with the old version SRDB-V4.

Download Print Version | Download XLSX

Table 2Summary of standardized measurement method (Meas_method) in SRDB-V5.

Download Print Version | Download XLSX

Table 3Summary of standardized partition method (Partition_method) in SRDB-V5.

Download Print Version | Download XLSX

2.2 Metadata

We standardized the background information of SRDB-V5. Most of the metadata are described by Bond-Lamberty and Thomson (2010a), and here we only describe new added columns or metadata with updates (Tables 1 to 3). We added five columns (i.e., Site_ID, Collar_height, Collar_depth, Chamber_area, Time_of_day) in SRDB-V5. Four columns (Rs_max, Rs_maxday, Rs_min, Rs_minday) were deleted (Table 1) because they were rarely reported and had not been used by the community in the past 10 years (according to our literature search, Rs_max, Rs_maxday, Rs_min, and Rs_minday have never been used). In the Quality_flag column, we added two more flags related to RS temperature equations: Q15 means the equation was developed based on seasonal RS data rather than covering at least a whole year, and Q16 notes that there is a soil water content (SWC) component within the reported equation (Table 1).

For many analyses SRDB needs to be connected with other datasets, and a unique observation ID is essential for this process. In the SRDB-V5, we added a “Site_ID” column to guarantee a unique ID for each Rs_annual observation within a study, enabling users to easily link SRDB-V5 records with external data such as MGRsD and HGRsD. The Site_ID is in the form of “CC–RC–IC”, where CC is the ISO Alpha-2 country code (https://www.nationsonline.org/oneworld/country_code_list.htm, last access: 31 January 2021), RC is region code (state/province), and IC is identity code. Country code and region code are always present, but some studies report only one annual RS value, and thus IC may or may not be present.

We standardized the coding of experimental manipulation, collapsing the previous ad hoc categories into a smaller set of standardized terms. This decreased the number of unique Manipulation field values from 689 to 276. We used the following criteria to simplify the manipulation in SRDB-V5: (1) measurements from no treatment (i.e., control) were categorized as “None”; (2) manipulation names were standardized (e.g., “clipping”, “clip”, and “clipped” are now all standardized as “Clip”); (3) we used the manipulation level to further describe the difference within a specific manipulation (e.g., “Litter manipulation” could have “double litter”, “50 % litter removal”, “100 % litter removal”). With manipulation standardized, scientists can further analyze how manipulation affects RS. For instance, comparing RS measurements from the “CO2” group (i.e., elevated CO2 concentration treatment) with “None” (i.e., control) enables researchers to analyze how RS responds to CO2 concentration increase caused by CO2 released from fossil fuel combustion. Similarly, data from the “Warm” and “Precipitation amount change” groupings will enable scientists to more easily explore how soil carbon responds to global climate change. Barba et al. (2018) suggested that bias could arise from measurements made in “hotspots” (i.e., areas with high values compared with the surrounding environment), and groupings such as “Ant mound” and “High N” facilitate data interpretation and analyses regarding “hotspots”.

We also standardized the RS measurement method (the Meas_method) and RS partition method (Partition_method) fields. Measurement method was grouped into nine types (Table 2), and the partition method was grouped into eight types (Table 3). With these changes, scientists can more easily investigate whether different measure methods affect RS results as well as whether different partition methods affect RH and RA partitioning.

Latitude and longitude are key metadata as they can be used to link RS measurements to spatial data (e.g., precipitation and air temperature). During the data collecting process, latitude and longitude values reported in the original paper were recorded in our database, generally to two significant digits. However, the precision of SRDB latitude and longitude can be affected by many factors: first, studies report latitude and longitude at different and sometimes uncertain levels of precision; second, studies use different methods for recording latitude and longitude; and finally, some studies have multiple nearby sites but report one general latitude and longitude for all those sites. However, it is unlikely that the error is very large, and in general we assume that linking RS measurements to relatively coarse spatial data (e.g., 0.1–0.5 resolution) should be unproblematic. When linking to high-spatial-resolution data (such as 30 m resolution remote-sensing images), users should be aware that the variable and uncertain SRDB latitude and longitude precision may cause data quality issues. That said, SRDB-V5 was revised to avoid unrealistic locations such as points in the ocean. Furthermore, the latitude and longitude fields should be within 90 to 90 and 180 to 180, respectively; whenever they are out of these ranges, a warning is raised.

2.3 Soil respiration database update

We updated the SRDB-V5 so that it has temporal coverage to 2017 and made an effort to collect RS data published in the Russian and Chinese literature to be more inclusive and expand its spatial coverage. Papers published in English are the majority ( 98 %) of sources in SRDB, while papers published in other languages are rarely included (Bond-Lamberty and Thomson, 2018, 2010a). This reflects the dominance of English as the language of international science, but there are some data available from the Russian-language literature, representing data from a large area (Russia represents  11 % of the terrestrial land surface) and a variety of climate types and vegetation types. In addition, in MGRsD and HGRsD, there were some Chinese-language papers or recently published papers (103 studies,  5 % of the total studies in SRDB-V5) which were not included by SRDB. Now we have compiled data from those papers into SRDB-V5.

2.4 Data quality control

We developed an R (R Core Team, 2019) script to perform data quality and consistency checks. For example, the latitude and longitude fields have to be in specific ranges, otherwise a warning is raised. For details about the data constraints used to check each column in SRDB-V5, please see the “srdb_check.R” script, which is available in the GitHub repository and as part of every release download (https://github.com/bpbond/srdb/releases, last access: 31 January 2021). This script is also run on all pull requests to the Github repository, which enables us to flag data quality problems before changes are made to the database.

2.5 Data coverage analysis

We compared mean annual temperature (MAT) and mean annual precipitation (MAP) of sites from SRDB with the global MAT and MAP to test the representation of the SRDB. We connected the sites from SRDB with external climate data (Willmott and Matsuura, 2001) through latitude and longitude and obtained MAT and MAP. Barren area was masked according to the MODIS land cover (Friedl et al., 2002). Climate region was retrieved from the climate Köppen classification (Peel et al., 2007). We also obtained International Geosphere–Biosphere Programme (IGBP) vegetation classification of the SRDB sites by connecting IGBP classification data (IGBP, 1990); vegetation was grouped into agriculture, arctic, desert, tropical forest (tropic FOR), temperate & boreal forest (T&B FOR), grassland, savanna, shrubland, urban, and wetland. If the MAT and MAP distribution of SRDB sites is similar to that of global MAT and MAP distribution, it should mean that the SRDB better represents the global flux RS distribution as well. We also assume that as data sample size increases, the new database (e.g., SRDB-V5) should improve its representation compared with the older version (e.g., SRDB-V1). We tested the representation of sites in different vegetation types (IGBP, 1990).

https://essd.copernicus.org/articles/13/255/2021/essd-13-255-2021-f02

Figure 2Spatial distribution of soil respiration (RS) sites. The gray circles are RS sites from the fourth version of the global soil respiration database (SRDB-V4; n= 1584); the red dots are sites from the literature published in Chinese and added in the fifth version of the global soil respiration database (SRDB-V5; n= 41); the orange dots represent sites from the literature published in Russian and added in SRDB-V5 (n= 16); the blue dots are sites from the literature published in other languages (mainly in English) and added in the SRDB-V5 (n= 840). The size of circles represents the sample size at each measurement site (i.e., bigger circles represent more data).

3 Results

The number of records of SRDB-V5 is much larger compared with older versions. Collecting RS measurements from newly published literature (until 2017) greatly improves the total number of observations in the database (increased from 6633 to 10 366) in SRDB-V5 but only somewhat improved its spatial coverage (Fig. 2). The Northern Hemisphere mid-latitude regions, where SRDB-V4 has the most RS sites, had the largest RS increase in SRDB-V5 as well (blue dots in Fig. 2). Adding literature in Chinese did not substantially improve the spatial coverage either, possibly because more and more RS measurements in China have been published in the English scientific literature. However, most sites in China are from the eastern part of the country, and measurements from western China, if available, will be important to include in future SRDB updates. We collected  50 papers published in Russian, but only 14 of them ( 0.7 % of total studies of all languages in SRDB-V5) met the criteria (see Bond-Lamberty and Thomson, 2010a, for details) and were included in the database. This small number of papers nonetheless substantially improved the database's spatial coverage of the Russian landmass (orange circles in Fig. 2).

https://essd.copernicus.org/articles/13/255/2021/essd-13-255-2021-f03

Figure 3Comparison of mean annual temperature (MAT; C) around the globe (in red) vs. MAT from the sites in the global soil respiration database (SRDB; in teal) by the vegetation types. SRDB-V4 represents the older SRDB released in 2018, and SRDB-V5 represents the newest SRDB published in 2020. Data from SRDB cover 10 vegetation types (agriculture, arctic, desert, tropical forest (tropic FOR), temperate and boreal forest (T&B FOR), grassland, savanna, shrubland, urban, and wetland). Comparing the fourth version (SRDB-V4) to the newest version (SRDB-V5), MAT values of agriculture, forest, and grassland sites generally well represent the global MAT; in contrast, MAT from shrubland sites in the database did not well represent global means in the older SRDB-V4, but their representation significantly improved in the newest SRDB-V5; for other vegetation types (arctic, desert, savanna, urban, and wetland (including peatland) in the right panel), the MAT of the database sites does not well represent the global MAT distribution. Note that the barren region was masked using MODIS land cover data. The number within each panel represents the number of records for each vegetation type.

Download

https://essd.copernicus.org/articles/13/255/2021/essd-13-255-2021-f04

Figure 4Comparison of mean annual precipitation (MAP; mm) around the globe (in red) vs. MAP from the sites in the global soil respiration database (SRDB; in teal) by the vegetation types. SRDB-V4 is the older SRDB published in 2018, and SRDB-V5 is the newest SRDB published in 2020. Data from SRDB covered 10 vegetation types (see Fig. 3). Sites from agriculture, savanna, forest, and urban generally well represent the global MAP (left panel), while sites from arctic, desert, grassland, shrubland, and wetland (including peatland) do not have a good MAP representation (right panel). Note that the barren region was masked using MODIS land cover data. The number of records of each panel is the same as Fig. 3.

Download

MAT and MAP distribution of SRDB sites are very similar to global distribution in agriculture, forest, and grassland regions, indicating good representativeness of SRDB sites in these three types of vegetation (Figs. 3 and 4). For shrublands, sites in the oldest versions of the database (e.g., SRDB-V4) did not represent the global distribution well, but this distribution was greatly improved as more RS measurements were included in SRDB-V5 (Fig. 3). Sites from other vegetation types, however, were less representative of the corresponding global climate space, with barren lands masked out (Fig. 3, right panel). More specifically, arctic sites in SRDB have relatively narrow MAT and MAP coverage compared with the global arctic MAT and MAP distribution, probably because many regions in the arctic are covered by snow all year round, and thus it is difficult to measure RS at those sites (Virkkala et al., 2019). Desert SRDB sites have lower MAT but higher MAP than the global distribution, probably because (1) the disproportionate number of samples in temperate regions (Fig. 2) means that most samples in deserts are likely from wetter deserts; (2) the Sahara has low MAP and high MAT and covers a large area of the world, but few studies were conducted there, so that area of the world may simply represent the bias; and (3) many “deserts” that have been studied are in relatively close proximity to urban developments (e.g., southwestern USA, southern Europe), and those deserts are neither as harsh nor extensive as the Sahara. Urban and savanna sites in SRDB had lower MAT compared to their global distribution, probably because many tropical cities and savannas in South America, Asia, and Africa were rarely measured (Jian et al., 2020; Martin et al., 2012). We suggest that papers written in other languages, especially those in Portuguese, Spanish, and French, could potentially increase the RS measurements in South America and Africa.

https://essd.copernicus.org/articles/13/255/2021/essd-13-255-2021-f05

Figure 5Comparison of annual soil respiration (RS) and seasonal RS (growing, dry, and wet seasons; spring, summer, autumn, and winter) observations from SRDB-V4 vs. those from SRDB-V5. In summary, adding new measurements does not change the distribution of annual RS or seasonal RS in the databases.

Download

Adding new measurements in SRDB-V5 has substantially increased total observations, and the spatial coverage of sites was improved compared with SRDB-V4 (Fig. 2). However, the distributions of annual RS and seasonal RS (growing, dry, wet, spring, summer, autumn, and winter season RS) were similar in the SRDB-V5 compared to SRDB-V4 (Fig. 5). We suspect that new RS measurements are collected disproportionately from the same regions as previously sampled, and thus future studies should focus more on those regions with fewer data. For the future SRDB update, measurements from the Southern Hemisphere, desert, arctic, and tropical forests, if available, will be important to include.

4 Discussion

4.1 Forecasting global RS, RH, and RA

The updated SRDB-V5 provides opportunities for constraining global RS estimates in the future. Currently, estimated global RS ranged from 68–101 Pg C yr−1, with many uncertainties associated with measurements and propagation of errors evident when upscaling site-specific RS measurements to regional and global scales (Bond-Lamberty and Thomson, 2010b; Jian et al., 2018a, b; Raich et al., 2002; Raich and Potter, 1995; Raich and Schlesinger, 1992; Warner et al., 2019). For example, RS has been usually measured during daylight hours, implicitly assuming that measurements during this period represent the mean daily RS. In a water-limited ecosystem, however, Cueva et al. (2017) estimated a time-of-day bias ranging from 29 % to +40 %. On the global scale, based on the HGRsD, Jian et al. (2018c) found that not measuring RS 24 h continuously contributed less than 6 % of bias when estimating diurnal RS. Quantifying the amount of bias required detailed information about when RS was measured and how long the measurement lasted (Jian et al., 2018c). In the SRDB-V5, we revised all the studies and collected the “Time_of_day” information, which should enable future analyses of how RS measurement bias is related to when RS measurements were collected.

It is also widely accepted that chamber properties (e.g., volume, area) (Davidson et al., 2002) and collar insertion depth (Heinemeyer et al., 2011) affect the RS measurement accuracy, but on a global scale, this has not been quantitatively tested before to our knowledge. We added information in the SRDB-V5 to enable researchers to investigate whether chamber area (smaller chambers are more vulnerable to edge effects, while larger chambers may experience inadequate air mixing), collar height (which may affect air mixing in the chamber), and insertion depth (which may cut off roots) affect RS measurement accuracy and bias at seasonal to annual scales.

Comparing SRDB-V1 through SRDB-V5, we found that the uneven spatial distribution of RS sites has improved, but bias still remains, with measurements conducted unevenly around the world and in climate space (Figs. 2–4). The reason for the spatially uneven coverage of RS sites is a combination of economy, national policy, environmental conditions, spatial heterogeneity, and many other issues. Most obviously, the Northern Hemisphere has much more data than the Southern Hemisphere as the most economically developed and wealthiest countries tend to be in the middle latitude of the Northern Hemisphere, and thus more funds, infrastructure, and a broader and deeper pool of students and technical experts are all available to support on-site RS measurement in these regions.

Improving modeling frameworks may help mitigate the uneven spatial distribution of RS sites. For example, Jian et al. (2018b) found that how RS responds to temperature is significantly different among climate regions, and therefore climate-specific models may be more appropriate than a single global model to estimate global RS. Alternatively, machine-learning approaches that account for non-linearity and multiple potential combinations of environmental factors have been used to estimate global RS (Warner et al., 2019). SRDB-V5 also significantly increased the RS sample size, and analyses could be conducted to test whether the increasing sample size of RS helps reduce uncertainty when upscaling from site- to global-scale RS. We recognize that there are many other possible sources of bias, but it is nonetheless possible that the biogeochemistry community will be able to use SRDB-V5 to improve the confidence of global RS modeling and constrain global carbon cycle estimates.

Linking SRDB-V5, MGRsD, HGRsD, and COSORE provides an opportunity for global RH and RA estimates. Soil respiration mainly consists of two parts, RH and RA, but it is difficult to separate these two components, and much fewer RH and RA data are available in the SRDB (Bond-Lamberty and Thomson, 2010a). Due to a lack of data, far fewer studies have analyzed RH and RA and estimated global RH and RA in the past decades. According to our knowledge, there are only four global RH (or RA) estimates based on the very limited extant data (n<500) (Hashimoto et al., 2015; Konings et al., 2019; Tang et al., 2020; Warner et al., 2019). In the “srdb-equations” file, response of RH and RA to temperature and moisture information will be recorded, which will inspire the study of RH and RA and how they respond to temperature and soil moisture in the future. Further, we argue that a big advantage of global soil respiration databases with finer temporal resolution (i.e., MGRsD, HGRsD, and COSORE) is that the sample size of RH and RA could be greatly increased (e.g., sample size could be increased 10-fold if using a monthly timescale). In addition, the spatial coverage of RH and RA data could also be improved. Based on the monthly RH and RA data and how they relate to environmental conditions (such as temperature and precipitation), monthly global RH and RA products could be generated, which provide useful data products for the earth system models' (ESMs) benchmarking. The disadvantages of the smaller-timescale databases (MGRsD, HGRsD, and COSORE) is that those databases usually have much less spatial coverage, and much more data are available from the growing season than from the non-growing season. Therefore, spatial upscaling including time may result in additional bias and associated uncertainty that must be carefully investigated.

4.2 Perspective

The updated SRBD-V5 will further support the analysis of how different manipulations affect RS. In the past decades, many field experiments have been conducted to study different questions, for example, how soil carbon responds to global climatic warming and changes in precipitation patterns (Vicca et al., 2014) or how human activities (forest management, agriculture cultivation, and pollution) affect terrestrial carbon cycling and soil carbon stock (Carrillo et al., 2014; Jasek et al., 2014). However, inconsistent results from different experiments have generated debate regarding the effects of environmental factors and manipulations in RS. Now SRDB-V5 includes RS measurements from both control and different kinds of treatments, providing opportunities for synthesis analysis of how manipulation affects RS. However, these treatment data about RS measurements were rarely used in the past decade as the manipulation information in older versions of SRDB was not standardized and thus could not easily be used. The updated and standardized SRDB-V5 manipulation codes have the potential to enable manipulation-driven studies on the macro to global scale.

4.3 Future improvements

We made an effort to resolve some issues in the old versions of SRDB (V1–V4), but the database needs to be continuously improved in the future. There is much more potentially useful information that could be included in future SRDB updates, although it is important to remember that every additional piece of information comes with a never-ending cost (in terms of data entry time, quality assurance and quality control, etc).

  1. Number_of_collar: the number of collars within a certain study area is important information to evaluate the representability of the RS measurements.

  2. Soil organic carbon (SOC): SOC measured in situ or obtained from regional or global datasets should be compiled into the database (Guevara et al., 2020; Hengl et al., 2017).

  3. Currently, Site_ID in SRDB-V5 is only comparable with Site_ID of MGRsD and HGRsD; further updates to Site_ID are necessary so it can connect with more external datasets (e.g., FLUXNET, COSORE, and AmeriFlux and a global database of forest carbon stocks and fluxes (ForC); Anderson-Teixeira et al., 2018).

  4. Annual_soil_moisture: including a mean value of soil moisture or intra-annual soil variability derived from remote sensing (Guevara and Vargas, 2019) when this variable was not measured at the site.

In addition, some meta information can be improved. For example, there are still 276 manipulation types in the SRDB-V5 and many manipulation types (n= 96 out of 276) with only one row of records. Efforts could be made in the database update to further simplify the manipulation of SRDB. We recognize that with thousands of publications included in the SRDB, it is known that some entries are incorrect, and some information may have been missed during literature collection. In the past years, users have pointed out many data input errors and missing data issues in the SRDB; we made a great effort to check, and many corrections have been made. However, it is inevitable that mistakes and missing information still exist; therefore, there is a pressing need to continue with the development of quality assurance and quality control for each update.

4.4 Reducing interoperability barriers

High interoperability is needed to maximize the benefits of SRDB-V5 to improve our understanding of the global carbon cycle. Interoperability has been defined as an organized collective effort with the ultimate goal to maximize sharing and using information to produce knowledge, and high interoperability is achieved by reducing conceptual, technological, organizational, and cultural barriers (Vargas et al., 2017). The improved SRDB-V5 has reduced conceptual barriers as it provides a standardized and replicable framework to organize global RS information that has been used for over a decade (Bond-Lamberty and Thomson, 2010a). It has reduced technological barriers by improving standardization of data fields (see Tables 1–3) and data formats compatible with other databases as well as and providing flexible R scripts (for details please see Sect. 2.4) in a Github repository for end users and potential data contributors. We recognize that measuring RS has other technological barriers (e.g., standardization of instrumentation, electrical power supply) that limit the collection of new measurements in harsh environments or wide implementation in developing countries. Organizational barriers remain a challenge as this is a bottom-up effort in need of long-term support to continue improving the quality and the development of the new versions of the SRDB. Finally, we believe that cultural barriers have been reduced as the global scientific community has improved in recognizing the importance of standardized databases and data sharing following FAIR principles.

5 Code availability

All data and code to reproduce the results in this study can be found at https://doi.org/10.5281/zenodo.3876443 (Jian and Bond-Lamberty, 2020).

6 Data availability

Findability and accessibility were well considered and described when SRDB-V1 was published (Bond-Lamberty and Thomson, 2010a). To summarize the updating progress, SRDB-V1 was the first full available dataset, released on 28 May 2010; SRDB-V2 was released on 13 March 2012, and RS data of publications from 2011 were integrated into the database; SRDB-V3 was released on 4 August 2014, and RS data of the literature from 2012 were collected and added; SRDB-V4 was released on 21 November 2018, and RS data of the literature through 2015 were collected and compiled into the database; SRDB-V5 was released on 24 April 2020, and RS data of the literature from 2017 were collected and added (Jian and Bond-Lamberty, 2020). The version release information was recorded at the Oak Ridge National Laboratory's Distributed Active Archive Center (ORNL-DAAC). All data and code to reproduce the results in this study can be found at https://doi.org/10.5281/zenodo.3876443 (Jian and Bond-Lamberty, 2020).

Using and citing SRDB-V5

SRDB-V5 can be used for individual, academic, research, commercial, and other purposes and can be repackaged without written permission. Research and non-research products using SRDB-V5 should cite this publication.

7 Conclusions

A global soil respiration database (SRDB) was developed to integrate soil respiration measurements from the globe a decade ago. Since the first release in 2010 (SRDB-V1), it has been widely used to advance our understanding of carbon-dynamic-related questions. Here, we restructured SRDB to a new version (SRDB-V5) following FAIR principles. We show that the SRDB substantially improved its representativeness compared with the older versions (SRDB-V1 to SRDB-V4; Figs. S1 and S2 in the Supplement) and improved its spatial coverage. A primary goal of SRDB-V5 is to improve the interoperability and reusability and make it possible for scientists to contribute in the future with the ultimate goal to improve our understanding of the global carbon cycle. With those goals in mind, the revised SRDB-V5 is now more user-friendly for the ecology, biogeochemistry, and modeling communities.

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/essd-13-255-2021-supplement.

Author contributions

BBL and JJ designed the new version of the global soil respiration database (SRDB-V5). BBL searched and downloaded the new papers until 2017 and compiled the meta-information. BBL, MH, RM, JM, DP, and JJ contributed to data collection; NK collected data in Russian; KAT and VH raised many useful suggestions while working to integrate with ForC; RV and ES provided feedback and insights in all phases. JJ wrote the paper in close collaboration with all authors.

Competing interests

The authors declare that they have no conflict of interest.

Acknowledgements

This research was supported by the US Department of Energy, Office of Science, Biological and Environmental Research, as part of the Terrestrial Ecosystem Science program. The Pacific Northwest National Laboratory is operated for the DOE by the Battelle Memorial Institute under contract DE-AC05-76RL01830. We would like to thank Dalei Hao for his help with the MODIS land cover data processing. Rodrigo Vargas acknowledges support from NASA CMS (grant no. 80NSSC18K0173).

Financial support

This research has been supported by the Terrestrial Ecosystem Science program (grant no. DE-AC05-76RL01830) and the NASA CMS grant (grant no. 80NSSC18K0173).

Review statement

This paper was edited by Attila Demény and reviewed by two anonymous referees.

References

Anderson-Teixeira, K. J., Wang, M. M. H., McGarvey, J. C., Herrmann, V., Tepley, A. J., Bond-Lamberty, B., and LeBauer, D. S.: ForC: a global database of forest carbon stocks and fluxes, Ecology, 99, 1507, https://doi.org/10.1002/ecy.2229, 2018. 

Barba, J., Cueva, A., Bahn, M., Barron-Gafford, G. A., Bond-Lamberty, B., Hanson, P. J., Jaimes, A., Kulmala, L., Pumpanen, J., Scott, R. L., Wohlfahrt, G., and Vargas, R.: Comparing ecosystem and soil respiration: Review and key challenges of tower-based and soil measurements, Agr. Forest Meteorol., 249, 434–443, https://doi.org/10.1016/j.agrformet.2017.10.028, 2018. 

Batjes, N. H.: Harmonized soil property values for broad-scale modelling (WISE30sec) with estimates of global soil carbon stocks, Geoderma, 269, 61–68, https://doi.org/10.1016/j.geoderma.2016.01.034, 2016. 

Bond-Lamberty, B. and Thomson, A.: A global database of soil respiration data, Biogeosciences, 7, 1915–1926, https://doi.org/10.5194/bg-7-1915-2010, 2010a. 

Bond-Lamberty, B. and Thomson, A.: Temperature-associated increases in the global soil respiration record, Nature, 464, 579–582, https://doi.org/10.1038/nature08930, 2010b. 

Bond-Lamberty, B., Wang, C., and Gower, S. T.: A global relationship between the heterotrophic and autotrophic components of soil respiration?, Glob. Change Biol., 10, 1756–1766, https://doi.org/10.1111/j.1365-2486.2004.00816.x, 2004. 

Bond-Lamberty, B. and Thomson, A. M.: A Global Database of Soil Respiration Data, Version 4.0, ORNL DAAC, available at: https://daac.ornl.gov/cgi-bin/download.pl?ds_id=1578&source=schema_org_metadata (last access: 31 January 2021), 2018. 

Bond-Lamberty, B., Christianson, D. S., Malhotra, A., Pennington, S. C., Sihi, D., AghaKouchak, A., and Ataka, M.: COSORE: A community database for continuous soil respiration and other soil-atmosphere greenhouse gas flux data, Glob. Change Biol., 26, 7268–7283, https://doi.org/10.1111/gcb.15353, 2020. 

Carrillo, Y., Dijkstra, F. A., Pendall, E., LeCain, D., and Tucker, C.: Plant rhizosphere influence on microbial C metabolism: the role of elevated CO2, N availability and root stoichiometry, Biogeochemistry, 117, 229–240, 2014. 

Cueva, A., Bullock, S. H., López-Reyes, E., and Vargas, R.: Potential bias of daily soil CO2 efflux estimates due to sampling time, Sci. Rep.-UK, 7, 11925, https://doi.org/10.1038/s41598-017-11849-y, 2017. 

Davidson, E. A., Belk, E., and Boone, R. D.: Soil water content and temperature as independent or confounded factors controlling soil respiration in a temperate mixed hardwood forest, Glob. Change Biol., 4, 217–227, https://doi.org/10.1046/j.1365-2486.1998.00128.x, 1998. 

Davidson, E. A., Savage, K., Verchot, L. V., and Navarro, R.: Minimizing artifacts and biases in chamber-based measurements of soil respiration, Agr. Forest Meteorol., 113, 21–37, https://doi.org/10.1016/S0168-1923(02)00100-4, 2002. 

Epule, T. E.: A New Compendium of Soil Respiration Data for Africa, Challenges, 6, 88–97, https://doi.org/10.3390/challe6010088, 2015. 

Friedl, M. A., McIver, D. K., Hodges, J. C. F., Zhang, X. Y., Muchoney, D., Strahler, A. H., Woodcock, C. E., Gopal, S., Schneider, A., Cooper, A., Baccini, A., Gao, F., and Schaaf, C.: Global land cover mapping from MODIS: algorithms and early results, Remote Sens. Environ., 83, 287–302, https://doi.org/10.1016/S0034-4257(02)00078-0, 2002. 

Guevara, M. and Vargas, R.: Downscaling satellite soil moisture using geomorphometry and machine learning, PLoS One, 14, e0219639, https://doi.org/10.1371/journal.pone.0219639, 2019. 

Guevara, M., Arroyo, C., and Brunsell, N.: Soil Organic Carbon across Mexico and the conterminous United States (1991–2010), available at: https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/2019GB006219?casa_token=LWL2D4HFN4wAAAAA:pB-LbLOifYvo83VtBTvbtUEBAPpiALnhv2mShJoEYGD0QSJ_7_VHPuSkF-lYrRm0SxYkMuUuxXsekMzP (last access: 31 January 2021), 2020. 

Hashimoto, S., Carvalhais, N., Ito, A., Migliavacca, M., Nishina, K., and Reichstein, M.: Global spatiotemporal distribution of soil respiration modeled using a global database, Biogeosciences, 12, 4121–4132, https://doi.org/10.5194/bg-12-4121-2015, 2015. 

Heinemeyer, A., Di Bene, C., Lloyd, A. R., Tortorella, D., Baxter, R., Huntley, B., Gelsomino, A., and Ineson, P.: Soil respiration: implications of the plant-soil continuum and respiration chamber collar-insertion depth on measurement and modelling of soil CO2 efflux rates in three ecosystems, Eur. J. Soil Sci., 62, 82–94, https://doi.org/10.1111/j.1365-2389.2010.01331.x, 2011. 

Hengl, T., Mendes de Jesus, J., Heuvelink, B. M., Gerard, B. M., Heuvelink, B. M. G., Ruiperez Gonzalez, M., Kilibarda, M., Blagotic, A., Shangguan, W., Wright, M. N., Geng, X., Bauer-Marschallinger, B., Guevara, M. A., Vargas, R., MacMillan, R. A., Batjes, N. H., Leenaars, J. G. B., Ribeiro, E., Wheeler, I., Mantel, S., and Kempen, B.: SoilGrids250m: Global gridded soil information based on machine learning, PLoS One, 12, e0169748, https://doi.org/10.1371/journal.pone.0169748, 2017. 

IGBP: The International Geosphere-Biosphere Programme: A Study of Global Change, The Initial Core Projects, IGBP Secretariat, available at: http://www.igbp.net/about.4.6285fa5a12be4b403968000417.html (last access: 31 January 2021), 1990. 

Jasek, A., Zimnoch, M., Gorczyca, Z., Smula, E., and Rozanski, K.: Seasonal variability of soil CO2 flux and its carbon isotope composition in Krakow urban area, Southern Poland, Isotopes Environ. Health Stud., 50, 143–155, https://doi.org/10.1080/10256016.2014.868455, 2014. 

Jian, J. and Bond-Lamberty, B.: jinshijian/ESSD: SRDB-V5 first release (Version v1.0.0) [Data set], Zenodo, https://doi.org/10.5281/zenodo.3876443, 2020. 

Jian, J., Steele, M. K., Thomas, R. Q., Day, S. D., and Hodges, S. C.: Constraining estimates of global soil respiration by quantifying sources of variability, Glob. Change Biol., 24, 4143–4159, https://doi.org/10.1111/gcb.14301, 2018a. 

Jian, J., Steele, M. K., Day, S. D., and Thomas, R. Q.: Future global soil respiration rates will swell despite regional decreases in temperature sensitivity caused by rising temperature, Earths Future, 6, 1539–1554, https://doi.org/10.1029/2018EF000937, 2018b. 

Jian, J., Steele, M. K., Day, S. D., and Thomas, R. Q.: Measurement strategies to account for soil respiration temporal heterogeneity across diverse regions, Soil Biol. Biochem., 125, 167–177, available at: https://www.sciencedirect.com/science/article/pii/S0038071718302311 (last access: 31 January 2021), 2018c. 

Jian, J., Bahn, M., Wang, C., Bailey, V. L., and Bond-Lamberty, B.: Prediction of annual soil respiration from its flux at mean annual temperature, Agr. Forest Meteorol., 287, 107961, https://doi.org/10.1016/j.agrformet.2020.107961, 2020. 

Konings, A. G., Bloom, A. A., Liu, J., Parazoo, N. C., Schimel, D. S., and Bowman, K. W.: Global satellite-driven estimates of heterotrophic respiration, Biogeosciences, 16, 2269–2284, https://doi.org/10.5194/bg-16-2269-2019, 2019. 

Luo, Y. and Zhou, X.: Soil Respiration and the Environment, Elsevier, San Diego, California, USA, available at: https://play.google.com/store/books/details?id=BILt0bdU6AsC (last access: 31 January 2021), 2010. 

Martin, L. J., Blossey, B., and Ellis, E.: Mapping where ecologists work: biases in the global distribution of terrestrial ecological observations, Front. Ecol. Environ., 10, 195–201, https://doi.org/10.1890/110154, 2012. 

Peel, M. C., Finlayson, B. L., and McMahon, T. A.: Updated world map of the Köppen–Geiger climate classification, Hydrol. Earth Syst. Sci., 11, 1633–1644, https://doi.org/10.5194/hess-11-1633-2007, 2007. 

R Core Team: R: A Language and Environment for Statistical Computing, Version 3.6.1, available at: https://www.R-project.org/ (last access: 31 January 2021), 2019. 

Raich, J. W. and Potter, C. S.: Global patterns of carbon dioxide emissions from soils, Global Biogeochem. Cy., 9, 23–36, https://doi.org/10.1029/94GB02723, 1995. 

Raich, J. W. and Schlesinger, W. H.: The global carbon dioxide flux in soil respiration and its relationship to vegetation and climate, Tellus B, 44, 81–99, https://doi.org/10.1034/j.1600-0889.1992.t01-1-00001.x, 1992. 

Raich, J. W., Potter, C. S., and Bhagawati, D.: Interannual variability in global soil respiration, 1980–94, Glob. Change Biol., 8, 800–812, https://doi.org/10.1046/j.1365-2486.2002.00511.x, 2002. 

Schlesinger, W. H.: Carbon balance in terrestrial detritus, Annual Reviews in Ecology and Systematics, 8, 51–81, 1977. 

Tang, X., Fan, S., Du, M., Zhang, W., Gao, S., Liu, S., Chen, G., Yu, Z., and Yang, W.: Spatial and temporal patterns of global soil heterotrophic respiration in terrestrial ecosystems, Earth Syst. Sci. Data, 12, 1037–1051, https://doi.org/10.5194/essd-12-1037-2020, 2020. 

Tarnocai, C., Canadell, J. G., Schuur, E. A. G., Kuhry, P., Mazhitova, G., and Zimov, S.: Soil organic carbon pools in the northern circumpolar permafrost region, Global Biogeochem. Cy., 23, GB2023,https://doi.org/10.1029/2008GB003327, 2009. 

Vargas, R., Alcaraz-Segura, D., Birdsey, R., Brunsell, N. A., Cruz-Gaistardo, C. O., de Jong, B., Etchevers, J., Guevara, M., Hayes, D. J., Johnson, K., Loescher, H. W., Paz, F., Ryu, Y., Sanchez-Mejia, Z., and Toledo-Gutierrez, K. P.: Enhancing interoperability to facilitate implementation of REDD: case study of Mexico, Carbon Manag., 8, 57–65, https://doi.org/10.1080/17583004.2017.1285177, 2017. 

Vicca, S., Bahn, M., Estiarte, M., van Loon, E. E., Vargas, R., Alberti, G., Ambus, P., Arain, M. A., Beier, C., Bentley, L. P., Borken, W., Buchmann, N., Collins, S. L., de Dato, G., Dukes, J. S., Escolar, C., Fay, P., Guidolotti, G., Hanson, P. J., Kahmen, A., Kröel-Dulay, G., Ladreiter-Knauss, T., Larsen, K. S., Lellei-Kovacs, E., Lebrija-Trejos, E., Maestre, F. T., Marhan, S., Marshall, M., Meir, P., Miao, Y., Muhr, J., Niklaus, P. A., Ogaya, R., Peñuelas, J., Poll, C., Rustad, L. E., Savage, K., Schindlbacher, A., Schmidt, I. K., Smith, A. R., Sotta, E. D., Suseela, V., Tietema, A., van Gestel, N., van Straaten, O., Wan, S., Weber, U., and Janssens, I. A.: Corrigendum to “Can current moisture responses predict soil CO2 efflux under altered precipitation regimes? A synthesis of manipulation experiments”, Biogeosciences, 11, 3307–3308, https://doi.org/10.5194/bg-11-3307-2014, 2014. 

Virkkala, A.-M., Abdi, A. M., Luoto, M., and Metcalfe, D. B.: Identifying multidisciplinary research gaps across Arctic terrestrial gradients, Environ. Res. Lett., 14, 124061, https://doi.org/10.1088/1748-9326/ab4291, 2019.  

Warner, D. L., Bond-Lamberty, B., Jian, J., Stell, E., and Vargas, R.: Spatial Predictions and Associated Uncertainty of Annual Soil Respiration at the Global Scale, Global Biogeochem. Cy., 7, 983, https://doi.org/10.1029/2019GB006264, 2019. 

Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J.-W., da Silva Santos, L. B., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., Gonzalez-Beltran, A., Gray, A. J. G., Groth, P., Goble, C., Grethe, J. S., Heringa, J., 't Hoen, P. A. C., Hooft, R., Kuhn, T., Kok, R., Kok, J., Lusher, S. J., Martone, M. E., Mons, A., Packer, A. L., Persson, B., Rocca-Serra, P., Roos, M., van Schaik, R., Sansone, S.-A., Schultes, E., Sengstag, T., Slater, T., Strawn, G., Swertz, M. A., Thompson, M., van der Lei, J., van Mulligen, E., Velterop, J., Waagmeester, A., Wittenburg, P., Wolstencroft, K., Zhao, J., and Mons, B.: The FAIR Guiding Principles for scientific data management and stewardship, Sci. Data, 3, 160018, https://doi.org/10.1038/sdata.2016.18, 2016. 

Willmott, C. J. and Matsuura, K.: Terrestrial air temperature and precipitation: Monthly and annual time series (1950–1999) Version 1.02, Center for Climatic Research, University of Delaware, Newark, 2001. 

Download
Short summary
Field soil-to-atmosphere CO2 flux (soil respiration, Rs) observations were compiled into a global database (SRDB) a decade ago. Here, we restructured and updated the database to the fifth version, SRDB-V5, with data published through 2017 included. SRDB-V5 aims to be a data framework for the scientific community to share seasonal to annual field Rs measurements, and it provides opportunities for the scientific community to better understand the spatial and temporal variability of Rs.