A biomass equation dataset for common shrub species in China

Wang, Yang; Xu, Wenting; Tang, Zhiyao; Xie, Zongqiang

doi:10.5194/essd-2021-44

Preprints

https://doi.org/10.5194/essd-2021-44

Preprints

20 May 2021

| 20 May 2021

Status: this preprint was under review for the journal ESSD but the revision was not accepted.

A biomass equation dataset for common shrub species in China

Yang Wang, Wenting Xu, Zhiyao Tang, and Zongqiang Xie

Abstract. Shrub biomass equations provide an accurate, efficient and convenient method in estimating biomass of shrubland ecosystems and biomass of the shrub layer in forest ecosystems at various spatial and temporal scales. In recent decades, many shrub biomass equations have been reported mainly in journals, books and postgraduate's dissertations. However, these biomass equations are applicable for limited shrub species with respect to a large number of shrub species widely distributed in China, which severely restricted the study of terrestrial ecosystem structure and function, such as biomass, production, and carbon budge. Therefore, we firstly carried out a critical review of published literature (from 1982 to 2019) on shrub biomass equations in China, and then developed biomass equations for the dominant shrub species using a unified method based on field measurements of 738 sites in shrubland ecosystems across China. Finally, we constructed the first comprehensive biomass equation dataset for China’s common shrub species. This dataset consists of 822 biomass equations specific to 167 shrub species and has significant representativeness to the geographical, climatic and shrubland vegetation features across China. The dataset is freely available at https://doi.org/10.11922/sciencedb.00641 for noncommercial scientific applications, and this dataset fills a significant gap in woody biomass equations and provides key parameters for biomass estimation in studies on terrestrial ecosystem structure and function.

Received: 08 Feb 2021 – Discussion started: 20 May 2021

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Yang Wang, Wenting Xu, Zhiyao Tang, and Zongqiang Xie

Status: closed

CC1:
'Comment on essd-2021-44', Dayong Fan, 06 Aug 2021
In the present study, the authors developed the first biomass equation dataset for common shrub species in China. This impressive dataset is systematic and comprehensive, providing the critical base for the terrestrial ecosystem biomass and carbon sink capacity evaluation in China for future studies.

Here I have some comments:
The dataset needs more clarifications. 1) in the “Description” page, there are two identical symbols named “n” with different meanings, and two identical symbols named “R2[R]” with different meanings. 2) in the “General” page, what do the “xx” and “mx” mean? 3) in the “Equation” page, symbol “H”, “C”, “Ac”, “D”, and “M” have not been defined. 4) the year of the raw data acquired should be mentioned for each equation. 5) the longitude and latitude, or the longitude and latitude range in which the raw data was collected and corresponding equation was generated, should be provided. The last two points are important, because the journal has one principle for the data description: “Specify the temporal and geographical scopes, and temporal and spatial resolutions of your data wherever appropriate”.

The text to explain the dataset needs some revisions. 1) line 40, “Representative researches” should be “Representative researches in China”. 2) line 100, it is good if a schematic diagram could be presented to show the difference among the three types of shrub species. 3) the style of parameters in functions should be 4) line 148, the word “Consequently” can be replaced by “Collectively”. 5) Line 176, “The sample size varied from 5 and 312 shrubs”, this means in some samples data cannot be split into 10% and 90%, and the 90% part cannot be split into 75% and 25% for accuracy test. How many of them? And what’s the lowest limit of the sample size which can satisfy the statistical requirement of the recommended method? 6) line 183 and later, “valve” should be “value”.
Citation: https://doi.org/10.5194/essd-2021-44-CC1
- AC1: 'Reply on CC1', Yang Wang, 17 Oct 2021
  
  Thank you very much for your detailed suggestions. The specific revisions are as follows:
  
  1、In the Description sheet, the first “n” is the number of shrub samples used in equation creation, we revised it to “n in equation creation”; the second “n” is the number of shrub samples used in equation evaluation, we revised it to “n in equation evaluation”.
  
  The first “R2[R]” is the goodness-of-fit statistics used in equation creation, we revised it to “R2[R] in equation creation”; the second “R2[R]” is the equation evaluation statistics used in equation evaluation, we revised it to “n in equation evaluation”.
  
  2、In the General sheet, we revised “xx” and “mx” to “inf” and “equ” which are short for “information” and “equation”, respectively.
  
  3、In the Equation sheet, we clearly illustrated the meaning of symbols in “Remarks” column, include “H”, “C”, “Ac”, “Vc”, “D”, “D10”, “P”, “N”, “Ma” and “M”.
  
  4、Field measured data of shrubs were obtained from 2011 to 2013, we added this information in the Introduction sheet instead of demonstrating it for each equation.
  
  5、In the General sheet, we added the longitude and latitude, or the longitude and latitude range in which the raw data was collected and corresponding equation was generated.
  
  6、We revised the text according to your suggestions in line 40, line148 and line183.
  
  7、In some cases, equations with small sample sizes lack validation, they are all from the literature. The lowest limit of the sample size of each shrub species obtained from field measurement is 16.
  
  Citation: https://doi.org/10.5194/essd-2021-44-AC1
CC2:
'Comment on essd-2021-44', Xiangping Wang, 07 Aug 2021

This manuscript (MS) reports a large dataset for allometric equations to estimate shrub biomass, for a variety of species and sites across China. Interestingly, it seems that most equations were constructed by the present study based on sampling of 738 sites using a unified method. This greatly improved the quality and comparability of the equations, which avoid the weakness of compiling data from literatures (which were generally somehow different in sampling methods and thus led to uncertainty in data quality). Consequently, the dataset is clearly useful for improving the carbon pool/sink estimation of shrub ecosystems.
Considering the importance of this dataset, here I have some suggestions for the authors to improve the MS:
1) The Methods section needs to be clearer. As mentioned above, a major advantage of the dataset is that they have many equations based on their own measurements (the abstract said that they have 738 sites). However, the methods to obtain these equations were introduced together with the equations complied from literatures. This leads the readers not very clear about the details of the methods. For instance, did they obtain one or more equations for each of the 738 sites? How many equations from the 822 equations were measured by this study? In my opinion, similar methods issues are better introduced independent of the equations collected from the literatures.
2) As for the equations from the literatures, these are also good data in supplementary to the measured equations. However, the methods to compile, select and validate equations from literatures clearly are different, and may be better to be introduced in another section.
3) In the Excel file reporting the data, I suggest to add a column in the “Equation” sheet, which clearly indicate the data source of the equation (e.g. “this study” or “the reference”). Presently, this information is reported in the “General” sheet. This is not convenient for the readers (personally I would prefer the equations measured by the present study, as explained above). Meanwhile, I also suggest the ranges of shrub height, crown, diameter etc. to be given for each equation in the “Equation” sheet. These ranges are critical for readers to determine whether an equation can be used for their estimation. However, these ranges are now only given for each species, which is not convenient for the readers.
4) The abbreviations in the “Equation” sheet seemed not well described. For instance, what does Vc, Ma, Ac, N, etc. mean? I did not find these in the “Description” sheet. Did I miss something?

Citation: https://doi.org/10.5194/essd-2021-44-CC2
- AC2: 'Reply on CC2', Yang Wang, 17 Oct 2021
  
  Thank you very much for your revision suggestions.
  
  1、In the “Materials and methods” section, methods of equation collection from the literatures were illustrated in “2.1 Literature retrieval” and “2.2 Equation collection and screening”. Methods to create equations with field measured data were illustrated in “2.3 Equation creation and evaluation”.
  
  2、In the “Equation” sheet, we arranged information most directly related to equations, such as predictor variables, equation forms, equation coefficients and statistical parameters in equation evaluation, etc. Therefore, other information not most directly related to equations can be found in the “General” sheet through retrieval.
  
  3、Description of abbreviations such as “H”, “C”, “Ac”, “D”, “M” in the “Equation” sheet were added in the “Description” sheet.
  
  Citation: https://doi.org/10.5194/essd-2021-44-AC2

Status: closed

CC1:
'Comment on essd-2021-44', Dayong Fan, 06 Aug 2021
In the present study, the authors developed the first biomass equation dataset for common shrub species in China. This impressive dataset is systematic and comprehensive, providing the critical base for the terrestrial ecosystem biomass and carbon sink capacity evaluation in China for future studies.

Here I have some comments:
The dataset needs more clarifications. 1) in the “Description” page, there are two identical symbols named “n” with different meanings, and two identical symbols named “R2[R]” with different meanings. 2) in the “General” page, what do the “xx” and “mx” mean? 3) in the “Equation” page, symbol “H”, “C”, “Ac”, “D”, and “M” have not been defined. 4) the year of the raw data acquired should be mentioned for each equation. 5) the longitude and latitude, or the longitude and latitude range in which the raw data was collected and corresponding equation was generated, should be provided. The last two points are important, because the journal has one principle for the data description: “Specify the temporal and geographical scopes, and temporal and spatial resolutions of your data wherever appropriate”.

The text to explain the dataset needs some revisions. 1) line 40, “Representative researches” should be “Representative researches in China”. 2) line 100, it is good if a schematic diagram could be presented to show the difference among the three types of shrub species. 3) the style of parameters in functions should be 4) line 148, the word “Consequently” can be replaced by “Collectively”. 5) Line 176, “The sample size varied from 5 and 312 shrubs”, this means in some samples data cannot be split into 10% and 90%, and the 90% part cannot be split into 75% and 25% for accuracy test. How many of them? And what’s the lowest limit of the sample size which can satisfy the statistical requirement of the recommended method? 6) line 183 and later, “valve” should be “value”.
Citation: https://doi.org/10.5194/essd-2021-44-CC1
- AC1: 'Reply on CC1', Yang Wang, 17 Oct 2021
  
  Thank you very much for your detailed suggestions. The specific revisions are as follows:
  
  1、In the Description sheet, the first “n” is the number of shrub samples used in equation creation, we revised it to “n in equation creation”; the second “n” is the number of shrub samples used in equation evaluation, we revised it to “n in equation evaluation”.
  
  The first “R2[R]” is the goodness-of-fit statistics used in equation creation, we revised it to “R2[R] in equation creation”; the second “R2[R]” is the equation evaluation statistics used in equation evaluation, we revised it to “n in equation evaluation”.
  
  2、In the General sheet, we revised “xx” and “mx” to “inf” and “equ” which are short for “information” and “equation”, respectively.
  
  3、In the Equation sheet, we clearly illustrated the meaning of symbols in “Remarks” column, include “H”, “C”, “Ac”, “Vc”, “D”, “D10”, “P”, “N”, “Ma” and “M”.
  
  4、Field measured data of shrubs were obtained from 2011 to 2013, we added this information in the Introduction sheet instead of demonstrating it for each equation.
  
  5、In the General sheet, we added the longitude and latitude, or the longitude and latitude range in which the raw data was collected and corresponding equation was generated.
  
  6、We revised the text according to your suggestions in line 40, line148 and line183.
  
  7、In some cases, equations with small sample sizes lack validation, they are all from the literature. The lowest limit of the sample size of each shrub species obtained from field measurement is 16.
  
  Citation: https://doi.org/10.5194/essd-2021-44-AC1
CC2:
'Comment on essd-2021-44', Xiangping Wang, 07 Aug 2021

This manuscript (MS) reports a large dataset for allometric equations to estimate shrub biomass, for a variety of species and sites across China. Interestingly, it seems that most equations were constructed by the present study based on sampling of 738 sites using a unified method. This greatly improved the quality and comparability of the equations, which avoid the weakness of compiling data from literatures (which were generally somehow different in sampling methods and thus led to uncertainty in data quality). Consequently, the dataset is clearly useful for improving the carbon pool/sink estimation of shrub ecosystems.
Considering the importance of this dataset, here I have some suggestions for the authors to improve the MS:
1) The Methods section needs to be clearer. As mentioned above, a major advantage of the dataset is that they have many equations based on their own measurements (the abstract said that they have 738 sites). However, the methods to obtain these equations were introduced together with the equations complied from literatures. This leads the readers not very clear about the details of the methods. For instance, did they obtain one or more equations for each of the 738 sites? How many equations from the 822 equations were measured by this study? In my opinion, similar methods issues are better introduced independent of the equations collected from the literatures.
2) As for the equations from the literatures, these are also good data in supplementary to the measured equations. However, the methods to compile, select and validate equations from literatures clearly are different, and may be better to be introduced in another section.
3) In the Excel file reporting the data, I suggest to add a column in the “Equation” sheet, which clearly indicate the data source of the equation (e.g. “this study” or “the reference”). Presently, this information is reported in the “General” sheet. This is not convenient for the readers (personally I would prefer the equations measured by the present study, as explained above). Meanwhile, I also suggest the ranges of shrub height, crown, diameter etc. to be given for each equation in the “Equation” sheet. These ranges are critical for readers to determine whether an equation can be used for their estimation. However, these ranges are now only given for each species, which is not convenient for the readers.
4) The abbreviations in the “Equation” sheet seemed not well described. For instance, what does Vc, Ma, Ac, N, etc. mean? I did not find these in the “Description” sheet. Did I miss something?

Citation: https://doi.org/10.5194/essd-2021-44-CC2
- AC2: 'Reply on CC2', Yang Wang, 17 Oct 2021
  
  Thank you very much for your revision suggestions.
  
  1、In the “Materials and methods” section, methods of equation collection from the literatures were illustrated in “2.1 Literature retrieval” and “2.2 Equation collection and screening”. Methods to create equations with field measured data were illustrated in “2.3 Equation creation and evaluation”.
  
  2、In the “Equation” sheet, we arranged information most directly related to equations, such as predictor variables, equation forms, equation coefficients and statistical parameters in equation evaluation, etc. Therefore, other information not most directly related to equations can be found in the “General” sheet through retrieval.
  
  3、Description of abbreviations such as “H”, “C”, “Ac”, “D”, “M” in the “Equation” sheet were added in the “Description” sheet.
  
  Citation: https://doi.org/10.5194/essd-2021-44-AC2

Yang Wang, Wenting Xu, Zhiyao Tang, and Zongqiang Xie

Data sets

A biomass equation dataset for common shrub species in China Yang Wang, Wenting Xu, Zhiyao Tang, Zongqiang Xie https://doi.org/10.11922/sciencedb.00641

Yang Wang, Wenting Xu, Zhiyao Tang, and Zongqiang Xie

Viewed

Total article views: 2,849 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
2,019	701	129	2,849	130	185

HTML: 2,019
PDF: 701
XML: 129
Total: 2,849
BibTeX: 130
EndNote: 185

Views and downloads (calculated since 20 May 2021)

Month	HTML	PDF	XML	Total
May 2021	132	24	4	160
Jun 2021	49	7	1	57
Jul 2021	31	8	0	39
Aug 2021	48	25	3	76
Sep 2021	26	6	2	34
Oct 2021	41	28	5	74
Nov 2021	36	23	2	61
Dec 2021	17	11	2	30
Jan 2022	16	9	1	26
Feb 2022	20	1	0	21
Mar 2022	17	3	2	22
Apr 2022	18	7	1	26
May 2022	21	7	1	29
Jun 2022	8	5	2	15
Jul 2022	24	6	0	30
Aug 2022	18	10	4	32
Sep 2022	30	9	1	40
Oct 2022	25	3	3	31
Nov 2022	16	7	0	23
Dec 2022	32	6	2	40
Jan 2023	25	11	0	36
Feb 2023	18	6	1	25
Mar 2023	35	9	2	46
Apr 2023	18	6	0	24
May 2023	13	5	2	20
Jun 2023	11	8	1	20
Jul 2023	12	9	1	22
Aug 2023	7	9	1	17
Sep 2023	37	5	1	43
Oct 2023	41	8	2	51
Nov 2023	19	9	1	29
Dec 2023	18	3	1	22
Jan 2024	32	13	4	49
Feb 2024	26	13	2	41
Mar 2024	35	13	4	52
Apr 2024	19	2	7	28
May 2024	25	4	8	37
Jun 2024	16	7	3	26
Jul 2024	20	7	6	33
Aug 2024	25	5	3	33
Sep 2024	22	7	0	29
Oct 2024	14	6	0	20
Nov 2024	16	4	0	20
Dec 2024	13	5	0	18
Jan 2025	30	5	3	38
Feb 2025	24	8	0	32
Mar 2025	41	13	4	58
Apr 2025	23	7	2	32
May 2025	13	7	2	22
Jun 2025	26	11	0	37
Jul 2025	32	19	4	55
Aug 2025	52	10	1	63
Sep 2025	309	8	0	317
Oct 2025	40	57	5	102
Nov 2025	35	28	3	66
Dec 2025	28	32	1	61
Jan 2026	38	12	3	53
Feb 2026	41	23	2	66
Mar 2026	37	34	6	77
Apr 2026	52	22	2	76
May 2026	44	12	3	59
Jun 2026	3	5	2	10
Jul 2026	9	9	0	18

Cumulative views and downloads (calculated since 20 May 2021)

Month	HTML	PDF	XML	Total
May 2021	132	24	4	160
Jun 2021	49	7	1	57
Jul 2021	31	8	0	39
Aug 2021	48	25	3	76
Sep 2021	26	6	2	34
Oct 2021	41	28	5	74
Nov 2021	36	23	2	61
Dec 2021	17	11	2	30
Jan 2022	16	9	1	26
Feb 2022	20	1	0	21
Mar 2022	17	3	2	22
Apr 2022	18	7	1	26
May 2022	21	7	1	29
Jun 2022	8	5	2	15
Jul 2022	24	6	0	30
Aug 2022	18	10	4	32
Sep 2022	30	9	1	40
Oct 2022	25	3	3	31
Nov 2022	16	7	0	23
Dec 2022	32	6	2	40
Jan 2023	25	11	0	36
Feb 2023	18	6	1	25
Mar 2023	35	9	2	46
Apr 2023	18	6	0	24
May 2023	13	5	2	20
Jun 2023	11	8	1	20
Jul 2023	12	9	1	22
Aug 2023	7	9	1	17
Sep 2023	37	5	1	43
Oct 2023	41	8	2	51
Nov 2023	19	9	1	29
Dec 2023	18	3	1	22
Jan 2024	32	13	4	49
Feb 2024	26	13	2	41
Mar 2024	35	13	4	52
Apr 2024	19	2	7	28
May 2024	25	4	8	37
Jun 2024	16	7	3	26
Jul 2024	20	7	6	33
Aug 2024	25	5	3	33
Sep 2024	22	7	0	29
Oct 2024	14	6	0	20
Nov 2024	16	4	0	20
Dec 2024	13	5	0	18
Jan 2025	30	5	3	38
Feb 2025	24	8	0	32
Mar 2025	41	13	4	58
Apr 2025	23	7	2	32
May 2025	13	7	2	22
Jun 2025	26	11	0	37
Jul 2025	32	19	4	55
Aug 2025	52	10	1	63
Sep 2025	309	8	0	317
Oct 2025	40	57	5	102
Nov 2025	35	28	3	66
Dec 2025	28	32	1	61
Jan 2026	38	12	3	53
Feb 2026	41	23	2	66
Mar 2026	37	34	6	77
Apr 2026	52	22	2	76
May 2026	44	12	3	59
Jun 2026	3	5	2	10
Jul 2026	9	9	0	18

Viewed (geographical distribution)

Total article views: 2,737 (including HTML, PDF, and XML) Thereof 2,737 with geography defined and 0 with unknown origin.

Country	#	Views	%

Cited

Latest update: 21 Jul 2026

Short summary

A dataset consists of 822 biomass equations specific to 167 shrub species in China was developed based on field measurement and literature review. The equations featured excellent goodness-of-fit (mean value of R² and Fitness Index are larger than 0.8) and prediction precision (mean value of slope, R² and Relative Error of the simple linear regression between predicted and measured data are 0.96, 0.85 and −4.1%). The dataset provides key parameters for terrestrial ecosystem biomass estimation.


Total:	0
HTML:	0
PDF:	0
XML:	0

A biomass equation dataset for common shrub species in China

Data sets

Viewed

Viewed (geographical distribution)

Cited

6 citations as recorded by crossref.