the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A new cropland area database by country circa 2020
Francesco Nicola Tubiello
Giulia Conchedda
Leon Casse
Pengyu Hao
Giorgia De Santis
Zhongxin Chen
Abstract. We describe a new dataset of cropland area circa the year 2020, with global coverage, with data for 221 countries and territories and 34 regional aggregates. Data are generated from geospatial information on the agreement-disagreement characteristics of six open access high-resolution cropland maps derived from remote sensing. The cropland area mapping dataset (CAM) provides information on: i) mean cropland area and its uncertainty; ii) cropland area by six distinct cropland agreement classes; and iii) cropland area by specific combinations of underlying land cover product. The data indicated that world cropland area is 1500 ± 400 million hectares (Mha) (mean and 95 % confidence interval), with a relative uncertainty of 25 % that increased across regions. It was 50 % in Central Asia (40 ± 20 Mha), South America (180 ± 80 Mha), and Southern Europe (40 ± 20 Mha); up to 40 % in Australia and New Zealand (50 ± 20 Mha), South-eastern Asia (80 ± 30 Mha) and Southern Africa (16 ± 6 Mha). Conversely, cropland area was estimated with better precision, i.e. smaller uncertainties in the range 10 %–25 % in Southern Asia (230 ± 30 Mha), Northern America (200 ± 40 Mha), Northern Africa (40 ± 10 Mha), Eastern and Western Europe (40 ± 10 Mha). The new data can be used to investigate coherence of information across the six underlying products, as well as to investigate important disagreement features. Overall, 70 % or more of the estimated mean cropland area globally and by region corresponded to good agreement of underlying land cover maps–four or more. Conversely, in Africa cropland area estimates found significant disagreement, highlighting mapping difficulties in complex landscapes. Finally, the new cropland area data were consistent with FAOSTAT in 15 out of 18 world regions, and for 114 out of 182 countries with a cropland area above 10 kha. By helping to highlight features of cropland characteristics and underlying causes for agreement/disagreement across land cover products, the CAM dataset can be used as a tool to assess quality of country statistics and help guide future mapping efforts towards improved agricultural monitoring. Data are publicly available at: https://doi.org/10.5281/zenodo.7987515 (Tubiello et al., 2023a).
- Preprint
(1587 KB) - Metadata XML
- BibTeX
- EndNote
Francesco Nicola Tubiello et al.
Status: closed
-
RC1: 'Comment on essd-2023-211', Anonymous Referee #1, 11 Jul 2023
Summary
Reliable mapping cropland area is essential for assessing and monitoring the sustainability of agriculture from regional to global scale. Here Tubiello et al., generated country-level crop area dataset based on six remote sensing-derived crop area dataset at the global scale. The generated dataset agreed well with FAO land use statistics; with the generated dataset, the authors conducted a comprehensive analysis regarding the crop area and its uncertainty. The manuscript is generally well organized, and the research is important. Here, I listed a few concerns regarding the manuscript.
Specific comments
1) Line 47-49, since there have been cropland agreement maps in the previous study, is it suitable to refer to the new dataset in this study as “Cropland Agreement Mapping dataset”? It seems that the name did not reflect the difference between the dataset in this study and that of previous study. My understanding is that one major difference between the two datasets is that this study aggregated the previous dataset to the country level, correct?
2) Line 50, 25% of what? I cannot understand this sentence.
3) Line 86-87, actually, the R2 and MAE between the crop area of CAM and FAO can be compared with the performance between the crop area of a single crop data (one of the six crop area data) and FAO.
4) Line 118-119, how did you aggregate the pixel-level information to national scale? What kind of coordinate system is used for the dataset? Did you consider the area differences across different grid cells or pixels? Please clarify it.
5) Line 127, “Conversely” or “Similarly”?
6) Line 131, “definitional bias would be higher for SAk classes” not clear. Here you mean “definitional bias would be higher for SAk classes with lower k values” correct? Please clarify it.
7) Line 153, Pearson correlation coefficient is R instead of R2, here you mean R square?
8) Line 171-174, it’s hard to follow this part. I assume many readers would have problems similar to me. What’s relative uncertainty, what does “communicate” here mean, is it a commonly used mathematical term? Here “dimension” corresponds to number of samples? Please clarify those terms or definitions and use plain language to explain what does this part exactly mean.
9) Fig. 2-3, what does the red dashed line and black line represent, respectively? Please clarify it in the figure caption. Will the R2 between the crop area of CAM and FOASTAT be higher than that between the crop area of a single data (one of the six crop area data)? Will the NRME be lower by using multiple data ensemble mean?
10) Fig.4, the label and unit of y-axis should be given
11) Fig. 5, is this regression line statistically significant? Please show the p-value.
12) Fig.7, please add necessary description in the figure caption to explain what does the size of the circle and color respectively mean.
13) In addition to the results, brief discussion about how the findings could aid further development of land cover or crop area products could potentially make this study more influential.Citation: https://doi.org/10.5194/essd-2023-211-RC1 - AC1: 'Reply on RC1', Francesco N. Tubiello, 18 Aug 2023
-
RC2: 'Comment on essd-2023-211', Anonymous Referee #2, 26 Jul 2023
Tubiello et al., have presented an important contribution to the field through the development of a novel cropland area mapping dataset (CAM). This dataset, which aggregates information from six high-resolution remote sensing-based global cropland maps, includes an estimation of uncertainty at country and regional levels. The systematic analysis of agreement and disagreement among the six contributing maps has provided valuable insights into the underlying uncertainties associated not only with individual map accuracies but also with their definitional differences.
The manuscript is generally well-written and presents the data in a clear and effective manner. The identification of regions with large uncertainty could prove particularly useful for future endeavors in land cover and land use product development and data evaluation.
Nonetheless, there are a few points that could use further clarification:
Line 153: It is mentioned that R2 is the Pearson correlation coefficient, which seems to be a misunderstanding. R2 is the coefficient of determination.
Figure 1: I noticed a large area of low agreement in the Southeastern United States. It would be beneficial to readers if you could provide a more in-depth discussion about this. Specifically, what are the differences in how each of the six datasets defines 'cropland' in these areas? Are these differences primarily due to methodological variations or could there be other underlying factors contributing to the disparity?
Additionally, it would be helpful if you could discuss the limitations of your approach, including any potential biases and how they might be addressed in future research.
I hope these comments help to further strengthen the manuscript.
Citation: https://doi.org/10.5194/essd-2023-211-RC2 - AC2: 'Reply on RC2', Francesco N. Tubiello, 18 Aug 2023
Status: closed
-
RC1: 'Comment on essd-2023-211', Anonymous Referee #1, 11 Jul 2023
Summary
Reliable mapping cropland area is essential for assessing and monitoring the sustainability of agriculture from regional to global scale. Here Tubiello et al., generated country-level crop area dataset based on six remote sensing-derived crop area dataset at the global scale. The generated dataset agreed well with FAO land use statistics; with the generated dataset, the authors conducted a comprehensive analysis regarding the crop area and its uncertainty. The manuscript is generally well organized, and the research is important. Here, I listed a few concerns regarding the manuscript.
Specific comments
1) Line 47-49, since there have been cropland agreement maps in the previous study, is it suitable to refer to the new dataset in this study as “Cropland Agreement Mapping dataset”? It seems that the name did not reflect the difference between the dataset in this study and that of previous study. My understanding is that one major difference between the two datasets is that this study aggregated the previous dataset to the country level, correct?
2) Line 50, 25% of what? I cannot understand this sentence.
3) Line 86-87, actually, the R2 and MAE between the crop area of CAM and FAO can be compared with the performance between the crop area of a single crop data (one of the six crop area data) and FAO.
4) Line 118-119, how did you aggregate the pixel-level information to national scale? What kind of coordinate system is used for the dataset? Did you consider the area differences across different grid cells or pixels? Please clarify it.
5) Line 127, “Conversely” or “Similarly”?
6) Line 131, “definitional bias would be higher for SAk classes” not clear. Here you mean “definitional bias would be higher for SAk classes with lower k values” correct? Please clarify it.
7) Line 153, Pearson correlation coefficient is R instead of R2, here you mean R square?
8) Line 171-174, it’s hard to follow this part. I assume many readers would have problems similar to me. What’s relative uncertainty, what does “communicate” here mean, is it a commonly used mathematical term? Here “dimension” corresponds to number of samples? Please clarify those terms or definitions and use plain language to explain what does this part exactly mean.
9) Fig. 2-3, what does the red dashed line and black line represent, respectively? Please clarify it in the figure caption. Will the R2 between the crop area of CAM and FOASTAT be higher than that between the crop area of a single data (one of the six crop area data)? Will the NRME be lower by using multiple data ensemble mean?
10) Fig.4, the label and unit of y-axis should be given
11) Fig. 5, is this regression line statistically significant? Please show the p-value.
12) Fig.7, please add necessary description in the figure caption to explain what does the size of the circle and color respectively mean.
13) In addition to the results, brief discussion about how the findings could aid further development of land cover or crop area products could potentially make this study more influential.Citation: https://doi.org/10.5194/essd-2023-211-RC1 - AC1: 'Reply on RC1', Francesco N. Tubiello, 18 Aug 2023
-
RC2: 'Comment on essd-2023-211', Anonymous Referee #2, 26 Jul 2023
Tubiello et al., have presented an important contribution to the field through the development of a novel cropland area mapping dataset (CAM). This dataset, which aggregates information from six high-resolution remote sensing-based global cropland maps, includes an estimation of uncertainty at country and regional levels. The systematic analysis of agreement and disagreement among the six contributing maps has provided valuable insights into the underlying uncertainties associated not only with individual map accuracies but also with their definitional differences.
The manuscript is generally well-written and presents the data in a clear and effective manner. The identification of regions with large uncertainty could prove particularly useful for future endeavors in land cover and land use product development and data evaluation.
Nonetheless, there are a few points that could use further clarification:
Line 153: It is mentioned that R2 is the Pearson correlation coefficient, which seems to be a misunderstanding. R2 is the coefficient of determination.
Figure 1: I noticed a large area of low agreement in the Southeastern United States. It would be beneficial to readers if you could provide a more in-depth discussion about this. Specifically, what are the differences in how each of the six datasets defines 'cropland' in these areas? Are these differences primarily due to methodological variations or could there be other underlying factors contributing to the disparity?
Additionally, it would be helpful if you could discuss the limitations of your approach, including any potential biases and how they might be addressed in future research.
I hope these comments help to further strengthen the manuscript.
Citation: https://doi.org/10.5194/essd-2023-211-RC2 - AC2: 'Reply on RC2', Francesco N. Tubiello, 18 Aug 2023
Francesco Nicola Tubiello et al.
Data sets
Cropland Agreement MApping Dataset Francesco N. Tubiello, Giulia Conchedda, Leon Casse, Pengyu Hao, Giorgia De Santis, and Zhongxin Chen https://doi.org/10.5281/zenodo.7987515
Francesco Nicola Tubiello et al.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
615 | 213 | 19 | 847 | 10 | 15 |
- HTML: 615
- PDF: 213
- XML: 19
- Total: 847
- BibTeX: 10
- EndNote: 15
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1