GWL_FCS30: global 30 m wetland map with fine classification system using multi-sourced and time-series remote sensing imagery in 2020
- 1International Research Center of Big Data for Sustainable Development Goals, Beijing 100094, China
- 2Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
- 3School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
- 4College of Geomatics, Xi’an University of Science and Technology, Xi’an 710054, China
- 5North China University of Water Resources and Electric Power, Zhengzhou 450046, China
- 6School of Atmospheric Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Zhuhai 519082, Guangdong, China
- 1International Research Center of Big Data for Sustainable Development Goals, Beijing 100094, China
- 2Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
- 3School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
- 4College of Geomatics, Xi’an University of Science and Technology, Xi’an 710054, China
- 5North China University of Water Resources and Electric Power, Zhengzhou 450046, China
- 6School of Atmospheric Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Zhuhai 519082, Guangdong, China
Abstract. Wetlands, often called the “kidneys of the earth”, play an important role in maintaining ecological balance, conserving water resources, replenishing groundwater, and controlling soil erosion. Wetland mapping is very challenging because of its complicated temporal dynamics and large spatial and spectral heterogeneity. An accurate global 30-m wetland dataset that can simultaneously cover inland and coastal zones is lacking. This study proposes a novel method for wetland mapping by combining an automatic sample extraction method, multisource existing products, time-series satellite images, and a stratified classification strategy. This approach allowed for the generation of the first global 30-m wetland map with a fine classification system (GWL_FCS30), including four inland wetland sub-categories (swamp, marsh, flooded flat, and saline) and three coastal wetland sub-categories (mangrove, salt marsh, and tidal flats), which was developed using Google Earth Engine platform. We first combined existing multi-sourced global wetland products, expert knowledge, training sample refinement rules, and visual interpretation to generate a large and geographically distributed wetland training samples. Second, we integrated the time-series Landsat reflectance products and Sentinel-1 SAR imagery to generate water-level and phenological information to capture the complicated temporal dynamics and spectral heterogeneity of wetlands. Third, we applied a stratified classification strategy and the local adaptive random forest classification models to produce the wetland dataset with a fine classification system at each 5°×5° geographical tile in 2020. Lastly, the GWL_FCS30, mosaicked by 961 5°×5° regional wetland maps, was validated using 18,701 validation samples, which achieved an overall accuracy of 87.7 % and a kappa coefficient of 0.810. The cross-comparisons with other global wetland products demonstrated that the GWL_FCS30 dataset performed better in capturing the spatial patterns of wetlands and had significant advantages over the diversity of wetland subcategories. The statistical analysis showed that the global wetland area reached 3.57 million km2, including 3.10 million km2 of inland wetlands and 0.47 million km2 of coastal wetlands, approximately 62.3 % of which were distributed poleward of 40° N. Therefore, we can conclude that the proposed method is suitable for large-area wetland mapping and that the GWL_FCS30 dataset is an accurate wetland mapping product that has the potential to provide vital support for wetland management. The GWL_FCS30 dataset in 2020 is freely available at https://doi.org/10.5281/zenodo.6575731 (Liu et al. 2022).
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(2891 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Journal article(s) based on this preprint
Xiao Zhang et al.
Interactive discussion
Status: closed
-
RC1: 'Comment on essd-2022-180', Anonymous Referee #1, 01 Sep 2022
In terms of the whole workload, there is no doubt that this paper carried out a lot of data processing and analysis. The submitted MS also has good writing. However, I still have some cocerns about the method and accuracy of classification results. My major comments are as below.
1. Besides the mangrove, how did you consider other coastal swamp in the classification system?
2. As mentioned, the wetlands have clear seasonal changes within a year? how did you define the final date for the wetland product?
3. Besides the time-series feature? did you consisder other featuresï¼
4. How did you training the model? One model for the global wetland or one model per grid? There are distinctly phenological differences for the different wetland types and even the same wetland type. Please clarify it.
5. the authors used Sentinel SAR data, why do not produce the 10 m resolution product based on Sentinel 2?- AC1: 'Reply on RC1', Liangyun Liu, 22 Nov 2022
-
RC2: 'Comment on essd-2022-180', Yangfan Li, 12 Sep 2022
The submitted manuscript provides a global wetland map including inland and tidal sub-classes based on remote sensing data. Currently, we are still lacking a multi-class global wetland data including inland and tidal wetlands simultaneously, and the map produced by this work provides valuable information for related wetland studies. The manuscript is well-written and easy to follow. Above all, I recommend their publication provided that a moderate revision is carried out.
(1) Wetlands are classified as inland or coastal wetlands in this study, and the latter includes mangroves, salt marshes, and tidal flats. For these three wetland types, the term “tidal wetlands” is more appropriate than “coastal wetlands”, for example, in Murray et al., 2022. Coastal wetlands include other terrestrial and shoreline constituents like riparian wetlands and tidal freshwater marshes, but not just mangroves, salt marshes and tidal flats. As such, I suggest using “tidal wetlands” to make the classification system more accurate.
(2) Section 2.4: This section is about generating validation samples, thus should be moved to the “Accuracy assessment” section as a validation step. Another thing is how did the authors determine the size of total validation samples (i.e., 18,701)? This amount seems disproportionately less than the number of training samples (more than 20 million).
(3) Lines 250-255: The tidal flat samples were collected from the global tidal flat map (Murray et al., 2019), and thus would suffer from the inherent error of the data. Several studies found that Murray’s tidal flat map failed to distinguish between nearshore ponds and tidal flats, mainly because these ponds also have water-level variations (Jia et al., 2021; Zhang et al., 2022). The error of commission (i.e., classifying ponds into tidal flats) is also indicated in the tidal flat map generated by this study, as shown in the upper panels of Fig. 13. I suggest the authors mask out ponds and lakes from their tidal flat map because it would substantially improve the accuracy. There is a new dataset that provides global lakes and reservoirs may be helpful: Khandelwal et al. 2022.
(4) Line 286: These thresholds proposed by Wang et al. 2020 were designed for tidal wetlands, but their application in this study was to inland wetlands. Therefore, the authors need to prove that these thresholds have robust performance in mapping inland wetlands.
(5) Line 297, Equation 3: This maximum extent of inland wetlands also contains tidal wetlands (since the wetland layer in the global land cover data failed to distinguish them), so how did the authors ensure that the generated samples from inland wetland have corrected labels?
(6) Section 4.2: The description for obtaining training samples is unclear. What are the strata here, wetland classes or 5°×5° tiles? In addition, the training samples selected from the maximum wetland extent may be of low quality. The authors do explain that the map accuracy is insensitive to low-quality samples within a 20% threshold, but it’s still missing a map representing the percentage of real erroneous samples. I think the training samples need to be filtered according to some criterion before classification to improve their accuracy. I recommend clarifying the process of sample generating and the quality-control procedures.
(7) Section 5.3: The comparison here uses the old-version GMW mangrove map. However, the GMW mangrove map was updated to version 3.0 recently (Bunting et al., 2022), which substantially improved the accuracy by filling gaps caused by the strips in the Landsat-7 images. A detailed comparison with this new version is encouraged. Also, a product of global tidal wetland dynamics provided by Murray et al. (2022) could be an important reference for comparison.
(8) Figure 8 lacks a legend.
Reference
Bunting, P., Rosenqvist, A., Hilarides, L., Lucas, R. M., Thomas, N., Tadono, T., ... & Rebelo, L. M. (2022). Global Mangrove Extent Change 1996–2020: Global Mangrove Watch Version 3.0. Remote Sensing, 14(15), 3657.
Jia, M., Wang, Z., Mao, D., Ren, C., Wang, C., & Wang, Y. (2021). Rapid, robust, and automated mapping of tidal flats in China using time series Sentinel-2 images and Google Earth Engine. Remote Sensing of Environment, 255, 112285.
Khandelwal, A., Karpatne, A., Ravirathinam, P., Ghosh, R., Wei, Z., Dugan, H. A., ... & Kumar, V. (2022). ReaLSAT, a global dataset of reservoir and lake surface area variations. Scientific Data, 9(1), 1-12.
Murray, N. J., Phinn, S. R., DeWitt, M., Ferrari, R., Johnston, R., Lyons, M. B., ... & Fuller, R. A. (2019). The global distribution and trajectory of tidal flats. Nature, 565(7738), 222-225.
Murray, N. J., Worthington, T. A., Bunting, P., Duce, S., Hagger, V., Lovelock, C. E., ... & Lyons, M. B. (2022). High-resolution mapping of losses and gains of Earth’s tidal wetlands. Science, 376(6594), 744-749.
Zhang, Z., Xu, N., Li, Y., & Li, Y. (2022). Sub-continental-scale mapping of tidal wetland composition for East Asia: A novel algorithm integrating satellite tide-level and phenological features. Remote Sensing of Environment, 269, 112799.
- AC2: 'Reply on RC2', Liangyun Liu, 22 Nov 2022
-
RC3: 'Comment on essd-2022-180', Anonymous Referee #3, 14 Sep 2022
The authors developed a global wetland mapping product based on multiple approaches in the GEE environment, called the GWL_FCS30. They reported some 3.6 million km2 of global wetlands, making the data freely available. The authors’ efforts are laudable, yet I have many concerns about the presentation and the analyses themselves that preclude my acceptance of this paper for publication. For the presentation, I would argue that the paper itself is overly long and dense. The approaches could be more clearly articulated and sign-posted for the readers. Parts that are results are introduced in the Discussion section (e.g., some validation data, as I note below) and the length of the paper makes it a long slog. However, my main issues are with the analytical approaches and base assumption. First, the authors introduce wetlands in the very first sentence using the Ramsar Convention definition to include waters up to 6 m in depth. Then, they go on to conduct their analysis but exclude any and all inland open waters as they are assumed to be greater than 6 m in depth. They backstop their findings on global wetland abundance by stating at L535 “the estimated total wetland area in this study was more reasonable [than four previous analyses] because permanent water bodies with depths of more than six meters were not considered wetlands, according to the RAMSAR (sic) Convention…”. The assumption that any and all open water on the global landmass is >6m in depth – and hence not possibly a wetland – does not resonate. Yes, larger and deeper lakes could be greater than 6m. But open waters, especially smaller ones are frequently considered wetlands and are typically <6m in water depth (see, e.g., China’s State Forestry Administration [www.forestry.gov.cn] or a recent paper by Ye et al. (2022, https://doi.org/10.3390/w14071152); see also the Canadian Wetland Inventory [https://open.canada.ca/data/en/dataset/09f46d71-6feb-4f8f-8eb5-a58a58b06af5] or the United States National Wetlands Inventory [https://www.fws.gov/program/national-wetlands-inventory] identifying open waters as a wetland type). The point is that the Ramsar definition of wetlands is used, but then a major type of wetlands are excluded. The authors must acknowledge this in their study. For instance, it could be noted in the title and should definitely be noted in the abstract. I do wish that the authors would redo their analysis and incorporate open waters as a wetland type to include a major wetland type in their global analysis, alas.
A further issue I have with this paper is that the data are considered mis-classified if they occur as wetlands in an area outside the [wetland type] maximum extent. However, this max extent assumes that all the previous analyses had zero omission error. Another concern of mine is that their error assessment was done using a relatively paltry number of wetlands for the global extent of their analysis. For instance, they have ~8,000 wetland validation points to cover seven different wetland types. From Figure 2, it appears that ~7,000 of these points are inland “wetlands” versus coastal systems. Even with 7000 points for validation, that seems small considering the global extent of inland systems (swamps, marshes, flooded flats). And ~1000 points are used to validate the global population of saline, salt marsh, mangrove, and tidal flats. Their validation points were visually validated – though the authors explain five experts had to agree on the typology, the disagreements or data supporting those validations are also not presented. I would argue that there exist multiple independent data layers that could be used to provide a much greater assessment of their relative accuracy (perhaps in addition their visual validation). For instance, the Chinese SFA, Canadian CWI, US NWI are all available datasets for validation. Within the US, there’s also the National Land Cover Data (e.g., Wickham et al. 2018 that has the contiguous US land cover at 30 m pixel resolution, including both wetlands AND permanent water; https://doi.org/10.1080/01431161.2017.1410298).
Lastly, the Discussion section should focus on their position in the data libraries of the world and not have more results within (e.g., why do they have relatively few wetlands versus other global data?). Ultimately, there’s excitement and possibility with these data – the inclusion of multiple data layers and stacks in a random forest analysis within the GEE is exciting, especially considering the abundance of spatial data available for analyses. Yet while the authors have presented a welcome analysis, I find they leave enough to be desired to suggest a major revision to a) shorten, b) clarify approaches so that they can be repeated, c) appropriately and abundantly defend their approach to not include any open waters as a wetland type (which I do not agree with), d) place their findings against other datasets through accuracy analyses (e.g., CWI, NLCD, NWI, etc.) such that readers can determine that this data layer is better to use than those that have come before. We’re lacking that confidence at this juncture, at least from my point of view.
L43 Ramsar is a city in Iran and not an abbreviation to be capitalized.
L108: Is there indeed “…no 30-m dataset covering both inland and coastal wetlands” until now? One could argue that the authors introduce ~8 different data layers doing that. For instance, the ESA products, the CCI, etc. Tootchi et al. (2019), referenced in this paper, have Table 1, “Summary of water body, wetland, and related proxy maps and datasets from the literature” that summarize the state of the literature in 2019, too. ESA recently released a worldcover database at 10-m – how does this contrast to the authors’ analyses (and ESA includes herbaceous wetlands and mangroves as specific land covers; https://esa-worldcover.org/en ).
L119 Why 2019-2021? I recognize that the authors ended up with nearly 800,000 LS images, yet since the GEE can handle so much, why stop there? It’s not a fault, but the authors should explain why this time period was selected versus any other available time period.
L123 what are saturated pixels? How does CFMask assist w that (vs cloud, cloud shadow, and snow)?
L124 Which Landsat platforms were used? Which LS satellite data were used? What sort of processing was done on the LS images? Which bands were used? Etc. etc.
L125 LS images were used to select the “water level” or the presence of inundation as inferred from reflectance values?
L126 These are not necessarily clear sky, but they are images that passed through the CFMask filter. Please clarify in text.
L135 How did the authors discern what were sufficient Sentinel-1 images to “capture the temporal dynamics of wetlands”? What are those temporal dynamics? Seasonal? Intermittent inundation from rainstorms? Please clarify in text.
L138 How were the ASTER data used as ancillary information? Please specify how these data on slope, aspect, etc. were used here for the purposes of the paper. As I see later that it was used in the random forest, the authors need to introduce to the readers that a random forest approach is used (and conduct a literature review noting the utility of random forest and limitations).
L142 Figure 1 would be much clearer if it were a vertical panel of a) over b) versus a) next to b). Please modify. Also please change the caption to clarify that that images were not necessarily ‘clear sky’ but did otherwise pass the CFMask filter. See, e.g., L395.
L165 The JRC_GSW data layer does not identify wetlands per se but identifies inundated pixels. Therefore it is inaccurate to say that the JRC captured “wetlands around rivers, ponds, etc.” because the data layer would include rivers and ponds – or any pixel that was deemed to be inundated by the Pekel et al. (2016) algorithm. Please revise to acknowledge these data from Pekel identify inundated pixels. Note this also comes up with L281 wherein the authors state they are “excluding permanent water bodies”. Why? Permanent water bodies are a massive abundance of the global wetland data layers (e.g., in addition to the Ramsar Convention definition used earlier, see also
Davidson, N. C. 2014. How much wetland has the world lost? Long-term and recent trends in global wetland area. Marine and Freshwater Research 65: 934-941
Dixon, M. J. R. et al. 2016. Tracking global change in ecosystem area: the Wetland Extent Trends index. Biological Conservation 193: 27-35
Hu, S. et al. 2017. Global wetlands: Potential distribution, wetland loss, and status. Science of the Total Environment 586: 319-327
L169 Table 1 – considering this product is a global data layer, it would be useful to the readers to see the relative abundance of wetlands that each of these named datasets have identified. Furthermore, it’s important to note if indeed these are global products (versus near-global products, such as those within the latitudinal bands of 60N and 60S, for instance). Also convert the arc-seconds to meters (at the equator) for consistency between the data products.
L189 How many of the 18,701 data validation points did NOT have complete agreement between the five validation experts? Noting here that 8,355 points were used to discern amongst the seven classes of wetlands. Relative to the other possible ways to assess their study – and convince people to use it – this number of validation points is very small. Too small, by my assessment.
L207 There are many wetland definitions. That the Ramsar definition is quoted, noting that it includes waters to the depth of 6 m, suggests that open waters should be a wetland type in this analysis. I recognize that flooded flats – located along rivers and lakes – are included. But what of lakes themselves? Ponds? Smaller waters that are important to the global wetland data layer? Are these considered lakes? This is an important factor to consider when assessing global wetland coverage. Another consideration would be submergent vegetation. The marsh class is noted as including grasses, herbs, and low shrubs. What about, say, ponds covered with Nymphaea spp (lily pads)? What about Potamogeton spp. growing submersed in the water? Are these not wetland species? Wetland scientists would say they are. Here’s a good reference in re: this discussion:
Richardson, D. C., et al. 2022. A functional definition to distinguish ponds from lakes and wetlands. Scientific Reports 12(1): 10472.
L252 Provide a number of LS images used for this analysis.
L257 Clarify – 50 km buffer along the coastal zone between 60N – 90N are salt marsh? That seems to be quite excessive, a 50 km buffer. Please clarify.
L258 What’s the proportion of overlap between the different data layers? A spatial correlation table/matrix should be presented to the readers (see, e.g., Tootchi et al. 2019, supplemental information Table S1).
L270 These data were imported…and what was done with them?
L296 The GLWD data are at 1 km pixel. How did the authors include 1 km data plus all the 30-m data products? What’s the final resolution of these data? Also, what’s the proportion of the overlap between them (a spatial correlation table/matrix would be interesting here).
L338 This is the first time that the use of random forest was noted. In L138, I questioned how the ASTER data were used – I suggest revising the methods to introduce the reader early on to the overall approach (i.e., letting them know that the RF algorithm was used).
L341 Figure 4 the use of the ASTER DEM includes slope and aspect? Or was slope and DEM used? If the DEM was used, what information within the DEM was used? See, e.g., L400. Furthermore, the Landsat and Sentinel data were used for identifying inundated pixels. NOT for identifying water levels. I recommend changing the heading title in 4.1 as well.
L393 Ultimately, why were five LS clusters chosen versus three or just the one? Was parsimony considered in the analyses?
L404 This assumes that the maximum extent of the coastal wetlands (equation 1 for mangroves) has zero omission error. I understand why this was done, yet it requires explanation and accounting for the reader here and possibly in the Discussion section as well.
L410 The local adaptive modeling section is too quickly glossed over. Explain more on how this was done. How were the data trained? How many of the 961 5x5 tiles had zero coverage of wetlands (e.g., mid-ocean tiles)? What were the specifications of the training here? It would be hard for others to replicate the process based on the data provided thus far.
L413 What statistical program was used to conduct the RF analyses? Furthermore, while RF may have advantageous, it also has detractions. Please introduce the “obvious advantageous” for those who are not aware as well as mention some of the drawbacks.
L435 Note that 18k samples were analyzed across the globe. Consider the relative dearth noted in Figure 2 (see summary above).
L461 The authors need to introduce Figures 7-10 before introducing Figure 11.
L538 The point behind the Ramsar Convention’s of 6 m was to address depths that diving birds were expected/known to use aquatic systems. It is disingenuous to state that all permanent water bodies have depths >=6 m. This is a possibly fatal flaw in this analysis.
L555 It would be good to see the analyses done in Table 4 for these two areas shown in Figure 10. For instance, the authors have chosen to not include permanent water as a wetland type but yet show ‘water body’ in their panel map, which implies it was correctly mapped yet it is not a land use type they map.
L576 Figure 10 the panel caption for GWL_FCS30 doesn’t match the panel (GWM_FCS30).
L630 These selected training sample results should be in the Results section, not here.
L634 Was this inclusion of steps noted in the methods? I don’t recall it.
L675 These are results and need to be in that section explaining the outcomes of the RF analysis.
- AC3: 'Reply on RC3', Liangyun Liu, 22 Nov 2022
Peer review completion






Interactive discussion
Status: closed
-
RC1: 'Comment on essd-2022-180', Anonymous Referee #1, 01 Sep 2022
In terms of the whole workload, there is no doubt that this paper carried out a lot of data processing and analysis. The submitted MS also has good writing. However, I still have some cocerns about the method and accuracy of classification results. My major comments are as below.
1. Besides the mangrove, how did you consider other coastal swamp in the classification system?
2. As mentioned, the wetlands have clear seasonal changes within a year? how did you define the final date for the wetland product?
3. Besides the time-series feature? did you consisder other featuresï¼
4. How did you training the model? One model for the global wetland or one model per grid? There are distinctly phenological differences for the different wetland types and even the same wetland type. Please clarify it.
5. the authors used Sentinel SAR data, why do not produce the 10 m resolution product based on Sentinel 2?- AC1: 'Reply on RC1', Liangyun Liu, 22 Nov 2022
-
RC2: 'Comment on essd-2022-180', Yangfan Li, 12 Sep 2022
The submitted manuscript provides a global wetland map including inland and tidal sub-classes based on remote sensing data. Currently, we are still lacking a multi-class global wetland data including inland and tidal wetlands simultaneously, and the map produced by this work provides valuable information for related wetland studies. The manuscript is well-written and easy to follow. Above all, I recommend their publication provided that a moderate revision is carried out.
(1) Wetlands are classified as inland or coastal wetlands in this study, and the latter includes mangroves, salt marshes, and tidal flats. For these three wetland types, the term “tidal wetlands” is more appropriate than “coastal wetlands”, for example, in Murray et al., 2022. Coastal wetlands include other terrestrial and shoreline constituents like riparian wetlands and tidal freshwater marshes, but not just mangroves, salt marshes and tidal flats. As such, I suggest using “tidal wetlands” to make the classification system more accurate.
(2) Section 2.4: This section is about generating validation samples, thus should be moved to the “Accuracy assessment” section as a validation step. Another thing is how did the authors determine the size of total validation samples (i.e., 18,701)? This amount seems disproportionately less than the number of training samples (more than 20 million).
(3) Lines 250-255: The tidal flat samples were collected from the global tidal flat map (Murray et al., 2019), and thus would suffer from the inherent error of the data. Several studies found that Murray’s tidal flat map failed to distinguish between nearshore ponds and tidal flats, mainly because these ponds also have water-level variations (Jia et al., 2021; Zhang et al., 2022). The error of commission (i.e., classifying ponds into tidal flats) is also indicated in the tidal flat map generated by this study, as shown in the upper panels of Fig. 13. I suggest the authors mask out ponds and lakes from their tidal flat map because it would substantially improve the accuracy. There is a new dataset that provides global lakes and reservoirs may be helpful: Khandelwal et al. 2022.
(4) Line 286: These thresholds proposed by Wang et al. 2020 were designed for tidal wetlands, but their application in this study was to inland wetlands. Therefore, the authors need to prove that these thresholds have robust performance in mapping inland wetlands.
(5) Line 297, Equation 3: This maximum extent of inland wetlands also contains tidal wetlands (since the wetland layer in the global land cover data failed to distinguish them), so how did the authors ensure that the generated samples from inland wetland have corrected labels?
(6) Section 4.2: The description for obtaining training samples is unclear. What are the strata here, wetland classes or 5°×5° tiles? In addition, the training samples selected from the maximum wetland extent may be of low quality. The authors do explain that the map accuracy is insensitive to low-quality samples within a 20% threshold, but it’s still missing a map representing the percentage of real erroneous samples. I think the training samples need to be filtered according to some criterion before classification to improve their accuracy. I recommend clarifying the process of sample generating and the quality-control procedures.
(7) Section 5.3: The comparison here uses the old-version GMW mangrove map. However, the GMW mangrove map was updated to version 3.0 recently (Bunting et al., 2022), which substantially improved the accuracy by filling gaps caused by the strips in the Landsat-7 images. A detailed comparison with this new version is encouraged. Also, a product of global tidal wetland dynamics provided by Murray et al. (2022) could be an important reference for comparison.
(8) Figure 8 lacks a legend.
Reference
Bunting, P., Rosenqvist, A., Hilarides, L., Lucas, R. M., Thomas, N., Tadono, T., ... & Rebelo, L. M. (2022). Global Mangrove Extent Change 1996–2020: Global Mangrove Watch Version 3.0. Remote Sensing, 14(15), 3657.
Jia, M., Wang, Z., Mao, D., Ren, C., Wang, C., & Wang, Y. (2021). Rapid, robust, and automated mapping of tidal flats in China using time series Sentinel-2 images and Google Earth Engine. Remote Sensing of Environment, 255, 112285.
Khandelwal, A., Karpatne, A., Ravirathinam, P., Ghosh, R., Wei, Z., Dugan, H. A., ... & Kumar, V. (2022). ReaLSAT, a global dataset of reservoir and lake surface area variations. Scientific Data, 9(1), 1-12.
Murray, N. J., Phinn, S. R., DeWitt, M., Ferrari, R., Johnston, R., Lyons, M. B., ... & Fuller, R. A. (2019). The global distribution and trajectory of tidal flats. Nature, 565(7738), 222-225.
Murray, N. J., Worthington, T. A., Bunting, P., Duce, S., Hagger, V., Lovelock, C. E., ... & Lyons, M. B. (2022). High-resolution mapping of losses and gains of Earth’s tidal wetlands. Science, 376(6594), 744-749.
Zhang, Z., Xu, N., Li, Y., & Li, Y. (2022). Sub-continental-scale mapping of tidal wetland composition for East Asia: A novel algorithm integrating satellite tide-level and phenological features. Remote Sensing of Environment, 269, 112799.
- AC2: 'Reply on RC2', Liangyun Liu, 22 Nov 2022
-
RC3: 'Comment on essd-2022-180', Anonymous Referee #3, 14 Sep 2022
The authors developed a global wetland mapping product based on multiple approaches in the GEE environment, called the GWL_FCS30. They reported some 3.6 million km2 of global wetlands, making the data freely available. The authors’ efforts are laudable, yet I have many concerns about the presentation and the analyses themselves that preclude my acceptance of this paper for publication. For the presentation, I would argue that the paper itself is overly long and dense. The approaches could be more clearly articulated and sign-posted for the readers. Parts that are results are introduced in the Discussion section (e.g., some validation data, as I note below) and the length of the paper makes it a long slog. However, my main issues are with the analytical approaches and base assumption. First, the authors introduce wetlands in the very first sentence using the Ramsar Convention definition to include waters up to 6 m in depth. Then, they go on to conduct their analysis but exclude any and all inland open waters as they are assumed to be greater than 6 m in depth. They backstop their findings on global wetland abundance by stating at L535 “the estimated total wetland area in this study was more reasonable [than four previous analyses] because permanent water bodies with depths of more than six meters were not considered wetlands, according to the RAMSAR (sic) Convention…”. The assumption that any and all open water on the global landmass is >6m in depth – and hence not possibly a wetland – does not resonate. Yes, larger and deeper lakes could be greater than 6m. But open waters, especially smaller ones are frequently considered wetlands and are typically <6m in water depth (see, e.g., China’s State Forestry Administration [www.forestry.gov.cn] or a recent paper by Ye et al. (2022, https://doi.org/10.3390/w14071152); see also the Canadian Wetland Inventory [https://open.canada.ca/data/en/dataset/09f46d71-6feb-4f8f-8eb5-a58a58b06af5] or the United States National Wetlands Inventory [https://www.fws.gov/program/national-wetlands-inventory] identifying open waters as a wetland type). The point is that the Ramsar definition of wetlands is used, but then a major type of wetlands are excluded. The authors must acknowledge this in their study. For instance, it could be noted in the title and should definitely be noted in the abstract. I do wish that the authors would redo their analysis and incorporate open waters as a wetland type to include a major wetland type in their global analysis, alas.
A further issue I have with this paper is that the data are considered mis-classified if they occur as wetlands in an area outside the [wetland type] maximum extent. However, this max extent assumes that all the previous analyses had zero omission error. Another concern of mine is that their error assessment was done using a relatively paltry number of wetlands for the global extent of their analysis. For instance, they have ~8,000 wetland validation points to cover seven different wetland types. From Figure 2, it appears that ~7,000 of these points are inland “wetlands” versus coastal systems. Even with 7000 points for validation, that seems small considering the global extent of inland systems (swamps, marshes, flooded flats). And ~1000 points are used to validate the global population of saline, salt marsh, mangrove, and tidal flats. Their validation points were visually validated – though the authors explain five experts had to agree on the typology, the disagreements or data supporting those validations are also not presented. I would argue that there exist multiple independent data layers that could be used to provide a much greater assessment of their relative accuracy (perhaps in addition their visual validation). For instance, the Chinese SFA, Canadian CWI, US NWI are all available datasets for validation. Within the US, there’s also the National Land Cover Data (e.g., Wickham et al. 2018 that has the contiguous US land cover at 30 m pixel resolution, including both wetlands AND permanent water; https://doi.org/10.1080/01431161.2017.1410298).
Lastly, the Discussion section should focus on their position in the data libraries of the world and not have more results within (e.g., why do they have relatively few wetlands versus other global data?). Ultimately, there’s excitement and possibility with these data – the inclusion of multiple data layers and stacks in a random forest analysis within the GEE is exciting, especially considering the abundance of spatial data available for analyses. Yet while the authors have presented a welcome analysis, I find they leave enough to be desired to suggest a major revision to a) shorten, b) clarify approaches so that they can be repeated, c) appropriately and abundantly defend their approach to not include any open waters as a wetland type (which I do not agree with), d) place their findings against other datasets through accuracy analyses (e.g., CWI, NLCD, NWI, etc.) such that readers can determine that this data layer is better to use than those that have come before. We’re lacking that confidence at this juncture, at least from my point of view.
L43 Ramsar is a city in Iran and not an abbreviation to be capitalized.
L108: Is there indeed “…no 30-m dataset covering both inland and coastal wetlands” until now? One could argue that the authors introduce ~8 different data layers doing that. For instance, the ESA products, the CCI, etc. Tootchi et al. (2019), referenced in this paper, have Table 1, “Summary of water body, wetland, and related proxy maps and datasets from the literature” that summarize the state of the literature in 2019, too. ESA recently released a worldcover database at 10-m – how does this contrast to the authors’ analyses (and ESA includes herbaceous wetlands and mangroves as specific land covers; https://esa-worldcover.org/en ).
L119 Why 2019-2021? I recognize that the authors ended up with nearly 800,000 LS images, yet since the GEE can handle so much, why stop there? It’s not a fault, but the authors should explain why this time period was selected versus any other available time period.
L123 what are saturated pixels? How does CFMask assist w that (vs cloud, cloud shadow, and snow)?
L124 Which Landsat platforms were used? Which LS satellite data were used? What sort of processing was done on the LS images? Which bands were used? Etc. etc.
L125 LS images were used to select the “water level” or the presence of inundation as inferred from reflectance values?
L126 These are not necessarily clear sky, but they are images that passed through the CFMask filter. Please clarify in text.
L135 How did the authors discern what were sufficient Sentinel-1 images to “capture the temporal dynamics of wetlands”? What are those temporal dynamics? Seasonal? Intermittent inundation from rainstorms? Please clarify in text.
L138 How were the ASTER data used as ancillary information? Please specify how these data on slope, aspect, etc. were used here for the purposes of the paper. As I see later that it was used in the random forest, the authors need to introduce to the readers that a random forest approach is used (and conduct a literature review noting the utility of random forest and limitations).
L142 Figure 1 would be much clearer if it were a vertical panel of a) over b) versus a) next to b). Please modify. Also please change the caption to clarify that that images were not necessarily ‘clear sky’ but did otherwise pass the CFMask filter. See, e.g., L395.
L165 The JRC_GSW data layer does not identify wetlands per se but identifies inundated pixels. Therefore it is inaccurate to say that the JRC captured “wetlands around rivers, ponds, etc.” because the data layer would include rivers and ponds – or any pixel that was deemed to be inundated by the Pekel et al. (2016) algorithm. Please revise to acknowledge these data from Pekel identify inundated pixels. Note this also comes up with L281 wherein the authors state they are “excluding permanent water bodies”. Why? Permanent water bodies are a massive abundance of the global wetland data layers (e.g., in addition to the Ramsar Convention definition used earlier, see also
Davidson, N. C. 2014. How much wetland has the world lost? Long-term and recent trends in global wetland area. Marine and Freshwater Research 65: 934-941
Dixon, M. J. R. et al. 2016. Tracking global change in ecosystem area: the Wetland Extent Trends index. Biological Conservation 193: 27-35
Hu, S. et al. 2017. Global wetlands: Potential distribution, wetland loss, and status. Science of the Total Environment 586: 319-327
L169 Table 1 – considering this product is a global data layer, it would be useful to the readers to see the relative abundance of wetlands that each of these named datasets have identified. Furthermore, it’s important to note if indeed these are global products (versus near-global products, such as those within the latitudinal bands of 60N and 60S, for instance). Also convert the arc-seconds to meters (at the equator) for consistency between the data products.
L189 How many of the 18,701 data validation points did NOT have complete agreement between the five validation experts? Noting here that 8,355 points were used to discern amongst the seven classes of wetlands. Relative to the other possible ways to assess their study – and convince people to use it – this number of validation points is very small. Too small, by my assessment.
L207 There are many wetland definitions. That the Ramsar definition is quoted, noting that it includes waters to the depth of 6 m, suggests that open waters should be a wetland type in this analysis. I recognize that flooded flats – located along rivers and lakes – are included. But what of lakes themselves? Ponds? Smaller waters that are important to the global wetland data layer? Are these considered lakes? This is an important factor to consider when assessing global wetland coverage. Another consideration would be submergent vegetation. The marsh class is noted as including grasses, herbs, and low shrubs. What about, say, ponds covered with Nymphaea spp (lily pads)? What about Potamogeton spp. growing submersed in the water? Are these not wetland species? Wetland scientists would say they are. Here’s a good reference in re: this discussion:
Richardson, D. C., et al. 2022. A functional definition to distinguish ponds from lakes and wetlands. Scientific Reports 12(1): 10472.
L252 Provide a number of LS images used for this analysis.
L257 Clarify – 50 km buffer along the coastal zone between 60N – 90N are salt marsh? That seems to be quite excessive, a 50 km buffer. Please clarify.
L258 What’s the proportion of overlap between the different data layers? A spatial correlation table/matrix should be presented to the readers (see, e.g., Tootchi et al. 2019, supplemental information Table S1).
L270 These data were imported…and what was done with them?
L296 The GLWD data are at 1 km pixel. How did the authors include 1 km data plus all the 30-m data products? What’s the final resolution of these data? Also, what’s the proportion of the overlap between them (a spatial correlation table/matrix would be interesting here).
L338 This is the first time that the use of random forest was noted. In L138, I questioned how the ASTER data were used – I suggest revising the methods to introduce the reader early on to the overall approach (i.e., letting them know that the RF algorithm was used).
L341 Figure 4 the use of the ASTER DEM includes slope and aspect? Or was slope and DEM used? If the DEM was used, what information within the DEM was used? See, e.g., L400. Furthermore, the Landsat and Sentinel data were used for identifying inundated pixels. NOT for identifying water levels. I recommend changing the heading title in 4.1 as well.
L393 Ultimately, why were five LS clusters chosen versus three or just the one? Was parsimony considered in the analyses?
L404 This assumes that the maximum extent of the coastal wetlands (equation 1 for mangroves) has zero omission error. I understand why this was done, yet it requires explanation and accounting for the reader here and possibly in the Discussion section as well.
L410 The local adaptive modeling section is too quickly glossed over. Explain more on how this was done. How were the data trained? How many of the 961 5x5 tiles had zero coverage of wetlands (e.g., mid-ocean tiles)? What were the specifications of the training here? It would be hard for others to replicate the process based on the data provided thus far.
L413 What statistical program was used to conduct the RF analyses? Furthermore, while RF may have advantageous, it also has detractions. Please introduce the “obvious advantageous” for those who are not aware as well as mention some of the drawbacks.
L435 Note that 18k samples were analyzed across the globe. Consider the relative dearth noted in Figure 2 (see summary above).
L461 The authors need to introduce Figures 7-10 before introducing Figure 11.
L538 The point behind the Ramsar Convention’s of 6 m was to address depths that diving birds were expected/known to use aquatic systems. It is disingenuous to state that all permanent water bodies have depths >=6 m. This is a possibly fatal flaw in this analysis.
L555 It would be good to see the analyses done in Table 4 for these two areas shown in Figure 10. For instance, the authors have chosen to not include permanent water as a wetland type but yet show ‘water body’ in their panel map, which implies it was correctly mapped yet it is not a land use type they map.
L576 Figure 10 the panel caption for GWL_FCS30 doesn’t match the panel (GWM_FCS30).
L630 These selected training sample results should be in the Results section, not here.
L634 Was this inclusion of steps noted in the methods? I don’t recall it.
L675 These are results and need to be in that section explaining the outcomes of the RF analysis.
- AC3: 'Reply on RC3', Liangyun Liu, 22 Nov 2022
Peer review completion






Journal article(s) based on this preprint
Xiao Zhang et al.
Data sets
GWL_FCS30: global 30 m wetland map with fine classification system in 2020 Liangyun Liu, Xiao Zhang, Tingting Zhao, and Xidong Chen https://doi.org/10.5281/zenodo.6575731
Xiao Zhang et al.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
734 | 303 | 36 | 1,073 | 12 | 15 |
- HTML: 734
- PDF: 303
- XML: 36
- Total: 1,073
- BibTeX: 12
- EndNote: 15
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(2891 KB) - Metadata XML