the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Landsat- and Sentinel-derived glacial lake dataset in the China–Pakistan Economic Corridor from 1990 to 2020
Muchu Lesi
Dan Hirsh Shugar
Jida Wang
Qian Deng
Huayong Chen
Jianrong Fan
Download
- Final revised paper (published on 15 Dec 2022)
- Preprint (discussion started on 01 Feb 2022)
- Supplement to the preprint
Interactive discussion
Status: closed
- 
                     RC1:  'Comment on essd-2021-468', Anonymous Referee #1, 14 Feb 2022
            
            
            
            
                        overall comment: this manuscript is in good shape. But there are two issues that MUST have attention given to them. (i) Terminology..the classes of lakes are not named at all well. I have suggested what to do. (ii) Argument of why we need to detect small lakes.; I a totally unconvinced by the GLOF angle...rather I suggest thinking about the effects of lakes on glaciers and the fact that many of these newly-formed small lakes will become larger with ongoing glacier mass loss. I have offerred a coitation to start these thoughts but really a whole paragraph needs adding. Else the abstract needs a complete overhaul too. specific comments: Line 19: suggest rewording to …’one of a number of flagship projects…’ Line 22, suggest delete ‘critical parameters’ and state ‘…parameters X and Y and Z…’ (list them out) Are these ALL glacial lakes? Or just ice-marginal ones? Supraglacial? Subglacial? Please specify. Add this specification into your methods. Line 24. I suggest to put the resolution(s) after the dataset type. Split sentence into two. One for lakes, one for glaciers, for clarity (because as written it is not clear if OI was for lakes or glaciers or both). Line 30…is this 5 pixel threshold for both Landsat and Sentinel? Please clarify the thresholds for BOTH datasets. Line 31…I’m not sure this is ‘discrepancy’, rather simply a result that can be interpreted to be due to many small lakes. Line 32. Are (very) small lakes important? For hazards/GLOFs? Why? I think you need to discuss/show this….in the main text of the manuscript as well as here in the abstract…else the whole premise of your work is not represented/defended/argued (?!). Line 36…would be more useful to state the types of lakes please. And state the two classifications systems please. Be explicit (!). what is the improved equation?! Name it! Line 37. Potentials is not plural. Remove the ‘s’. Line 48. You really must have to cite Carrivick and Tweed (2016) https://www.sciencedirect.com/science/article/pii/S0921818116301023?via%3Dihub please! here. Furthermore, if you read that paper, the size of lakes producing hazardous GLOFs is reported. Small lakes (like the ones detected by your sentinel analysis v landsat) are not hazardous (!). Line 97. Please explain ‘type’ is this glacier terminus environment? Is it lake dam type? Is it lake position (supraglacial or ice-marginal for example?). Line 173. A glacial lake is one that receives meltwater from a glacier. Of these most are proglacial (beyond the glacier) and can be attached (ice-marginal or ice-contact) or detached from the edge of the glacier. PLEASE correct this terminology. Then say what you do (which means you need to evaluate what sort of lakes you are actually analysing!). Line 186 ‘without any distance limit’…oh come on there must have been some limit?! The catchment or study area boundary at least?! Please evaluate what you have done and report it carefully. Line 207 this info. on mapping units needs to be accurately represented in the abstract. Line 233. This spatial relationship needs to be explicitly named above in the manuscript where I have already queried it. I dislike this classification. See Carrivick and Tweed (2013) https://www.sciencedirect.com/science/article/pii/S027737911300293X for definition of proglacial lakes (my comment for line 173). Supraglacial is a distinct group so that is OK. Proglacial and unconnected are the same/overlap…you need ‘ice-marginal’ and ‘other proglacial’ I think, then ‘other lakes’ as your classes/types. Line 238. The terminology again is wrong here and confusing because mixes position and dam type. See Carrivick and Tweed (2013) https://www.sciencedirect.com/science/article/pii/S027737911300293X . You should have supraglacial, terminus moraine-dammed, lateral moraine dammed, ice-dammed and bedrock-dammed I suggest. Line 318 to 326. I suggest to compare to (and cite) Carrivick and Quincey (2014) who also consider uncertainty v lake area. https://www.sciencedirect.com/science/article/pii/S092181811400054X?via%3Dihub Line 453…so do we need Sentinel images for lake mapping?? If Landsat is doing a good job v sentinel (detection as well as accuracy) then why do we need the extra resolution? What importance do the numerous small lakes have? They are not important volumetrically? Are they important for hazards/GLOFs? (I don’t think so!). I really think the ‘promoted capacity of GLOF risk assessment’ (line 543) needs further elaboration. In contrast, I think a utility of your dataset and indeed your sentinel-based detection of many small lakes is that those small lakes could be the onset of fast-developing proglacial landscapes…and they will likely grow as glaciers diminish further and affect glacier dynamics (see Carrivick et al., 2020 for example https://www.frontiersin.org/articles/10.3389/feart.2020.577068/full ) Citation: https://doi.org/10.5194/essd-2021-468-RC1 - 
                                        
                                     AC1:  'Reply on RC1', Yong Nie, 21 Mar 2022
                            
                            
                            
                            
                                        Overall comment: This manuscript is in good shape. But there are two issues that MUST have attention given to them. (i) Terminology..the classes of lakes are not named at all well. I have suggested what to do. (ii) Argument of why we need to detect small lakes.; I am totally unconvinced by the GLOF angle...rather I suggest thinking about the effects of lakes on glaciers and the fact that many of these newly-formed small lakes will become larger with ongoing glacier mass loss. I have offered a coitation to start these thoughts but really a whole paragraph needs adding. Else the abstract needs a complete overhaul too. [Response] Thank you for your encouragement and constructive comments, which have helped us to improve the quality of this article. We respond to the comments point by point as follows. Regarding the terminology of lake classification systems, we have revised it as suggested. Now, in the first glacial lake classification system, glacial lakes were classified into four types based on their spatial relationship to upstream glaciers: supraglacial, ice-contact, unconnected-glacier-fed lakes, and non-glacier-fed lakes. In the second glacial lake classification system, glacial lakes were classified into five categories (herein named GLCS2) modified based on Yao’s classification system (2018): supraglacial, end-moraine-dammed, lateral-moraine-dammed, glacial-erosion lakes and ice-dammed lakes. Regarding small lakes, we agree that small lakes have little or no hazardous impact due to their limited water release. The focus of this study is to generate a new glacial lake dataset for the CPEC, using 5 pixels as the mapping threshold for both Landsat and Sentinel images. We had to map all the glacial lakes, including small ones. If users want to conduct GLOF hazard risk studies, they have the flexibility to set a minimum threshold, such as greater than 0.05 km2, to eliminate smaller lakes from our dataset. In addition, we argue that small lakes, such as supraglacial lakes, play an important role in glacier and lake evolutions, and affect cryosphere-hydrological changes. Thus our dataset has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology. We have made some changes in the Abstract and main text, and responded to the query of Line32. We have also cited the recommended references and revised the abstract. Specific comments: Line 19: suggest rewording to …’one of a number of flagship projects…’ [Response] Revised as ‘…one of the flagship projects…’ Line 22, suggest delete ‘critical parameters’ and state ‘…parameters X and Y and Z…’ (list them out) Are these ALL glacial lakes? Or just ice-marginal ones? Supraglacial? Subglacial? Please specify. Add this specification into your methods. [Response] Considering the dataset with 17 attributes, it is not suitable to list all of them here. So we decided to list three parameters for example, and revised this sentence to be: ‘An up-to-date, high-quality glacial lake dataset with parameters such as lake type, acquisition date and area, which is fundamental to flood risk assessments and predicting glacier-lake evolutions and cryosphere-hydrological changes…’ This study defines a glacial lake as one that formed as a result of modern or ancient glaciation. All glacial lakes in the study area were mapped according to our definition without any distance limit between lakes and glaciers. So it is not just ice-marginal glacial lakes. The dataset includes supraglacial lakes; however, it does not include subglacial lakes that are not detectable from optical satellite sensors. See section 4.1 and 4.3 for detailed description. Line 24. I suggest to put the resolution(s) after the dataset type. Split sentence into two. One for lakes, one for glaciers, for clarity (because as written it is not clear if OI was for lakes or glaciers or both). [Response] Following the suggestion, we have revised this sentence to be: ‘This dataset includes (1) a glacial lake inventory for the year 2020 at 10 m resolution produced from Sentinel spectral images, and (2) multi-temporal inventories for 1990, 2000, and 2020 produced from 30 m resolution Landsat images.…’ Line 30…is this 5 pixel threshold for both Landsat and Sentinel? Please clarify the thresholds for BOTH datasets. [Response] Yes, the 5-pixel threshold is for both Landsat and Sentinel images. See ‘Landsat derived 2234 glacial lakes in 2020, covering a total area of 86.31±14.98 km2 with a minimum mapping unit of 5 pixels (4500 m2), whereas Sentinel derived 7560 glacial lakes in 2020 with a total area of 103.70±8.45 km2 with a minimum mapping unit of 5 pixels (500 m2).’ Line 31…I’m not sure this is ‘discrepancy’, rather simply a result that can be interpreted to be due to many small lakes. [Response] We agree that the discrepancy cannot be simply interpreted by the differences in total number and area of both Landsat-derived and Sentinel-derived lake dataset. Hence, we performed a comparison between Sentinel-2 and Landsat derived products in section 6.1 and interpreted the discrepancy. It reads: ‘Glacial lakes from Landsat and Sentinel images have a high consistency in number and area with overlap rates from 85.7% to 94.26% for all lakes greater than about 0.0045 km2 (Table 5), implying a good potential for coordinated utility with Landsat archived observation (Figure 11). Lake extents extracted from Landsat and Sentinel images match well for various lake types and sizes (Table 4). The best consistency rate reaches 94% for the glacial lakes between 0.1 km2 and 0.2 km2. The difference in area of glacial lakes extracted from Landsat and Sentinel images generally lies within the uncertainty ranges.’ Also, ‘Spatial resolution of satellite images plays a primary role in the discrepancies in count and area of glacial lakes extracted from Landsat (30 m) and Sentinel (10 m) observations. Due to a finer spatial resolution, Sentinel images can extract more glacial lakes and more accurate extents than those from Landsat images. We set the same 5 pixels as the minimum mapping unit for both Landsat and Sentinel images, which corresponds to a minimum area of 0.0045 km2 and 0.0005 km2, respectively. The minimum mapping area results in generating nearly 5000 more lakes from Sentinel images than from Landsat images, causing the greatest discrepancy in number of the two glacial lake products (Table 5), such as Figure 12a. ’ Line 32. Are (very) small lakes important? For hazards/GLOFs? Why? I think you need to discuss/show this….in the main text of the manuscript as well as here in the abstract…else the whole premise of your work is not represented/defended/argued (?!). [Response] We agree that small lakes have little or no hazardous impact due to their limited water release. The focus of this study is to generate a new glacial lake dataset based on Landsat and Sentinel images in the CPEC, using 5 pixels as mapping threshold for both Landsat and Sentinel images. We had to map all the glacial lakes, including small ones. If users want to conduct GLOF hazard risk studies, they can set a minimum threshold, such as greater than 0.05 km2, to select a subset of lakes from our dataset to eliminate the impact of small lakes. Small lakes, such as supraglacial lakes play an important role in glacier and lake evolutions, and affect cryosphere-hydrological changes. Thus our dataset has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology. We have made some changes in the Abstract and main text. It now reads: ‘An up-to-date high-quality glacial lake dataset with parameters such as lake type, acquisition date and area, which is fundamental to flood risk assessments and predicting glacier-lake evolutions and cryosphere-hydrological changes…’ ‘This comprehensive glacial lake dataset has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology…’ ‘Small lakes such as supraglacial lakes play an important role in understanding meltwater runoff and supraglacial drainage systems (Liu and Mayer, 2015; Miles et al., 2018). Our dataset can be used not only for GLOFs evaluation, but also for glacial lake evolution simulation and glacio-hydrological prediction. ’ Line 36…would be more useful to state the types of lakes please. And state the two classifications systems please. Be explicit (!). what is the improved equation?! Name it! [Response] The two classifications systems contain a total of nine types of glacial lakes, so specifying all types in the abstract will take up too much space. To keep the abstract concise and present improved equation, we changed this sentence to: ‘ A range of critical attributes have been generated in the dataset, including lake types and mapping uncertainty estimated by an improved Hanshaw’s equation’ Line 37. Potentials is not plural. Remove the ‘s’. [Response] We have changed this sentence to ‘This comprehensive glacial lake dataset has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology…’ Line 48. You really must have to cite Carrivick and Tweed (2016) https://www.sciencedirect.com/science/article/pii/S0921818116301023?via%3Dihub please! here. Furthermore, if you read that paper, the size of lakes producing hazardous GLOFs is reported. Small lakes (like the ones detected by your sentinel analysis v landsat) are not hazardous (!). [Response] We agree and cited the reference from Carrivick and Tweed (2016). It now reads: ‘…glaciers and impacting downstream ecosystem services, agriculture, hydropower and other socioeconomic values (Carrivick and Tweed, 2016; Nie et al., 2021).’ About small lakes, we have responded to the previous inquiries. Line 97. Please explain ‘type’ is this glacier terminus environment? Is it lake dam type? Is it lake position (supraglacial or ice-marginal for example?). [Response] As suggested, we have revised this to be: ‘Dam type classification of glacial lakes provides a crucial attribute for glacier-lake interactions and risk assessment…’. Line 173. A glacial lake is one that receives meltwater from a glacier. Of these most are proglacial (beyond the glacier) and can be attached (ice-marginal or ice-contact) or detached from the edge of the glacier. PLEASE correct this terminology. Then say what you do (which means you need to evaluate what sort of lakes you are actually analysing!). [Response] We agree to divide proglacial lakes into ice-contact and unconnected-glacier-fed (detached) lakes in this study. We revised the classification system of glacial lakes. See the section 4.3. We consider a glacial lake as one that formed as a result of modern or ancient glaciation. In this study, all glacial lakes were mapped according to this definition and are attributed using the two classification systems. Line 186 ‘without any distance limit’…oh come on there must have been some limit?! The catchment or study area boundary at least?! Please evaluate what you have done and report it carefully. [Response] Thank you for this comment. Our dataset is indeed limited by the study area boundary. To avoid any misunderstanding, this sentence was changed to ‘All glacial lakes in the study area were mapped according to our definition regardless of the distance to glaciers.’ Line 207 this info. on mapping units needs to be accurately represented in the abstract. [Response] We agree, and have revised this sentence to be: ‘The results show that Landsat derived 2234 glacial lakes in 2020, covering a total area of 86.31±14.98 km2 with a minimum mapping unit of 5 pixels (4500 m2), whereas Sentinel derived 7560 glacial lakes in 2020 with a total area of 103.70±8.45 km2 with a minimum mapping unit of 5 pixels (500 m2).’ Line 233. This spatial relationship needs to be explicitly named above in the manuscript where I have already queried it. I dislike this classification. See Carrivick and Tweed (2013) https://www.sciencedirect.com/science/article/pii/S027737911300293X for definition of proglacial lakes (my comment for line 173). Supraglacial is a distinct group so that is OK. Proglacial and unconnected are the same/overlap…you need ‘ice-marginal’ and ‘other proglacial’ I think, then ‘other lakes’ as your classes/types. [Response] Thank you for this valuable suggestion. As we responded earlier, we agree to divide proglacial lakes into ice-contact and unconnected-glacier-fed (detached) lakes. We have revised this consistently throughout the main text, figures, tables and attribute of our glacial lake dataset. This sentence was changed to ‘…glacial lakes were classified into four types based on their spatial relationship to upstream glaciers: supraglacial, ice-contact, unconnected-glacier-fed lakes, and non-glacier-fed lakes according to Gardelle et al. (2011) and Carrivick et al. (2013).’ Line 238. The terminology again is wrong here and confusing because mixes position and dam type. See Carrivick and Tweed (2013) https://www.sciencedirect.com/science/article/pii/S027737911300293X . You should have supraglacial, terminus moraine-dammed, lateral moraine dammed, ice-dammed and bedrock-dammed I suggest. [Response] Considering the formation mechanism and dam properties of glacial lakes, the second glacial lake classification system was established via modifying Yao’s classification system (2018). According to your suggestion, we have revised the terminology of the classification system to ice-dammed, end-moraine-dammed, lateral-moraine-dammed and supraglacial lakes. Glacial-erosion lakes contain both bedrock-dominated dam and top-moraine-mixed dam, so we prefer to use glacial-erosion lakes instead of bedrock-dammed lakes. It now reads: ‘Alternatively, combining the formation mechanism of glacial lakes and the properties of natural dam features, glacial lakes were classified into five categories (herein named GLCS2) modified from Yao’s classification system (2018): supraglacial, end-moraine-dammed, lateral-moraine-dammed, glacial-erosion lakes and ice-dammed lakes.’ Line 318 to 326. I suggest to compare to (and cite) Carrivick and Quincey (2014) who also consider uncertainty v lake area. https://www.sciencedirect.com/science/article/pii/S092181811400054X?via%3Dihub [Response] We cited the reference from Carrivick and Quincey (2014). It now reads ‘Lake perimeter and displacement error are widely used to estimate the uncertainty of glacier and lake mapping from satellite observation (Carrivick and Quincey, 2014; Hanshaw and Bookhagen, 2014; Wang et al., 2020).’ The difference in uncertainty estimation between Carrivick’s and Hanshaw's methods is that Carrivick assumes an uncertainty of ±1 pixel, while Hanshaw assumes an uncertainty of ±0.5 pixels and counts the number of edge pixels. In this study, we discovered and solved the problem of repeatedly calculated edge pixels. Considering that the mean lake size in the study area is smaller than that in the Greenland, we prefer to choose the improved Hanshaw’s equation to estimate the mapping uncertainty. Line 453…so do we need Sentinel images for lake mapping?? If Landsat is doing a good job v sentinel (detection as well as accuracy) then why do we need the extra resolution? What importance do the numerous small lakes have? They are not important volumetrically? Are they important for hazards/GLOFs? (I don’t think so!). I really think the ‘promoted capacity of GLOF risk assessment’ (line 543) needs further elaboration. [Response] We believe Sentinel images do offer their unique benefits in mapping glacier lakes, owing to their finer spatial resolution, increasing capacity of revisit observation and accurately depicting lake boundaries with a lower uncertainty. We further clarified these as: ‘Due to a finer spatial resolution, Sentinel images can extract more glacial lakes and more accurate extents than those from Landsat images….Meanwhile, Sentinel images are able to depict boundaries of glacial lake with a lower uncertainty (Figure 12b-d). For example, some small islands and narrow channels (Figure 12b and c) were mapped from Sentinel imagery that were unable to be detected in Landsat imagery.’ Regarding small lakes, we have responded to a similar query earlier: ‘Small lakes such as supraglacial lakes play an important role in understanding meltwater runoff and supraglacial drainage systems (Liu and Mayer, 2015; Miles et al., 2018). Our dataset can be used not only for GLOFs evaluation, but also for glacial lake evolution simulation and glacio-hydrological prediction. ’ About ‘promoted capacity of GLOF risk assessment’ (line 543), we revised this sentence to be ‘Herein, we provide an up-to-date glacial lake dataset derived from both Landsat and Sentinel observations, which further increased the availability of glacial lake datasets for GLOFs risk assessment, predicting glacier evolutions and understanding cryosphere-hydrological changes in the context of climate change.’ In contrast, I think a utility of your dataset and indeed your sentinel-based detection of many small lakes is that those small lakes could be the onset of fast-developing proglacial landscapes…and they will likely grow as glaciers diminish further and affect glacier dynamics (see Carrivick et al., 2020 for example https://www.frontiersin.org/articles/10.3389/feart.2020.577068/full ) [Response] Thank you for your affirmation and encouragement. The Sentinel-derived lake dataset has a wider potential than Landsat-derived dataset to be used in studies on proglacial landscape change and glacier dynamic assessment. The recommended reference is important and cited in the main text. References Carrivick, J.L., Tweed, F.S.: Proglacial lakes: character, behaviour and geological importance. Quaternary Sci. Rev., 78: 34-52, https://doi.org/10.1016/j.quascirev.2013.07.028, 2013. Carrivick, J.L., Tweed, F.S.: A global assessment of the societal impacts of glacier outburst floods. Global Planet. Change, 144: 1-16, https://doi.org/10.1016/j.gloplacha.2016.07.001, 2016. Gardelle, J., Arnaud, Y., Berthier, E.: Contrasted evolution of glacial lakes along the Hindu Kush Himalaya mountain range between 1990 and 2009. Global Planet. Change, 75: 47-55, https://doi.org/10.1016/j.gloplacha.2010.10.003, 2011. Liu, Q., Mayer, C.: Distribution and interannual variability of supraglacial lakes on debris-covered glaciers in the Khan Tengri-Tumor Mountains, Central Asia. Environ. Res. Lett., 10: 014014 2015. Miles, E.S., Watson, C.S., Brun, F., Berthier, E., Esteves, M., Quincey, D.J., Miles, K.E., Hubbard, B., Wagnon, P.: Glacial and geomorphic effects of a supraglacial lake drainage and outburst event, Everest region, Nepal Himalaya. The Cryosphere, 12: 3891-3905, https://doi.org/10.5194/tc-12-3891-2018, 2018. Nie, Y., Pritchard, H.D., Liu, Q., Hennig, T., Wang, W., Wang, X., Liu, S., Nepal, S., Samyn, D., Hewitt, K., Chen, X.: Glacial change and hydrological implications in the Himalaya and Karakoram. Nature Reviews Earth & Environment, 2: 91-106, https://doi.org/10.1038/s43017-020-00124-w, 2021. Yao, X., Liu, S., Han, L., Sun, M., Zhao, L.: Definition and classification system of glacial lake for inventory and hazards study. J. Geogr. Sci., 28: 193-205, https://doi.org/10.1007/s11442-018-1467-z, 2018. Citation: https://doi.org/10.5194/essd-2021-468-AC1 
 
- 
                                        
                                     AC1:  'Reply on RC1', Yong Nie, 21 Mar 2022
                            
                            
                            
                            
                                        
- 
                     RC2:  'Comment on essd-2021-468', Anonymous Referee #2, 14 Feb 2022
            
            
            
            
                        The glacial lake dataset presented in 'Landsat and Sentinel-derived glacial lake dataset in the China-Pakistan Economic Corridor from 1990 to 2020' identifies and classifies lakes in the CPEC region of High-mountain Asia from three time steps - 1990, 2000 and 2020. Lakes are identified from Landsat and Sentinel-2 optical satellite imagery using a semi-automated approach that utilises the well-established NDWI (Normalised Difference Water Index) mehod. Statistics and analysis of lake abundance and size distribution are presented, along with changes in lakes over the course of the time-series and inter-comparison of the Landsat- and Sentinel-derived lake outlines. The dataset is then compared to alike glacial lake datasets from the same region, in order to examine and evaluate discrepancies. This is a valuable dataset that I foresee will be readily used by the cryosphere and hydrology research communities. In particular, the use of two highly-detalied lake classification systems (based on Gardelle et al., 2011, and a modified version of Yao, 2018) is a unique aspect of the dataset that is insightful alongside the general size and abundance information. This type of classification is seldom seen in glacial lake datasets, and reflects the thoroughness of the dataset. The manuscript is structured in a clear and concise manner, guiding the reader through the dataset methods and description, results and statistics from the datastes, followed by an evaluation of the dataset scope and certainty. Several key points need to be addressed, which are detailed below, largely regarding the dataset itself and the definition of a glacial lake. The comparison to alike datasets is flawed given that many of the discrepancies are due to the differing definitions of a glacial lake, rather than the classification method itself. Once these major revisions have been addressed, then the manuscript and associated dataset will be a great addition to ESSD. 
 Major comments1. Dataset transparency 
 A large part of the presented dataset is manually derived - metadata generation, georeferencing, and outline modifications. This can make reproducibility challenging. I would like to see a version of this dataset provided in the supplementary material that presents the dataset before manual intervention/inputs. Therefore, readers can see the dataset before and after manual modifications, and tangibly distinguish the automated and user-defined components of the methodology presented.
 2. Definition of a glacial lake
 A large focus in the manuscript is glacier-related hazards, specifically GLOFs and draining lakes that are either on or share a boundary with a glacier. However, the dataset includes lakes that are not influenced or effect glacier dynamics, such as lakes that are hydrologically unconnected from a glacier. The abundance of glaciologically-unconnected lakes markedly influences the identified trends in the dataset, such as the visible influence on lake abundance in GLCS1. In addition, the dataset lacks a spatial filter relative to the ice margin. If indeed the aim of the dataset is to inform on glacier-related hazards, this dataset should focus exclusively on active glacial lakes, rather than active and ancient lakes.Another aspect is the small threshold size used for the glacial lake dataset. Again, if the focus of this paper is glacier-related hazards then small lakes (<0.05 sq km; Shugar et al., 2021) are largely irrelevant to this study as they have limited GLOF impact. These small lakes make up over 80% of the dataset and heavily influence the identified temporal trends. In order to overcome this, I would suggest shifting the focus of the manuscript away from glacier-related hazards and framing the manuscript under the importance of freshwater transfer and storage in the region. Whilst Section 4.1 adequately outlines the definition of glacial lake, I think a brief definition should be defined early in the manuscript to assist framing the focus of the manuscript. Additionally, I would like to see a passage in the results/discussion that analyses active glacial lakes, under which their relation to glacier-hazards and GLOFs can be addressed. The comparison to other glacial lake datasets should be revisited to provide an adequate examination that focuses on discrepancies in the classification methodologies rather than the definition of a glacial lake. 
 3. Broader overview of remote sensing classification methods
 Optical classification methods are solely focused on in the introduction section of the manuscript (L86-103), which falsely represents them as the sole classification method readily used in remote sensing. I would like to see the overview include other remote sensing classification methods, namely SAR backscatter classification, but also other alternative approaches such as from hydrological sink analysis and from land surface temperature.I am not sure if there are any studies in this region where alternative classification methods are used to detect water bodies; but if there are any then I think they would be a great addition to the dataset comparison section to serve as an inter-comparison of methodologies beyond alike optical classification approaches. Specific comments L41-66: I think this a detailed and concise overview of the importance of glacial lakes and GLOFs in a regional context. However, I think a global perspective is needed to thoroughly illustrate the significance of this study - especially if you are referring to global studies of glacial lakes, such as Shugar et al. (2021). Please include a sentence or two near the beginning about glacial lakes and GLOFs globally (i.e. importance, general trends etc.) L67-85: You largely focus on remote sensing efforts in HMA regional studies, but there are also references to papers from other regions such as Greenland and the Alps. Either open up this section as an overview of remote sensing studies from all regions, or keep it refined to the HMA region. There have been many regional studies that have been published recently (e.g. Alaska, Rick et al., 2022; Greenland, How et al., 2021), not just in HMA, so I would recommend widening this section to outline the methods in a general context, rather than focusing on HMA. L92: What exactly do you mean by object-oriented classification here? This term is generally used in programming rather than in reference to a classification approach. Please change this, or clarify what is meant here; preferably with a more suitable term. L117-119: Are these sub-basins divided by catchments and/or watershed? What determines these sub-basins? L132-170: Great outline of data sources. L178: Why are landslide-dammed lakes irrelevant to glaciation? Can some glacial lakes also be landslide-dammed lakes? L199: Change 'the method automatically generated the histogram...' to 'the method calculated the histogram...' L201: Change 'interactively' to 'manually'. In reference to this comment and the last, I think it needs to be clear in the methodology how this approach is 'semi-automated'. L224-228: False classifications from cloud and topographic shadows can be eliminated with cloud and terrain masking, which are well-established remote sensing methods in land classification. Why did you choose not to include this in the automated component of your workflow? Table 1: The characteristics of a proglacial lake should specify that these lakes share a boundary with the ice margin, according to your definition - 'shared boundary' is a better description than 'connected with glaciers' as this could be intepretted as hydrologically connected instead of physically adjacent. Table 2: There must be occurrences where a lake's formation and/or dam material properties are ambiguous (especially in relation to GLCS2), even from Google Earth imagery. I see in the dataset that there are no instances where a lake's classification is determined as uncertain; even though you state later on that occassional misclassifications are inevitable (L561). In such instances of ambiguous lake types, how do you decide the classification? L272-273: Please provide references to studies that use lake perimeter and displacement error to estimate uncertainty. L270-295: Repetition with the corresponding uncertainty estimation section in the supplementary materials. I would suggest refining this section in the main manuscript, and keeping the full description for the supplementary materials. L294 and L305: Change figure names from Figure S3a/S3b to Figure 3a/3b, unless you would rather move them to the supplementary materials. Figure 4: This is a great figure. Please add labelling to the figure to indicate that one set of graphs is from Landsat and the other from Sentinel - you only understand this once you have read to the end of the caption, and it needs to be signposted earlier. L339: 'proglacier' >> 'proglacial' Figure 5: The four maps are somewhat repetitive and it is difficult to see differences between the Sentinel and Landsat lake sizes/abundance from this. I would suggest changing this figure to have an overview map on the left showing all detected lakes from both methods, and a series of inset maps to the right displaying a closer look at certain regions of interest; divided into Sentinel and Landsat lakes. Also, maybe change the outline colour of the lake points to a darker shade, as it is hard to identify the lake points in the current figure. L352: This is a hanging line, and I am not sure which panels and sub-graphs are being referred to here. Does this belong somewhere else or is this a fault with the journal formatting? Figure 7 and 8: These graphs are very effective at showing changes in lakes - a refreshing take on presenting this type of dataset. L382: '...while the area grew by a less extent (1.21 km2 or 1.42%).' >> '...while the area grew by 1.21 km2 (or 1.42%).' L408: '...including being stable for Shingo...' >> '...including a stable trend for Shingo...' L411-412: 'The total numbers of Kashgar and Hunza basins decreased...' >> 'The total number of lakes in Kashgar and Hunza basins decreased...' L426-27 and Table 5: Can you include some statistics on the link between lake size and % overlap between the Sentinel-2 and Landsat counts? - this would help gauge how much spatial resolution (differentiated from image acquisition) affects lake classification in this study. L441-443: Are these lakes persistently large or just at a particular time step? Do you have evidence as to why they are disproportionally large? L481-485: Can you state here the number of instances where overlapping acquisitions were acqiured? L492: 'approximate' >> 'approximately' L503-504: Are there any studies that present glacial lake datasets derived from Sentinel-2? If so, please reference them here. If not, then change this to state that there are no comparable datasets, rather than a scarce number of datasets. L515-521: I think, similar to your suggestion regarding landslide-dammed lakes, a likely answer is that Wang et al. focus more on glacier-connected lakes, given that they adopt a 10 km buffer to filter out unconnected lakes. And therefore they identify an increasing trend, possibly reflective of a subset of your lake types. Could you subset your lake dataset to match the lakes identified by Wang et al., and examine whether you also see this increasing trend evident in your subsetted dataset? (And perhaps also include landslide-dammed lake for the purpose of this comparison?) L530: "Zhang's dataset..." >> "The dataset from Zhang et al. (2015) ..." L531-536: I am unsure how this study and Zhang et al. could have discrepancies in image availability when both studies are classifying lakes from the same satellite image collection (Landsat). Some clarification is needed here to demonstrate how your dataset could classify these lakes when Zhang et al. could not. L538-544: Discrepancies in glacial lake datasets can be because of minimum lake size, classification method (i.e. not just optical), image acqusition and post-filtering. However, if the purpose of this dataset is to 'further promote the capacity of GLOF risk assessment and predicting glacier evolutions' then I am unsure why there is no spatial filter (relative to ice margin position) adopted to remove lakes that are unconnected to the glacial system. I think the focus of this study needs to be shifted (as stated earlier in major comments), and further analysis needs to be presented that demonstrates changes in GLOF and glacier-fed lakes specifically (i.e. filtered by lake type and size) - see major comments for full details. L550: 'Even though an capacity of repetitive observations...' >> 'Even though the capacity of repeat observations...' L557: '...inter- and intra-annual changes (Liu et al., 2020) in glacial lake dataset of each time period...' >> '...inter- and intra-annual changes (Lie et al., 2020) for each time period...' L545-564: This is a valuable section to include in the study. The temporal range of these datasets and limited image availability (especially in formative years) will not adequately capture the dynamic nature of draining glacial lakes; and therefore such datasets serve as a gauge of long-term, regional trends rather than individual lake change. L565-575: It is great to hear that this work will be continued, and new time steps will be included in the dataset in the future. L584: '...spatial-temporal changes at longer time scale...' >> '...spatial-temporal changes at a longer time scale...' L584: 'observation' >> 'observations' L585: 'started' >> 'starting' L595-596: See previous comment from L538-544 regarding this statement. L602: 'values for cryospheric-hydrology research, assessment of...' >> 'value to cryospheric-hydrology research, the assessment of...' Citation: https://doi.org/10.5194/essd-2021-468-RC2 - 
                                        
                                     AC2:  'Reply on RC2', Yong Nie, 21 Mar 2022
                            
                            
                            
                            
                                        Referee #2 The glacial lake dataset presented in 'Landsat and Sentinel-derived glacial lake dataset in the China-Pakistan Economic Corridor from 1990 to 2020' identifies and classifies lakes in the CPEC region of High-mountain Asia from three time steps - 1990, 2000 and 2020. Lakes are identified from Landsat and Sentinel-2 optical satellite imagery using a semi- automated approach that utilises the well-established NDWI (Normalised Difference Water Index) method. Statistics and analysis of lake abundance and size distribution are presented, along with changes in lakes over the course of the time-series and inter- comparison of the Landsat- and Sentinel-derived lake outlines. The dataset is then compared to alike glacial lake datasets from the same region, in order to examine and evaluate discrepancies. This is a valuable dataset that I foresee will be readily used by the cryosphere and hydrology research communities. In particular, the use of two highly-detailed lake classification systems (based on Gardelle et al., 2011, and a modified version of Yao, 2018) is a unique aspect of the dataset that is insightful alongside the general size and abundance information. This type of classification is seldom seen in glacial lake datasets, and reflects the thoroughness of the dataset. The manuscript is structured in a clear and concise manner, guiding the reader through the dataset methods and description, results and statistics from the datastes, followed by an evaluation of the dataset scope and certainty. Several key points need to be addressed, which are detailed below, largely regarding the dataset itself and the definition of a glacial lake. The comparison to alike datasets is flawed given that many of the discrepancies are due to the differing definitions of a glacial lake, rather than the classification method itself. Once these major revisions have been addressed, then the manuscript and associated dataset will be a great addition to ESSD. [Response] We thank the reviewer for her/his positive assessment of the work and for her/his constructive comments, which have helped us to improve the quality of this article. We respond to the comments point by point as follows. Major comments1. Dataset transparency A large part of the presented dataset is manually derived - metadata generation, georeferencing, and outline modifications. This can make reproducibility challenging. I would like to see a version of this dataset provided in the supplementary material that presents the dataset before manual intervention/inputs. Therefore, readers can see the dataset before and after manual modifications, and tangibly distinguish the automated and user-defined components of the methodology presented. [Response] In this study, we used a human-interactive, semi-automated lake mapping method (Wang et al., 2014; Nie et al., 2017, 2020) to accurately extract glacial lake extents. The used method is flexible and of high reproducibility to map lake boundary by tuning NDWI threshold while screening the NDWI histogram, and automatically generating vector polygons. More detailed information can be seen in our previous publications, such as Wang et al., 2014. In the process of interactive lake mapping, manual inputs refer to the drawing of user-defined region of interest (ROI) and tuning the NDWI threshold in each ROI, whereas calculating the histogram of NDWI and converting raster lake extent to vector polygon were automated. Our lake dataset contains pixelated polygons, rather than manually digitized polygons. We do not provide the dataset before and after manual modifications because of the absence of manually-modified lake polygons. To avoid misunderstanding, we define the method as a human-interactive and semi-automated lake mapping method and made some revision. It now reads: ‘Specifically, the method calculated the NDWI histogram based on the pixels with each user-defined and manually-drawn region of interest. The NDWI threshold that separates lake surface from land was interactively determined by screening the NDWI histogram against the lake region in the imagery (Wang et al., 2014; Nie et al., 2020). This way, the determined NDWI threshold can be well-tuned to adapt various spectral conditions of the studied glacier lakes. The raster lake extents segmented by the thresholds were then automatically converted to vector polygons.’ 
 2. Definition of a glacial lakeA large focus in the manuscript is glacier-related hazards, specifically GLOFs and draining lakes that are either on or share a boundary with a glacier. However, the dataset includes lakes that are not influenced or effect glacier dynamics, such as lakes that are hydrologically unconnected from a glacier. The abundance of glaciologically-unconnected lakes markedly influences the identified trends in the dataset, such as the visible influence on lake abundance in GLCS1. In addition, the dataset lacks a spatial filter relative to the ice margin. If indeed the aim of the dataset is to inform on glacier-related hazards, this dataset should focus exclusively on active glacial lakes, rather than active and ancient lakes. [Response] The focus of this study is to provide a complete glacial lake dataset based on Landsat and Sentinel images, which has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology. We made some modifications to be more explicit about the goals of this study and the importance of our dataset. For example, we clarified that ‘This comprehensive glacial lake dataset has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology…’ We agree that most draining lakes that are either on or share a boundary with a glacier. If any studies want to conduct GLOFs risk assessment, one can select the candidate lakes by setting threshold in lake types and areas that are available in our dataset attributes. Lakes, including hydrologically-unconnected from a glacier, inactive and ancient lakes, formed as a result of glaciology and still impact land surface hydrology. Thus, all glacial lakes have to be mapped in lake inventories. Both Randolph Glacier Inventory version 6.0 (Pfeffer et al., 2014; RGI Consortium, 2017) and the Glacier Area Mapping for Discharge from the Asian Mountains (GAMDAM) glacier inventory (Sakai, 2019) were used to affirm the ice margin during classifying lake type. It now reads: ‘These two glacier datasets were used to determine glacial lake types, such as ice-contact, ice-dammed and unconnected-glacier-fed lakes.’ Another aspect is the small threshold size used for the glacial lake dataset. Again, if the focus of this paper is glacier-related hazards then small lakes (<0.05 sq km; Shugar et al., 2021) are largely irrelevant to this study as they have limited GLOF impact. These small lakes make up over 80% of the dataset and heavily influence the identified temporal trends. [Response] This comment was echoed by reviewer #1. We agree that small lakes have little or no hazardous impact due to their limited water release. The focus of this study is to generate a new glacial lake dataset for the CPEC, using 5 pixels as the mapping threshold for both Landsat and Sentinel images. We opted to map all glacial lakes, including small ones. If users want to conduct GLOFs hazard risk studies, they have the flexibility to set a threshold, such as greater than 0.05 km2, to select lakes using our dataset to eliminate the impact of small lakes. Small lakes such as supraglacial lakes play an important role in understanding meltwater runoff and supraglacial drainage systems. Thus our dataset has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology. We have made some changes in the Abstract and main text, it now reads: ‘An up-to-date high-quality glacial lake dataset with parameters such as lake type, acquisition date and area, which is fundamental to flood risk assessments and predicting glacier-lake evolutions and cryosphere-hydrological changes…’ ‘This comprehensive glacial lake dataset has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology…’ ‘Small lakes such as supraglacial lakes play an important role in understanding meltwater runoff and supraglacial drainage systems (Liu and Mayer, 2015; Miles et al., 2018). Our dataset can be used not only for GLOFs evaluation, but also for glacial lake evolution simulation and glacio-hydrological prediction.’ In order to overcome this, I would suggest shifting the focus of the manuscript away from glacier-related hazards and framing the manuscript under the importance of freshwater transfer and storage in the region. Whilst Section 4.1 adequately outlines the definition of glacial lake, I think a brief definition should be defined early in the manuscript to assist framing the focus of the manuscript. Additionally, I would like to see a passage in the results/discussion that analyses active glacial lakes, under which their relation to glacier- hazards and GLOFs can be addressed. The comparison to other glacial lake datasets should be revisited to provide an adequate examination that focuses on discrepancies in the classification methodologies rather than the definition of a glacial lake. [Response] Thanks for this suggestion. The focus of this study is to provide a complete glacial lake dataset based on Landsat and Sentinel images, which has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology. Considering freshwater transfer and storage in the region is part of the study of cryospheric hydrology. We put the definition of glacial lake in the section 4 Glacial lake inventory methods. This section is composed of 4.1 Definition of glacial lakes, 4.2 Interactive lake mapping, 4.3 Classification of glacial lakes, 4.4 Attributes of glacial lake data and 4.5 Improved uncertainty estimating method. As suggested, we add one paragraph in the discussion to analyze active glacial lakes related to GLOFs. It now reads: ‘The high consistency of Sentinel-2 and Landsat derived glacial lake products in 2020 assures the value of our lake dataset. Taking the usage in assessing GLOFs as an example, we set 0.05 km2 as the area threshold to select object lakes, including ice-contact lakes and ice-dammed lakes that are the most active lakes and source lakes of GLOFs in the CPEC (Nie et al., 2021). A total of 24 and 29 ice-contact lakes were selected from Landsat and Sentinel-derived products, respectively. Among them, there were 4 ice-dammed lakes from the Landsat-derived product and 5 from the Sentinel-derived product. These selected lakes can be used for GLOFs hazard evaluation. Because of the high consistency between our Landsat and Sentinel-based mappings, users may have the flexibility to customize the lake size criteria to facilitate their specific purposes. ’ Regarding to the comparison to other glacial lake datasets, it is arduous to compare discrepancies between glacial lake datasets in the classification methodologies due to their low overlap. The highest overlapping rate is only 74% in count with Wang’s data (2020), followed by Chen’ highest overlapping rate of 45% in count. Zhang’s and Shugar’s dataset do not include lake type attribute. Hence, we can not make a comparison in the classification methodologies. 
 3. Broader overview of remote sensing classification methodsOptical classification methods are solely focused on in the introduction section of the manuscript (L86-103), which falsely represents them as the sole classification method readily used in remote sensing. I would like to see the overview include other remote sensing classification methods, namely SAR backscatter classification, but also other alternative approaches such as from hydrological sink analysis and from land surface temperature. [Response] We added other classification methods besides optimal remote sensing. It new reads: ‘Backscatter images from Synthetic Aperture Radar (SAR) (Wangchuk and Bolch, 2020; How et al., 2021) were used to remove the impact of cloud cover for lake mapping. Besides, other approaches such as hydrological sink detection using DEM (How et al., 2021) and land surface temperature-based detection method (Zhao et al., 2020) were also used for lake inventories. Different classification methods impact the results of lake mapping and monitoring.’ I am not sure if there are any studies in this region where alternative classification methods are used to detect water bodies; but if there are any then I think they would be a great addition to the dataset comparison section to serve as an inter-comparison of methodologies beyond alike optical classification approaches. [Response] To our best knowledge, glacial lake dataset produced based on SAR backscatter classification or hydrological sink analysis is not available in the study area. If available in the future, we are glad to make such a comparison. 
 Specific commentsL41-66: I think this a detailed and concise overview of the importance of glacial lakes and GLOFs in a regional context. However, I think a global perspective is needed to thoroughly illustrate the significance of this study - especially if you are referring to global studies of glacial lakes, such as Shugar et al. (2021). Please include a sentence or two near the beginning about glacial lakes and GLOFs globally (i.e. importance, general trends etc.) [Response] As suggested, we have added a sentence herein ‘Global glacial lake number and total area both increased between 1990 and 2018 in response to glacier retreat and climate change (Shugar et al., 2020), which inevitably affected the risk of GLOFs.’ L67-85: You largely focus on remote sensing efforts in HMA regional studies, but there are also references to papers from other regions such as Greenland and the Alps. Either open up this section as an overview of remote sensing studies from all regions, or keep it refined to the HMA region. There have been many regional studies that have been published recently (e.g. Alaska, Rick et al., 2022; Greenland, How et al., 2021), not just in HMA, so I would recommend widening this section to outline the methods in a general context, rather than focusing on HMA. [Response] Thank you and we have cited the suggested recent publications for other regions. It now reads: ‘…the Alaska (Rick et al., 2022), the Greenland (How et al., 2021)…’ L92: What exactly do you mean by object-oriented classification here? This term is generally used in programming rather than in reference to a classification approach. Please change this, or clarify what is meant here; preferably with a more suitable term. [Response] We deleted ‘object-oriented classification’. L117-119: Are these sub-basins divided by catchments and/or watershed? What determines these sub-basins? [Response] Yes, these sub-basins are divided by catchment based on major tributary rivers and DEM data. L132-170: Great outline of data sources. [Response] Thank you for your positive comment. L178: Why are landslide-dammed lakes irrelevant to glaciation? Can some glacial lakes also be landslide-dammed lakes? [Response] In this study, we accept the definition of a glacial lake as one that formed as a result of modern or ancient glaciation. Landslide-dammed lakes formed behind landslides, and have little connection with glaciation. Landslide-dammed lakes vary greatly with time and differ from glacial lakes, hence being exceeded in our dataset. In a particular situation, glacial lakes are also dammed by landslides, someone may define those lakes as landslide-dammed lakes. Our study focuses on all glacial lakes formed as a result of glaciation. L199: Change 'the method automatically generated the histogram...' to 'the method calculated the histogram...' [Response] Revised as ‘Specifically, the method calculated the NDWI histogram based on the pixels with each user-defined and manually-drawn region of interest.’ L201: Change 'interactively' to 'manually'. In reference to this comment and the last, I think it needs to be clear in the methodology how this approach is 'semi-automated'. [Response] We think 'interactively' is more suitable than 'manually' to depict the process of lake mapping. We needed to switch the screening NDWI and original image to determine an optimal threshold, and this is an interactive process. In the process of interactive lake mapping, manual inputs refer to drawing user-defined region of interest (ROI) and tuning the NDWI threshold in each ROI, whereas calculating the histogram of NDWI and converting raster lake extent to vector polygon are automated. To avoid the misunderstanding, we define the method as a human-interactive and automated lake mapping method and made some revision. It now reads: ‘Specifically, the method calculated the NDWI histogram based on the pixels with each user-defined and manually-drawn region of interest. The NDWI threshold that separates lake surface from land was interactively determined by screening the NDWI histogram against the lake region in the imagery (Wang et al., 2014; Nie et al., 2020). This way, the determined NDWI threshold can be well-tuned to adapt various spectral conditions of the studied glacier lakes. The raster lake extents segmented by the thresholds were then automatically converted to vector polygons.’ L224-228: False classifications from cloud and topographic shadows can be eliminated with cloud and terrain masking, which are well-established remote sensing methods in land classification. Why did you choose not to include this in the automated component of your workflow? [Response] We selected high-quality images to map glacial lakes for each time period. However, false lake extents resulting from cloud or snow cover, lake ice, and topographic shadows are unavoidable but limited. Then, we removed those false lakes and again mapped the lakes using our lake mapping method according to alternative images acquired in adjacent years. This method meets the needs of lake mapping. Incorporating cloud and terrain masking in the automated process is an excellent suggestion, and we will consider this in the future research. Table 1: The characteristics of a proglacial lake should specify that these lakes share a boundary with the ice margin, according to your definition - 'shared boundary' is a better description than 'connected with glaciers' as this could be interpreted as hydrologically connected instead of physically adjacent. [Response] Replaced 'connected with glaciers' with 'shared boundary with glaciers '. Table 2: There must be occurrences where a lake's formation and/or dam material properties are ambiguous (especially in relation to GLCS2), even from Google Earth imagery. I see in the dataset that there are no instances where a lake's classification is determined as uncertain; even though you state later on that occassional misclassifications are inevitable (L561). In such instances of ambiguous lake types, how do you decide the classification? [Response] Yes, some dam material properties are ambiguous from satellite observations. This is a challenge for GLCS2. Differentiating moraine-dammed and glacial-erosion lakes is challenging due to unclear moraine dam or bedrock superimposed by top moraine. To differentiate those dam types, we considered auxiliary factors that help classify lake dam types, such as location, surface slope, roughness and shape of the glacial lakes. We established the classification system of lake types and collected typical samples for each lake type to train our operators at the beginning of the classification. We then used our expert knowledge to classify all lakes with a combination of glacier data, DEM, geomorphological features. When indeterminate lake types emerged, we used group discussions to attribute the type. All these steps help us improve the quality of lake datasets that are more useful to users. We proposed the limitation and updating plan in the main text: ‘Although very high-resolution Google Earth images were utilized to assist in lake type interpretation, occasional misclassification was inevitable. We implemented two types of classification systems based on a careful utilization of glacier data, DEM, geomorphological features and expert knowledge. However, the lack of in situ survey prohibited a thorough validation of the glacial lake types.’ ‘The glacial lake dataset will be updated using newly collected Landsat and Sentinel images at a five-year interval or modified according to user feedbacks.’ L272-273: Please provide references to studies that use lake perimeter and displacement error to estimate uncertainty. [Response] We added the citations, and it now reads: ‘Lake perimeter and displacement error are widely used to estimate the uncertainty of glacier and lake mapping from satellite observation (Carrivick and Quincey, 2014; Hanshaw and Bookhagen, 2014; Wang et al., 2020).’ L270-295: Repetition with the corresponding uncertainty estimation section in the supplementary materials. I would suggest refining this section in the main manuscript, and keeping the full description for the supplementary materials. [Response] We removed the duplicate section on uncertainty estimation in the supplementary material, and moved Tutorial for Improved Uncertainty Estimating Method at the end of the main text as an appendix. We want editors to approve these changes. L294 and L305: Change figure names from Figure S3a/S3b to Figure 3a/3b, unless you would rather move them to the supplementary materials. [Response] Revised as “Figure 3a” and “Figure 3b”. Figure 4: This is a great figure. Please add labelling to the figure to indicate that one set of graphs is from Landsat and the other from Sentinel - you only understand this once you have read to the end of the caption, and it needs to be signposted earlier. [Response] We have added Landsat and Sentinel at the upper left corner of the figure 4 to differentiate. L339: 'proglacier' >> 'proglacial' [Response] According to suggestions from Reviewer #1, we replaced 'proglacial' with ‘ice-contact’ and corrected the typo. Figure 5: The four maps are somewhat repetitive and it is difficult to see differences between the Sentinel and Landsat lake sizes/abundance from this. I would suggest changing this figure to have an overview map on the left showing all detected lakes from both methods, and a series of inset maps to the right displaying a closer look at certain regions of interest; divided into Sentinel and Landsat lakes. Also, maybe change the outline colour of the lake points to a darker shade, as it is hard to identify the lake points in the current figure. [Response] The figure 5 aims to describe the distribution of glacial lakes in 2020 extracted from Landsat and Sentinel, and all lakes are classified by GLCS1 and GLCS2. From this point of view, it is not repetitive. For clarity, we added ‘Landsat’ for ‘panels a and b’, and ‘Sentinel’ for ‘panels c and d’. Meanwhile, differences in glacial lakes from Landsat and Sentinel can be clearly seen compared to "Panels a and c" using GLCS1 and "Panels b and d" using GLCS2. A closer look at certain regions of interest is showed in figure 11 and 12. That is why we designed this Figure. As suggested, we set the outline of lake points to black with a thicker size in order to better differentiate the lake points. L352: This is a hanging line, and I am not sure which panels and sub-graphs are being referred to here. Does this belong somewhere else or is this a fault with the journal formatting? [Response] We corrected this typo and ensure all captions are complete in the main text. Figure 7 and 8: These graphs are very effective at showing changes in lakes - a refreshing take on presenting this type of dataset. [Response] Thank you very much. L382: '...while the area grew by a less extent (1.21 km2 or 1.42%).' >> '...while the area grew by 1.21 km2 (or 1.42%).' [Response] Revised as ‘while the area grew by 1.21 km2 (or 1.42%).’ L408: '...including being stable for Shingo...' >> '...including a stable trend for Shingo...' [Response] Revised as ‘...including a stable trend for Shingo...’ L411-412: 'The total numbers of Kashgar and Hunza basins decreased...' >> 'The total number of lakes in Kashgar and Hunza basins decreased...' [Response] Revised as ‘The total number of lakes in Kashgar and Hunza basins decreased...’ L426-27 and Table 5: Can you include some statistics on the link between lake size and % overlap between the Sentinel-2 and Landsat counts? - this would help gauge how much spatial resolution (differentiated from image acquisition) affects lake classification in this study. [Response] Thanks for this suggestion. We tried but did not find any significant statistics between lake sizes and overlapping rates. Impact of spatial resolutions on classification accuracy of lake types is very interesting. We hope to conduct such studies in the future, however this is beyond the goal of this study. L441-443: Are these lakes persistently large or just at a particular time step? Do you have evidence as to why they are disproportionally large? [Response] These lakes are persistently large, and we deleted ‘disproportionally’ to avoid any misunderstandings. L481-485: Can you state here the number of instances where overlapping acquisitions were acquired? [Response] Revised as ‘…is relatively low (only 7 scenes of Sentinel images or 112 glacial lakes in 2020)…’ L492: 'approximate' >> 'approximately' [Response] Changed to ‘approximately’. L503-504: Are there any studies that present glacial lake datasets derived from Sentinel-2? If so, please reference them here. If not, then change this to state that there are no comparable datasets, rather than a scarce number of datasets. [Response] Agreed, it new reads: ‘Regional glacial lake datasets using Sentinel images are scarce. Lack of Sentinel-derived glacial lake datasets in the study area makes it impossible to compare.’ L515-521: I think, similar to your suggestion regarding landslide-dammed lakes, a likely answer is that Wang et al. focus more on glacier-connected lakes, given that they adopt a 10 km buffer to filter out unconnected lakes. And therefore they identify an increasing trend, possibly reflective of a subset of your lake types. Could you subset your lake dataset to match the lakes identified by Wang et al., and examine whether you also see this increasing trend evident in your subsetted dataset? (And perhaps also include landslide-dammed lake for the purpose of this comparison?) [Response] Both Wang’ study and ours are regional-scale glacial lake inventories and do not only focus on active lakes. Regarding discrepancy, all glacial lakes in the study area were mapped according to our definition regardless of the proximity to glaciers whilst Wang’s data was filtered by a 10-km buffer zone from glaciers. That is the reason that we mapped more lakes than Wang’s for each time period. Our lakes dataset maps all glacial lakes in the study area for the first time, making up for missing lakes in other datasets. The missing lakes are mostly non-glacier-fed lakes that remain relatively stable in the past decades (Figure 7), having little impact on changing trend in glacial lakes. Our study demonstrates that end-moraine-dammed lakes increased by 2.48 km2 and contributed most of the glacier lake area expansion, whereas supraglacial, ice-dammed and lateral-moraine-dammed lakes decreased slightly in both number and area. This trend is consistent with the slightly negative mass balance of glaciers in the study area (covering Pamir, Karakoram etc.). Based on the analysis of Wang’s data, we find that newly-emerged and expanded landslide-dammed lakes contributed most to the increase in lake area, and manually delineating glacial lakes twice by different operators exacerbated the errors of mapping. These reasons result in an increasing trend in their study. We have proposed the reason in the main text. Landslide-dammed lakes fluctuate greatly with time and expanded recently in the study area, differing from typical glacial lakes. According to our definition on glacial lakes, Landslide-dammed lakes are excluded. Our dataset shows a long-term regional changing trend of all glacial lakes. A comparison in the condition of including landslide-dammed lakes is beyond the goal of our present study and seems unnecessary. Thanks for your support and understanding. L530: "Zhang's dataset..." >> "The dataset from Zhang et al. (2015) ..." [Response] Revised as ‘The dataset from Zhang et al. (2015)...’ L531-536: I am unsure how this study and Zhang et al. could have discrepancies in image availability when both studies are classifying lakes from the same satellite image collection (Landsat). Some clarification is needed here to demonstrate how your dataset could classify these lakes when Zhang et al. could not. [Response] We revised this sentence. It now reads: ‘…we attributed this anomalous discrepancy to a range of glacial lakes that were missing due to lack of thorough cross-check quality assurance and the limit of a 10-km buffer zone from glaciers during their manual delineation.’ L538-544: Discrepancies in glacial lake datasets can be because of minimum lake size, classification method (i.e. not just optical), image acquisition and post-filtering. However, if the purpose of this dataset is to 'further promote the capacity of GLOF risk assessment and predicting glacier evolutions' then I am unsure why there is no spatial filter (relative to ice margin position) adopted to remove lakes that are unconnected to the glacial system. I think the focus of this study needs to be shifted (as stated earlier in major comments), and further analysis needs to be presented that demonstrates changes in GLOF and glacier-fed lakes specifically (i.e. filtered by lake type and size) - see major comments for full details. [Response] The focus of this study is to provide a complete glacial lake dataset based on Landsat and Sentinel images, which has potential to be widely applied in studies on glacial lake-related hazards, glacier-lake interactions and cryospheric hydrology. We made some modifications to be more explicit about the goals of this study and the importance of our dataset. The sentence was changed to: ‘…we provide an up-to-date glacial lake dataset derived from both Landsat and Sentinel observations, which further increased the availability of glacial lake datasets for GLOFs risk assessment, predicting glacier evolutions (Carrivick et al., 2020) cryosphere-hydrological changes…’ L550: 'Even though an capacity of repetitive observations...' >> 'Even though the capacity of repeat observations...' [Response] Revised as ‘Even though the capacity of repeat observations...’ L557: '...inter- and intra-annual changes (Liu et al., 2020) in glacial lake dataset of each time period...' >> '...inter- and intra-annual changes (Lie et al., 2020) for each time period...' [Response] Revised as ‘...inter- and intra-annual changes (Liu et al., 2020) for each time period...’ L545-564: This is a valuable section to include in the study. The temporal range of these datasets and limited image availability (especially in formative years) will not adequately capture the dynamic nature of draining glacial lakes; and therefore such datasets serve as a gauge of long-term, regional trends rather than individual lake change. [Response] We are thankful for your support and understanding. L565-575: It is great to hear that this work will be continued, and new time steps will be included in the dataset in the future. [Response] Thanks for your encouragement. L584: '...spatial-temporal changes at longer time scale...' >> '...spatial-temporal changes at a longer time scale...' [Response] Revised as ' ...spatial-temporal changes at a longer time scale... '. L584: 'observation' >> 'observations' [Response] Revised as ' observations '. L585: 'started' >> 'starting' [Response] Revised as ' starting '. L595-596: See previous comment from L538-544 regarding this statement. [Response] As responded to earlier questions. This sentence was changed to: ‘…maximize their potential utility for GLOFs risk evaluation, cryosphere-hydrological and glacier-lake evolution projection.’ L602: 'values for cryospheric-hydrology research, assessment of...' >> 'value to cryospheric-hydrology research, the assessment of...' [Response] Revised as 'value to cryospheric-hydrology research, the assessment of... ' References Carrivick, J.L., Quincey, D.J.: Progressive increase in number and volume of ice-marginal lakes on the western margin of the Greenland Ice Sheet. Global Planet. Change, 116: 156-163, https://doi.org/10.1016/j.gloplacha.2014.02.009, 2014. Carrivick, J.L., Tweed, F.S., Sutherland, J.L., Mallalieu, J.: Toward Numerical Modeling of Interactions Between Ice-Marginal Proglacial Lakes and Glaciers. Front. Earth Sci, 8, https://doi.org/10.3389/feart.2020.577068, 2020. Hanshaw, M.N., Bookhagen, B.: Glacial areas, lake areas, and snow lines from 1975 to 2012: status of the Cordillera Vilcanota, including the Quelccaya Ice Cap, northern central Andes, Peru. The Cryosphere, 8: 359-376, https://doi.org/10.5194/tc-8-359-2014, 2014. How, P., Messerli, A., Mätzler, E., Santoro, M., Wiesmann, A., Caduff, R., Langley, K., Bojesen, M.H., Paul, F., Kääb, A., Carrivick, J.L.: Greenland-wide inventory of ice marginal lakes using a multi-method approach. Sci. Rep.-UK, 11: 4481, https://doi.org/10.1038/s41598-021-83509-1, 2021. Liu, Q., Mayer, C.: Distribution and interannual variability of supraglacial lakes on debris-covered glaciers in the Khan Tengri-Tumor Mountains, Central Asia. Environ. Res. Lett., 10: 014014 2015. Liu, Q., Mayer, C., Wang, X., Nie, Y., Wu, K., Wei, J., Liu, S.: Interannual flow dynamics driven by frontal retreat of a lake-terminating glacier in the Chinese Central Himalaya. Earth Planet. Sc. Lett., 546: 116450, https://doi.org/10.1016/j.epsl.2020.116450, 2020. Miles, E.S., Watson, C.S., Brun, F., Berthier, E., Esteves, M., Quincey, D.J., Miles, K.E., Hubbard, B., Wagnon, P.: Glacial and geomorphic effects of a supraglacial lake drainage and outburst event, Everest region, Nepal Himalaya. The Cryosphere, 12: 3891-3905, https://doi.org/10.5194/tc-12-3891-2018, 2018. Nie, Y., Sheng, Y., Liu, Q., Liu, L., Liu, S., Zhang, Y., Song, C.: A regional-scale assessment of Himalayan glacial lake changes using satellite observations from 1990 to 2015. Remote Sens. Environ., 189: 1-13, https://doi.org/10.1016/j.rse.2016.11.008, 2017. Nie, Y., Liu, W., Liu, Q., Hu, X., Westoby, M.J.: Reconstructing the Chongbaxia Tsho glacial lake outburst flood in the Eastern Himalaya: Evolution, process and impacts. Geomorphology, 370: 107393, https://doi.org/10.1016/j.geomorph.2020.107393, 2020. Nie, Y., Pritchard, H.D., Liu, Q., Hennig, T., Wang, W., Wang, X., Liu, S., Nepal, S., Samyn, D., Hewitt, K., Chen, X.: Glacial change and hydrological implications in the Himalaya and Karakoram. Nature Reviews Earth & Environment, 2: 91-106, https://doi.org/10.1038/s43017-020-00124-w, 2021. Pfeffer, W.T., Arendt, A.A., Bliss, A., Bolch, T., Cogley, J.G., Gardner, A.S., Hagen, J., Hock, R., Kaser, G., Kienholz, C., Miles, E.S., Moholdt, G., Mölg, N., Paul, F., Radić, V., Rastner, P., Raup, B.H., Rich, J., Sharp, M.J.: The Randolph Glacier Inventory: a globally complete inventory of glaciers. J. Glaciol., 60: 537-552, https://doi.org/10.3189/2014JoG13J176, 2014. RGI Consortium: Randolph Glacier Inventory – A Dataset of Global Glacier Outlines: Version 6.0: Technical Report, https://doi.org/10.7265/N5-RGI-60, 2017. Rick, B., Mcgrath, D., Armstrong, W., Mccoy, S.W.: Dam type and lake location characterize ice-marginal lake area change in Alaska and NW Canada between 1984 and 2019. The Cryosphere, 16: 297-314, https://doi.org/10.5194/tc-16-297-2022, 2022. Sakai, A.: Brief communication: Updated GAMDAM glacier inventory over high-mountain Asia. The Cryosphere, 13: 2043-2049, https://doi.org/10.5194/tc-13-2043-2019, 2019. Shugar, D.H., Burr, A., Haritashya, U.K., Kargel, J.S., Watson, C.S., Kennedy, M.C., Bevington, A.R., Betts, R.A., Harrison, S., Strattman, K.: Rapid worldwide growth of glacial lakes since 1990. Nat. Clim. Change, 10: 939-945, https://doi.org/10.1038/s41558-020-0855-4, 2020. Wang, J., Sheng, Y., Tong, T.S.D.: Monitoring decadal lake dynamics across the Yangtze Basin downstream of Three Gorges Dam. Remote Sens. Environ., 152: 251-269, https://doi.org/10.1016/j.rse.2014.06.004, 2014. Wang, X., Guo, X., Yang, C., Liu, Q., Wei, J., Zhang, Y., Liu, S., Zhang, Y., Jiang, Z., Tang, Z.: Glacial lake inventory of high-mountain Asia in 1990 and 2018 derived from Landsat images. Earth System Science Data, 12: 2169-2182, https://doi.org/10.5194/essd-12-2169-2020, 2020. Wangchuk, S., Bolch, T.: Mapping of glacial lakes using Sentinel-1 and Sentinel-2 data and a random forest classifier: Strengths and challenges. Science of Remote Sensing, 2: 100008, https://doi.org/https://doi.org/10.1016/j.srs.2020.100008, 2020. Zhao, W., Xiong, D., Wen, F., Wang, X.: Lake area monitoring based on land surface temperature in the Tibetan Plateau from 2000 to 2018. Environ. Res. Lett., 15, https://doi.org/10.1088/1748-9326/ab9b41, 2020. Citation: https://doi.org/10.5194/essd-2021-468-AC2 
 
- 
                                        
                                     AC2:  'Reply on RC2', Yong Nie, 21 Mar 2022
                            
                            
                            
                            
                                        
Peer review completion
 
                                 
                
                                 
                                 
                
                                 
                             
                           
                             
                           
                             
                           
                             
                           
                             
                           
                             
                           
                 
                 
                 
                 
                 
                     
                     
                     
                    