the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A global map of local climate zones to support earth system modelling and urban-scale environmental science
Matthias Demuzere
Jonas Kittner
Alberto Martilli
Gerald Mills
Christian Moede
Iain D. Stewart
Jasper van Vliet
Benjamin Bechtel
Download
- Final revised paper (published on 29 Aug 2022)
- Preprint (discussion started on 07 Apr 2022)
Interactive discussion
Status: closed
-
RC1: 'Comment on essd-2022-92', Anonymous Referee #1, 05 May 2022
General comments
The paper entitled "A global map of Local Climate Zones to support earth system modelling and urban scale environmental science" depicts the major advancements realised by the community effort led by the WUDAPT community that permitted the obtention of the first global Local Climate Zones map. The paper is already in a very mature state and no major points of concern are to be clarified. The paper should therefore be published once the comments given below are treated. Some specifications are required in the methods. But most importantly, I would like to have the authors commenting more on the quality of the training polygons and the sampling bias per ecoregion. This could help focus future efforts by the community to improve the current map in future releases.
Major comments
Line 120: Please define what a well-trained student is.
Line 127 to 129: How is the best submission defined? If solely on overall accuracy (OA), then this can be biased. Why not using all submissions instead, as suggested by the HUMINEX project. Explain why 50% is retained and not 60% as suggested by Bechtel et al. (2019). Please explain the rationale in a short sentence. Also, why are archived TAs given a higher priority over the ones produced in the Generator? Were they checked or published before being archived? Some places are mostly sampled via the Generator (e.g., India or China), would you say that the resulting mapping in these ecoregions are of lower quality?
Line 169 to 174: You say it in the following paragraph but it is unclear at this stage why you do the feature importance for the 16 sets of TAs. Also, why is the performance measured at that stage and not in GEE? Could you also be a bit more specific on the reasons that explain you going from GEE to a python environment? Could be interesting for some geospatial scientists.
Line 184 to 185: I like that step and fully support it. Nonetheless, there may ba a bias induced by the quality of the TA per ecoregion (e.g., TAs comming only from the Generator). This should be discussed at a certain point.
Line 192 and 193: I had a hard time understanding why you calculate the accuracy again for each subset after going through Pathway 1.
Line 249: Do you know how the GHS-S2net data performs in places where informal settlements are common and where roads are made of bare soil rather than asphalt? This could impact your evaluation.
Line 288 to 290: Looking at the TAs on the LCZ Generator, one can see that in the Indian cities, for example, close to no LCZ 7 has been sampled. Coming back to the question of the TA quality in certain places, how do you think this could influence your global map? Also, could it be that some users do not take sufficient time to get acquainted with the LCZ scheme? Would your TA filtering capture this?
Figures 5, 6 and 7 and related text: I would like the authors to comment more about the probability of a certain LCZ to occur in different FUAs. In Lagos, for example, the probability of having the same LCZ classified is higher in the city and lower in the rural area. This is the opposite for a city like Delhi or Lima. Could you try to explain and discuss how the quality of the TA sampling done in the different ecoregions may lead to such outcome?
Line 331: Does the LCZ 8 class really belong in this cluster? Shouldn't it be added to the group with LCZ 7 and 10? Afterall, the building materials of LCZ 8 are very different to the compact built-up LCZs.
Line 339: Although I do believe that LCZ 3 and LCZ 8 are indeed the most common LCZ globally, the proportion of LCZ 8 over LCZ 3 may be biased because some confusion is happening during the classification. Could you try to explain why such confusion is happening between these two classes? You later speak about their radiative ressamblance (on line 511). Do you have any data to support this?
Figure 8: I really like this figure but could you add an estimation of the uncertainty of the proportion per LCZ?
Line 521: When you talk about "their purpose", could you add that users are invited to continue helping the develoment of future maps releases by contributing to the WUDAPT project through the LCZ Generator?
Minor comments
Line 2: Change "as" to "since" and "acknowledged" to "recognized"
Line 6: Add "and mitigative role" at the end of the sentence
Line 19: Change "warming" to "climate warming"
Line 19 to 20: Rephrase this complex sentence and potentially divide it in two to make it clearer
Line 34: Change to "and alters the local climate creating specific urban climates" or similar.
Line 44: Chose between "distinct urban canopies and boundary layers" or "a distinct urban canopy and related boundary layer".
Line 46 to 47: I would remove this statement that is not defended by any evidence. Otherwise, put it subjectively (e.g., "could" soon allow; "are expected"...)
Line 48 to 49: Rephrase as "Hence, a comprehensive [...] is needed."
Line 53: Change "needed to support" to "required by" and change the final dot to a double point "[...] functions: measures of [...]".
Line 55: "Influences" to "Influence"
Line 60: Change "assess" to "test"
Line 64: Add a space between "heat" and "(Demuzere"
Line 77: Add "[...] parameters (UCPs) required by urban climate models and by policy-makers to run [...]"
Line 96: Check the citation command for Ching et al. (2018). If LaTeX used, check that for all the manuscript.
Line 108: Change "random forest model" to "random forest classifier".
Line 138: Rephrase "one needs" to a less familiar tone
Line 164: "2+ million labels", are these TAs or pixels within TA polygons?
Line 171: Delete the comma after "a)"
Line 195 to 196: Is the "splitting the polygon pool" approach done for the first time in the LCZ mapping or has it been used in previous mapping (e.g., Europe or the US)?
Line 291 to 292: Please detail what the "average number of ROIs" is.
Line 322 to 325: This sentence could be moved to the discussion if needed. Otherwise, please suppress it.
Figure 9: Could you provide boxplots per ER too?
Line 388: Why is the slope chosen as a metric for evaluating the classification performance? This is quite uncommon.
Line 401 to 403: How is this statement explanatory of the difference between the LCZ-derived AHF and the observation?
Line 407: Chose another word than "zonal"
Line 422: Please change "Global South" and later "Global North" to other denominations. This concept dates from the 1980s.
Line 441 to 442: Do you have a reference to defend that city population is a proxy to urban form?
Line 453: The works by Potgieter et al. (2021) and Brousse et al. (2022) are suggested as additional references concerning crowdsourced data.
Line 471: When citing Demuzere et al. (2021a), please refer specifically to the W2W python tool as done for WUDAPT-TO-COSMO.
Line 510: I suggest changing "surface fractions" to "impervious and built-up surface fractions".
Line 512: Rephrase this sentence for clarity.
Please consider checking for American and English spelling discrepancies.
Citation: https://doi.org/10.5194/essd-2022-92-RC1 - AC1: 'Reply on RC1', Matthias Demuzere, 11 Jul 2022
-
RC2: 'Comment on essd-2022-92', Anonymous Referee #2, 06 May 2022
General comments
This work describes a new dataset of land cover types (10 urban – 7 natural) using Local Climate Zones at the global scale. Work is clearly described, evaluated, and presented. The associated dataset is of high quality, and I expect will become a landmark data source for the community.
The discussion on “accuracy” vs “robustness” could be improved (see specific comments). Additionally, there is no acknowledgment that LCZ training polygons are susceptible to human errors (again see specific comments).
Section 3.2 and Figure 10 show that the correlation R2 for building height is only ~0.5, however this is only very briefly mentioned in results, and not mentioned elsewhere (e.g. discussion/conclusion/abstract). So, while 2D information like lambda_B appears to be very well captured, 3D information remains a significant limitation. This is a key result and its implications should be discussed more thoroughly.
A lower reliance on acronyms would assist the casual reader. For example Figures 9 and 10 are not decipherable without referring to other sections of the text.
However, overall, an impressive body of work.
Specific comments
Line 41: “Earth System Models (ESMs) have only recently evolved to accommodate urban-scale landscapes, even though the parameters that are used by ESMs to these landscapes are limited in scope”
Some global climate models have had integrated urban canyon models for over a decade (e.g. CLMU in CESM). I’m not sure if these are ESMs (ESM relates to the carbon cycle, not the global scale, some readers may misinterpret this). I think safer/clearer to say many global-scale models ignore urban landscapes or represent them simply.
Line 120: suggest removing “well-trained” as subjective.
Line 127: “only the best submission is retained” what distinguishes a “best” submission?
Line 128: How is accuracy determined?
Section 2.4.1: I would describe this as a test of robustness, not accuracy, as this does not test whether the classifications are correct, just whether they change with different inputs. This method also assumes that training areas are accurate, but TAs are classified subjectively by humans. True accuracy can be tested with building resolving spatial datasets. However, I accept this “accuracy” terminology has been established elsewhere in the literature, but a comment to clarify accuracy vs robustness would assist readers.
Line 200: “The overall accuracy denotes the percentage of correctly classified pixels.” As described above, the method does not assess whether pixels are classified correctly, only how often they are unchanged (and potentially remain incorrect). With poor training data, the overall “accuracy” could approach 100% but be completely wrong. Please rephrase.
Line 422: While the use of “Global South” and “Global North” is quite common, some see these terms as problematic as they are geographically inaccurate, deterministic, and paternalistic. If authors mean “lower wealth” they could just say that.
Technical corrections
None
Citation: https://doi.org/10.5194/essd-2022-92-RC2 - AC2: 'Reply on RC2', Matthias Demuzere, 11 Jul 2022
-
RC3: 'Comment on essd-2022-92', Jason Ching, 17 Jun 2022
Preface to this review: A decade ago at the Croucher Advanced Study Institute in Hong Kong, this Reviewer and Gerald Mills (a coauthors of this paper) reflected upon a presentation by Iain Stewart (also co-author on Local Climate Zones (LCZ), topic of his PhD research. The LCZ is a universal classification scheme that differentiates urban surfaces into different combinations of building form and function features. Together with associated values of urban canopy parameters (UCPs) the LCZ provided the conceptual framework that inspired the startup of WUDAPT, an urban climate community collaborative project. WUDAPT scope is worldwide, in principle the global LCZ map with its companion UCPs provides model inputs that describe the underlying embedded canopy features of the urban boundary layer for any and all cities in the world. This paper describes efforts leading to the generation of the Global LCZ map, an achievement that culminates an effort a decade in the making and satisfies the major goal and is a key milestone of the WUDAPT. This global LCZ map product support “fit for purpose” applications of environmental models capable of addressing urban induced environmental issues exacerbated by climate changes.
Overview: This paper is a significant contribution; it represents the product at 100-meter resolution designed to provide urban canopy data, making possible a means to generate uniformly consistent urban canopy parameters for all cities in the world as inputs to a wide variety of models such as meteorology (e.g., WRF_ Surface energy budgets (e.g., SUEZ), etc. The implication of this result is a capability for making possible fit for purpose (FFP) modeling for assessment at intra-urban scales impacted by climate changes for any and all cities in the world. The LCZ scheme is the cornerstone of this paper; it defines 10 distinct classes each having unique land cover and physical form and function aspects describing the built environment along with 7 other nonurban land cover types. For each of the built classes, there is a corresponding range of values of urban canopy parameters suitable as modeling inputs for urbanized WRF and other environment modeling systems. The research team has been engaged in developing and establishing methods and techniques at the outset of WUDAPT; their creating of an LCZ generator facilitated upscaling LCZ maps based on sets of Training Areas (TA) representing each LCZ class by urban experts for individual cities upscaled to regional/continental scale maps for different regions of the world. Their R&D trajectory has provided the experiential base and creating approaches and methods that extend TA transferability from individual cities to regional and continental maps. This paper describes in detail the adjustments and modifications to the methodology that generated regional-continental LCZ maps thus enabling the creation of this global LCZ map. This achievement fulfills a major objective of the WUDAPT (www.Wudapt.org) Project.
Key Points: The article provides the reader with (i) a concise discourse describing the approach and methods and inputs to generate this product; and (ii) provide suggestions on its utility to supporting modeling and urban scale environmental science. The context of my review will reflect perspectives based on the WUDAPT initiative, a collaborative venture of the urban community. This article is thoughtful, and well organized, I briefly highlight and summarize key points of each section below.
- The Title is accurate and appropriate. The focus of this paper in on the development and implementation details of the methodologies that makes possible the resulting global LCZ map. LCZ is an universal urban typology and classification scheme to representing unique properties of form and function variables in the urban canopy layer of cities along with 7 other classes representing non-urban landscapes. The complementary urban canopy parameters values associated with each LCZ class provide a globally consistent framework for data on the form and function of morphological features in urban canopies. This effort extends LCZ maps progressing from original city specific sets to recent mapping of cities and the surrounding non-urban areas within regional domains to global coverage. The impact of this achievement supports urban scale modeling systems and their applications anywhere in the globe, achieving the primary objective of the Level 0 approach of WUDAPT. Implementation procedures to generating the upscaled Global product required appropriate modification and innovations to previous efforts for the regional scale prototypes.
- The Introduction provides a perspective on the value and importance of urban science to addressing global climate change issues. The enabling science behind current models is the physics algorithms for the vertical exchange processes of momentum, energy and moisture pollutant emission influenced by the myriad of urban morphological (UM) features. These exchange processes take place in the so-called urban canopy layer which extends from the surface to the top of UM features. Typically, modeling with urban canopy physics requires special sets urban data but heretofore is only available for limited number of cities; thus a large information gap exists which severely limits environmental models as tools intended and needed to addressing climate induced risks issues at urban scales. The introduction of the LCZ framework and its companion range of values of urban canopy parameters (UCPs) is what the Global LCZ map achieved here and generated at 100m resolution makes possible myriad of practical ”fit for purpose” environmental modeling applications at a reasonably fine scale addressing climate change issues impacting weather, climate, air quality at both inter and intraurban scales for each and all cities worldwide.
- Section 2 describes in several subsections, specific details of each of the various methodology and approaches employed to generate this global LCZ map. In general, the approach pertinent to each aspect is described in detail, thoroughly, albeit, many supporting technical details were provided in cited references.
- Section 2.1Training Areas (TAs) This subsection describes the methods to generate this Global LCZ map. It is based on incorporating TAs of LCZs from various sources, mainly from (a) archived community generated TAs representing hundreds of different cities around the world and additionally (b) a special set prepared for another hundred or more other Regions of Interest (ROIs) cities around the world and (c) from TA samples generated by a unique LCZ Generator employed earlier for regional mapping projects (reference cited). The archived TAs required a curation effort to rectify the issue of unevenness in the quality and physical size of TAs submitted into the WUDAPT archives. Their approach adopted in the curation processing is logical, and sound towards assuring uniformity and consistency in quality of the TAs. It clearly builds upon insights and experience gained in prior efforts including the HUMINEX project, implementing such approach contributing to the successful generating of regional to continental scale LCZ maps. The efforts described in this section extends the approach used for the regional maps to assure a uniformity in the quality of this global product.
- Section 2.2describes special treatments to extend the handling of the added and supplementary Earth Observations (EO) needed as inputs for the Global LCZ supervised random forest classifier from the regional mapping stage. Here updates or additions to the original 33 Global Earth Observation were incorporated to the LCZ Generator.
- Section 2.3: Classification schemesThis section describes what the authors call a Lightweight Global Random Forest model based on various pixel-based mapping methods; however, upscaling such methods to the global product was apparently not straightforward. For this, the default LCZ Generator from earlier studies required significant modifications. This was apparently a huge classification challenge (given >106labelled TA and other inputs); it was facilitated by incorporating this dual sequential pathways approach, another innovative advancement.
- The important QA assessmentbuilds upon their recent continental scale LCZ mapping efforts; its procedures are based on (a) five (5) traditional accuracy metrics (Section 2.4.1) and (b) incorporating a novel but indirect thematic benchmarking (Section 2.4.2) involving comparing mapped outcomes of several urban canopy parameters ( % built, % impervious surface and sum of built and impervious total plan area, building heights and AH (anthropogenic heating ) associated with each LCZ class with other sets of global and open source databases reflecting urban form and functions. The level of comparability provided a relative qualitative assurance measure of the outcomes of UCPs associated with the LCZ maps. Clearly, the success of these relative outcomes varied for the different UCP analyzed (Fig 10). In this regard, future effort associated with other independent means including outcomes of UCPs generated by WUDAPT Level 1 and 2 approaches described in Ching et al, (2019) will be helpful, going forward.
- Section 3: Results. The Global LCZ map is shown in Fig 4. While the 7 non-urban LCZ classes are not as discriminating in the number of classes as in other mapping schemes, the major value is that all urban areas herein are discriminated into the 10 universally based LCZ scheme, a product consistent for all urban areas in the world, e.g., Figure 5-7, are examples of zoomed LCZ maps for various cities extracted from this Global LCZ maps. It was noted that these sets of cities display and support earlier observations that that each and all cities has its uniquely characteristic LCZ signature (or fingerprint). This is a feature that was apparent and evident in the UCPs generated for the NUDAPT project (Ching et al, 2010), and for LCZ and UCP maps for individual cities studies and from the recent regional LCZ mappings. From such observations, it is probably reasonable to infer that all cities in the world have unique LCZ fingerprints, and by extension, a commensurate unique set of UCPs. This is an important consideration as it provides the rationale and bases of conducting fit-for-purpose intraurban modeling studies based on the Global LCZ map applicable to each and every city in the world. Support for the contention is expressed in Section 4 indicating how this Global LCZ map can serve earth system modeling and urban scale environmental science, and extend its utility to intraurban scale. However, as noted in the paragraph beginning at line 492, much more need to be done for full effectiveness especially at the intraurban scale in this regard. The UCPs provision in LCZ is currently manifested in lookup tables of ranges of values in the UCPs of each LCZ class. Remedies include path forward innovative cyber-based approaches are currently underway to generate block scale gridded UCPs of both form and function parameters in WUDAPT Level 1 and 2 staged efforts already referred to in the reference list (Ching et al. 2019) to complement the global LCZ mapping efforts.
- Section 4: this Section, the article highlights and discuss the attributes and impact of the Global LCZ product. Herein, the significance in terms of objectives, and potential impacts and caveats of this study are explored, Since this global map has just been completed, the discussion refers to the Global LCZ maps support to wide range of potential modeling applications, some already underway, in concert with the WUDAPT perspecitives.
- Conclusion section: The results of this Global LCZ map is an important and significant achievement culminating from the collaborative efforts of many activities and voluntary contributions from the urban climate and multidisciplinary science communities evolving and improving after over nearly a decade of efforts. The advances and contribution by this team has been impressive moving the LCZ framework from prototypic city specific mapping to the creation of regional maps and culminating in this impressive global product. Given the widespread and rapidly growing literature on LCZs we can anticipate much interest in this product. For a whole host of reasons, including projections of climate change impacts to enhancing weather extremes, to urbanization dynamics from increased population, the LCZ paradigm, and the various levels of coverage, and certainly, with this Global product provides an important and significant approach towards supporting science-based tools for myriad and wide ranges of modeling application and studies supporting local to international policies that addresses climate change issues.
- Data availability: In the last section, this map is available via link provided. While it was not mentioned, I would recommend the authors consider making the map and accessibility to the pixel generated LCZ in the WUDAPT Portal
Suggestions on specific points The following are a list of relatively minor issues and concern indicated below.
- Line 120-121.This sentence bears a burden of explanation; It will be necessary to provide objective measures and criteria to establish the objective quality indicators of the RUB produced TA vs other ARC sets.
- Line 123, page 6.Revise, eliminate term “old“ in old WUDAPT Portal. This eliminates the need to explain or differentiate the progression of Portal versions.
- Line 131, “Third” used here is really “Fourth” as “Third is already used in line 129
- Line 189:To better understand and appreciate the value of the Lightweight global random forest model, herein, as regards the probability layer, please discuss the takeaway points of the meaning of a high vs low probability.
- Line 246 Functional Urban Area (FUA) are indicated by a reference (Schiavina et al. 2019); given the important role of FUAs in discussions in the remaining text it would be highly useful to introduce the key aspects of this FUA framework as in the Section 2.3 and 3.2.
- Line 433 pg 24. Clarification of introducing the Term “scaling laws”
- Line 503: Consider elaborating on introducing the term “subclass scheme”
- Explain “Morphological Gaussian Filter”
- Highly recommend the Global LCZ Map and data results be incorporated into the WUDAPT Portal.
- Page 51, Last line of caption to Figure F1; add “detail in” the Table F1.
Summary: The effort described in this article to creating this Global LCZ map is impressive. This product represents an achievement, culminating a decade of activity and efforts by the urban community through WUDAPT towards acquiring urban canopy layer data for models that provide a means for addressing climate change and urbanization issues for local to regional to global scales. Its paradigm incorporates the LCZ typology thus providing a unified and consistent basis for generating intraurban form and function type data paradigm (WUDAPT Level 0) for the urban canopy layer. This Global LCZ map supports the observation from earlier city specific studies and regional maps, that each city LCZ map signature (e.g. fingerprint) is unique. This “feature” provides a rationale for supporting a wide range of earth systems modeling, applications, and urban scale environmental science, e.g., urban modeling applications in which intraurban scale weather forecasting and assessments can be made based on implementing urban boundary layer parameterizations in models with universally consistent intraurban urban canopy descriptions unique to each and every urban area in the world. The rationale, requisite technical issues to the innovative approach toward generating this global map, the approach and results were well articulated and fully documented. The point to future work advancements some alluded to in the caveats provided of the current LCZ paradigm for balance. In particular, its incorporation into WRF for example through an improved WUDAPT to WRF link is underway as well as efforts along the lines of WUDAPT level 1 and 2 (Ching et al. (2019) will be to provide a pathway towards introducing refined city specific block scale gridded UCPs in future updates.
Citation: https://doi.org/10.5194/essd-2022-92-RC3 - AC3: 'Reply on RC3', Matthias Demuzere, 11 Jul 2022