the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Retrieval of dominant methane (CH4) emission sources, the first high-resolution (1–2 m) dataset of storage tanks of China in 2000–2021
Fang Chen
Lei Wang
Yu Wang
Haiying Zhang
Ning Wang
Pengfei Ma
Bo Yu
Download
- Final revised paper (published on 24 Jul 2024)
- Preprint (discussion started on 22 Feb 2024)
Interactive discussion
Status: closed
-
RC1: 'Comment on essd-2024-10', Anonymous Referee #1, 04 May 2024
Review Earth System Science Data: Retrieval of dominant methane (CH4) emission sources, the first high resolution (1-2m) dataset of storage tanks of China in 2000-2021
This manuscript generates a storage tank dataset (STD) by implementing a deep learning model with manual refinement based on high spatial resolution images (1-2m) from the GaoFen-1, GaoFen-2, GaoFen-6, and Ziyuan-3 satellites over some region in China in 2021. Since oil gas infrastructure data is barely publicly available in China, this dataset could give researchers a tool to estimate or scale up methane emissions from relevant industry sectors, and thus it is of value to be published in this journal.
The reviewer has several comments which may help improving the quality of the manuscript:
- Targeted tanks:
The manuscripts screens tanks under two criteria: - footprint bigger than 500 m2;
- located within the built area and bare land in the LULC product of the Esri Land Cover in 2021
while the objective of this study is to retrieve dominant methane emission sources, the reviewer is wondering how many of tanks retrieved are methane emission sources, and how many tanks which are methane emissions sources, are neglected. For example, oil gas batteries are major methane emission sources, and located in the rural agriculture area or desert. Will these batteries be identified under the current algorithm. What is the rational for 500 m2 threshold? Are there tanks who have footprints smaller than 500 m2?
To make this clear, the reviewer would suggest a table summarizing the key features of tanks, including the following aspects: Usage/function; typical size (water volume, m3); typical footprint (m2); typical location (urban, rural?); industry sector.
- Dataset accuracy verification
The reviewer did not see any verification of the dataset in the manuscript. Have the researchers verified their findings, for example, checking the accuracy of the dataset for one city/region? How many true positive, false positive? How many tanks are missed? Without such verification process, this is not a complete stand-alone paper.
- Construction year assignment
The researchers identified the construction year of tanks by the high-resolution historical images available on Google Earth. Can the researchers also identify how many tanks disappear during the period of study?
Minor Observations:
- Line 53, “ … with 85 times more climate warming potency than CO2”: please specify the time length of 85 times.
- Line 108, “size > 500 m2”. Tank size is usually characterized by its water volume. Suggest changing tank size to tank footprint.
- Line 114. “cities”: city in English usually refers to urban area. Here it looks like that the city covers both the urban area, as well as the rural counties. Would suggest to change the “city” to “region”, and explain the true meaning of it.
- Line 168, digital elevation model (DEM), and Figure 1: Is DEM a model or elevation? In figure 1. DEM High 4708, Low -91. What’s the unit for DEM? Please clarify, and add unit in Figure 1 (m?)
- Line 204 “Given that storage tanks are constructed mainly in residential areas”: Please elaborate why. To the reviewer’s knowledge, most oil gas storage tanks should be outside of the residential areas.
Citation: https://doi.org/10.5194/essd-2024-10-RC1 -
AC1: 'Reply on RC1', Lei Wang, 11 May 2024
Thanks for your valuable comments concerning our manuscript entitled “Retrieval of dominant methane (CH4) emission sources, the first high resolution (1-2m) dataset of storage tanks of China in 2000-2021” (essd-2024-10). These comments are very helpful in revising and improving our paper, as well as guiding our future research. We have studied the comments carefully and tried our best to revise the manuscript.
This manuscript generates a storage tank dataset (STD) by implementing a deep learning model with manual refinement based on high spatial resolution images (1-2m) from the GaoFen-1, GaoFen-2, GaoFen-6, and Ziyuan-3 satellites over some region in China in 2021. Since oil gas infrastructure data is barely publicly available in China, this dataset could give researchers a tool to estimate or scale up methane emissions from relevant industry sectors, and thus it is of value to be published in this journal.
The reviewer has several comments which may help improving the quality of the manuscript:
Q1. Targeted tanks: The manuscripts screens tanks under two criteria: footprint bigger than 500 m2; located within the built area and bare land in the LULC product of the Esri Land Cover in 2021.While the objective of this study is to retrieve dominant methane emission sources, the reviewer is wondering how many of tanks retrieved are methane emission sources, and how many tanks which are methane emissions sources, are neglected. For example, oil gas batteries are major methane emission sources, and located in the rural agriculture area or desert. Will these batteries be identified under the current algorithm. What is the rational for 500 m2 threshold? Are there tanks who have footprints smaller than 500 m2?
To make this clear, the reviewer would suggest a table summarizing the key features of tanks, including the following aspects: Usage/function; typical size (water volume, m3); typical footprint (m2); typical location (urban, rural?); industry sector.
Answer: Thanks very much for your suggestion. The storage tanks are extracted from remote sensing images, according to their spectral, textual, and morphological features based on deep learning frameworks as illustrated in our manuscript. It is very difficult to tell the exact usage of each storage tank only according to the appearance in the image, since storage tanks may take similar appearances with different functions.
Moreover, it is very difficult to estimate typical size of each storage tank because of lack of height. The typical footprint in the unit of m2 has been provided in the attribute table of our dataset. The storage tanks located in rural agriculture area or desert are not considered in our proposed dataset, because storage tanks of crude oil or other petroleum, and industrial materials, such as alcohols, gases, or liquids, are among the most significant sources of emitting CH4 (Im et al., 2022; Johnson et al., 2022), and they are predominantly situated in urban area or development zones in China due to pipeline transport logistics. Additionally, the high cost and limited access to high-resolution spatial imagery, coupled with the extensive coverage required for rural and desert areas, make it impractical to include these tanks in our study. Therefore, we did not distinguish the typical location of the storage tanks in our dataset since they are all located in urban area.
In terms of the threshold of 500m2, it is determined based on the spatial resolution of remote sensing images we collected to construct the dataset. The storage tanks with 125 pixels or more in remote sensing images can be distinguished more accurately from the complicated background objects. There are tanks with footprints smaller than 500m2, but they are more difficult to be extracted accurately due to the small size. Moreover, since large capacity storage tanks are known to release significant levels of CH4, our proposed inventory focuses on storage tanks with an area of no less than 500 m2. The corresponding sentence has been revised to be ‘Given that large capacity storage tanks are known to release significant levels of CH4, resulting in climate warming, the proposed inventory focuses on storage tanks with an area of no less than 500 m2’ in Line 400-402 of the revised manuscript.
Q2. Dataset accuracy verification. The reviewer did not see any verification of the dataset in the manuscript. Have the researchers verified their findings, for example, checking the accuracy of the dataset for one city/region? How many true positive, false positive? How many tanks are missed? Without such verification process, this is not a complete stand-alone paper.
Answer: Actually, each storage tank of our proposed dataset has been manually validated and refined by six experienced experts through visual interpretation based on our collected high spatial resolution images and field survey to facilitate the corresponding construction year. Therefore, we did not conduct dataset evaluation specifically because each storage tank has been verified and refined. To make the manuscript easier to understand, the corresponding sentence has been modified to be ‘To facilitate the dating of each storage tank's construction year, the reconstructed circle for each extracted storage tank has been manually validated and refined by six experienced experts through visual interpretation based on our collected high spatial resolution images and field survey.’ in Line 369-372 of the revised manuscript.
Q3. Construction year assignment. The researchers identified the construction year of tanks by the high-resolution historical images available on Google Earth. Can the researchers also identify how many tanks disappear during the period of study?
Answer: The storage tanks extracted in this study are those existing in the high spatial resolution images we collected in year of 2021, and the corresponding construction year is determined for each extracted storage tank referring to the historical images available on Google Earth. Due to the large scale of our study area and the limited availability of high spatial resolution images, the storage tanks are extracted based on mono-temporal images, rather than time-series images. Therefore, it is difficult to identify the disappeared storage tanks during the period of study.
Minor Observations:
Q1. Line 53, “ … with 85 times more climate warming potency than CO2”: please specify the time length of 85 times.
Answer: Thanks very much for your suggestion. We have specified the time length in Line 52-54 of the revised manuscript as ‘Meanwhile, CH4 is more effective in trapping heat, with 85 times more climate warming potency than CO2 for a decade or two (Stocker, 2014).’.
Q2. Line 108, “size > 500 m2”. Tank size is usually characterized by its water volume. Suggest changing tank size to tank footprint.
Answer: Thanks very much for your generous suggestion. We have changed tank size to tank footprint throughout the revised manuscript.
Q3. Line 114. “cities”: city in English usually refers to urban area. Here it looks like that the city covers both the urban area, as well as the rural counties. Would suggest to change the “city” to “region”, and explain the true meaning of it.
Answer: Your suggestion is much appreciated. The word region is better, but it is difficult to describe the administrative boundary of our study area. Therefore, we have modified city to city region throughout the revised manuscript. Hopefully, it is clearer this time.
Q4. Line 168, digital elevation model (DEM), and Figure 1: Is DEM a model or elevation? In figure 1. DEM High 4708, Low -91. What’s the unit for DEM? Please clarify, and add unit in Figure 1 (m?)
Answer: The unit for DEM is meter. Thanks very much for your suggestion. We have added the unit in the caption of Figure 1 as ‘Figure 1. Study area demonstration with digital elevation (in the unit of meter) from the Shuttle Radar Topography Mission (SRTM) product.’ in Line 178-179 of the revised manuscript.
Q5. Line 204 “Given that storage tanks are constructed mainly in residential areas”: Please elaborate why. To the reviewer’s knowledge, most oil gas storage tanks should be outside of the residential areas.
Answer: The storage tanks are mostly constructed in urban area because of the high expense of pipeline transportation. Moreover, according to our field survey, the storage tanks, especially storing oil and gas, are mostly located in industrial area of urban area. To make the manuscript easier to understand, the corresponding sentence has been modified to be ‘Given that storage tanks are constructed mainly in urban area due to the high expense of transportation of pipelines’ in Line 205-206 of the revised manuscript.
Citation: https://doi.org/10.5194/essd-2024-10-AC1
- Targeted tanks:
-
RC2: 'Comment on essd-2024-10', Anonymous Referee #2, 11 May 2024
The author created a dataset of storage tanks using a deep learning approach applied to high-resolution remote sensing imagery. This dataset provides a comprehensive, validated, and geo-referenced collection of details, including the precise locations, distributions, and construction years of storage tanks. It covers 92 representative cities, encompassing a total of 14,461 tanks. The manuscript also explores the spatial correlations between the distribution and density of storage tank and methane emissions, contributing to a more profound understanding of the societal, ecological, and settlement impacts of methane emissions from these structures. The paper's innovation lies in its comprehensive database of storage tanks across 92 typical cities and its large-scale exploration of the spatial interplay between storage tanks and methane emissions. However, there are some problems that require responses from the authors. Afterwards, this manuscript could be accepted for publication after a minor revision.
Some detailed problems:
Line 167-168: The author seems to refer to 'coastal cities' when mentioning that “Many of the cities are located near or next to the boundary of 167 mainland China.”, and this should be explicitly identified.
Line 172-173: The manuscript predominantly focuses on the measurement of methane emissions, rather than the measurement of methane reduction.
Line 204-205: Why are the storage tanks mainly constructed in residential areas? I think it should be placed in the built area and bare ground that is far from residential areas in urban settings.
Line 220-221: The terms of LULC categories should be consistent throughout the manuscript. For instance, the “Bare ground” or “Bare Ground” and “Flooded vegetation” or “Flooded Vegetation” should be maintained in a uniform format.
Line337-343 & 508: The equations should be aligned at the centre within the text.
Line 425-426 & 598 & 602: The “m2” should be written as “m2” and the “CH4” should be written as “CH4” in the text.
Line 428-430: The storage tanks of different categories should not omit their respective units in the figure, such as 500-1000 m2.
Line 448: The author seems to refer to 'coastal regions' when mentioning “especially at the border of mainland China”, and this should be explicitly identified.
Line 466 & 468: The units “TgCH4yr-1” should be separated in the text.
Line 470 & 525: The p-value is usually represented as “p=0.1” or “p=0.05”.
Line 475-476: In figure 10 (A), the “CH4” should be “CH4”. In figure 10 (B), the unit “Tg CH4 yr-1” of methane emissions of left panel is necessary after “Methane emission”.
Line 490 & 493: The formatting of tables should be consistent throughout the entire manuscript, using either “Table 1, Table 2” or “Table I, Table II”.
Line 505: The term 'oil tank' is being introduced here for the first time. It is necessary to clarify whether 'oil tank' and 'storage tank' are synonymous with each other.
Line 676 & 715 & 724 & 725 & 734 & 735 & 743 & 749 & 778: The author should undertake a thorough review to guarantee the entirety of these references.
Citation: https://doi.org/10.5194/essd-2024-10-RC2 -
AC2: 'Reply on RC2', Lei Wang, 13 May 2024
Thanks for your valuable comments concerning our manuscript entitled “Retrieval of dominant methane (CH4) emission sources, the first high resolution (1-2m) dataset of storage tanks of China in 2000-2021” (essd-2024-10). These comments are very helpful in revising and improving our paper, as well as guiding our future research. We have studied the comments carefully and tried our best to revise the manuscript.
-
AC2: 'Reply on RC2', Lei Wang, 13 May 2024
-
RC3: 'Comment on essd-2024-10', Anonymous Referee #3, 20 May 2024
The authors used their own deep learning model, Res2-Unet+, to generate the first high-resolution 1 (1-2m) dataset of storage tanks from over 4000 images and assign the year of each storage tank. The dataset is then used for CH4 emission analysis. The dataset is useful and the analysis is reasonable. It is suitable for publication in ESSD. However, I have the following comments:
Abstract
Line 30 “based on high spatial resolution images (1-2m) …”. I suggest the authors add the number of images.
“based on over 4000 high spatial resolution images (1-2m)… ”
3.3 Land use land cover product
Line 263-264. "historical high spatial resolution images, high spatial resolution images collected, and field survey from Google Earth", please refine the sentence. from my understanding, only historical high spatial resolution images are from Google Earth.
Section 4.2.1
Line 309 "Res2-Unet+ by Yu et al. (Yu et al., 2021)". "by Yu et al." should be removed. The same issue is in line 352.
Line 329 "Our proposed Res2-UnetA", does it mean Res2-Unet+?
equation (1)-(7), what do "f, m, n, h, etc" stand for?
Section 4.3
The STD dataset covers 2000-2021. How do you define the year if the storage tank was built before 2000, and how many such cases are there?
Section 5.1
Line 404-405. “It may be seen that 404 storage tanks of 500-1000 m2 are more than those of larger sizes”. Then, what are the criteria for threshold 500m2?
Section 5.2
What is the year for density calculation in Fig 9?
Section 6.1
NEPU-OWOD dataset, an oil storage tank dataset from platform Kaggle, and the STD dataset.
They should be separated into different paragraphs.
References
The references need to revised in uniform format. Some references are missing. Some references have doi.
Citation: https://doi.org/10.5194/essd-2024-10-RC3 - AC3: 'Reply on RC3', Lei Wang, 24 May 2024