the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A high-quality daily nighttime light (HDNTL) dataset for global 600+ cities (2012–2024)
Abstract. Nighttime light (NTL) data at daily scales presents an innovative foundation for monitoring human activities, offering vast potential across various research domains such as urban planning and management, disaster monitoring, and energy consumption. The VNP46A2 dataset, sourced from NPP/VIIRS, has been providing globally corrected daily NTL data since 2012. However, persistent challenges, such as fluctuations in daily NTL series due to spatial mismatch and angular effects, as well as missing data holes, have significantly impacted the accuracy and comprehensiveness of extracting daily NTL changes. To address these challenges, a dataset production framework focusing on error correction, interpolation, and validation was developed. This framework led to the creation of a high-quality daily NTL dataset from 2012 to 2024, named HDNTL, which specifically targets 653 cities with populations predictably exceeding one million in 2025. A comparative analysis with the VNP46A2 dataset revealed promising results in spatial mismatch correction for two sample areas – the airport and flyover (angular effect can be ignored). These areas exhibited reduced fluctuations in HDNTL time series and maintained or strengthened weekly periodicity, which reflects traffic flow dynamics. Furthermore, the correction of angular effects across various urban building landscapes demonstrated sound improvements, mitigating angular effects in different directions and reducing periodicity from the angular impacts. The spatiotemporal interpolation of missing data holes has high similarity with the reference data, as indicated by an R2 of 0.98, and it increased the valid pixels of all cities by 15.12%. The HDNTL dataset exhibited enhanced consistency with high-resolution SDGSAT-1 data regarding the NTL change rate and alignment with ground truth data of power outages, showcasing superior performance in short-event detection. Overall, the HDNTL dataset effectively mitigates instability in daily series caused by spatial mismatch and angular effects observed in VNP46A2, improving data comparability across time and space dimensions. This dataset enhances the ability of the NTL to reflect the ground events, providing a more accurate reference for daily-scale nighttime light research. Additionally, the dataset production framework facilitates easy updates from future VNP46A2 products to HDNTL. The HDNTL is openly available at https://doi.org/10.5281/zenodo.14992989 (Pei et al., 2025).
- Preprint
(2120 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 01 Aug 2025)
-
RC1: 'Comment on essd-2025-142', Anonymous Referee #1, 21 May 2025
reply
This study proposed a method to correct the angular effects, spatial mismatch, and small holes in daily NTL data and generated a wonderful product of high-quality daily NTL data covering more than 650 cities across the world. This a very valuable dataset, and the generation procedures and validation assessments are reasonable and solid. I think it should be published after a revision.
Comments:
- In Line 97-98, the authors mentioned that the knowledge of the local building environment is important for identifying the sources of light. I am not so sure why the authors paid attention to this? It looks like there are no associations with the HDNTL generation.
- In Line 174, what does the "fixed area" mean? An area with stable light emissions?
- Adding a scale bar in Fig. 4 and 5 can help us tell if the current cases are solid or not. For example, in Fig. 5, if a 500x500m pixel is much larger than a flyover, the flyover’s surrounding light emissions could affect the validation.
- In Fig. 10, what do (a) and (b) refer to?
- I found that all pixels in the 2021-6-9 NTL data of Fuzhou, Fujian, China are 0, as I known Fuzhou is rainy that day. So I am wondering if the method proposed in this study can correct the daily NTL data on a cloudy or rainy day, especially during a continues cloudy/rainy days?
- What is the maximum/optimal size of holes that the spatiotemporal interpolation can deal with?
Citation: https://doi.org/10.5194/essd-2025-142-RC1 -
RC2: 'Comment on essd-2025-142', Anonymous Referee #2, 08 Jun 2025
reply
This paper presents a valuable contribution to the field of nighttime light remote sensing by addressing the limitations of the VNP46A2 dataset and producing a high-quality daily nighttime light (HDNTL) dataset for 653 cities. The proposed framework for error correction, interpolation, and validation demonstrates significant improvements in data accuracy and reliability. The results are well-supported by comparisons with other datasets and case studies.
- I suggest the authors should provide a more detailed explanation for choosing 3 times the standard deviation as the threshold for identifying NTL variation (𝑁𝑇𝐿𝑣𝑎𝑟𝑖𝑎𝑡𝑖𝑜𝑛). This threshold lacks a clear justification based on the data characteristics or statistical principles.
- The use of mean NTL radiance for angular effect correction may introduces some uncertainty. Maybe the authors can consider exploring methods such as fitting a model to the relationship between NTL pixels for each VZA. This could provide a more accurate and robust correction, particularly for complex urban environments.
- The paper only uses one flyover site to validate the effectiveness of spatial mismatch correction. It is recommended to include additional examples to strengthen the evidence of the correction’s generalizability.
- The sudden increase in RMSE for the temporal window size of 4 in Figure 7 requires further investigation and explanation.
- The paper mentions the “mutation signal recognition threshold” without providing a clear definition.
Citation: https://doi.org/10.5194/essd-2025-142-RC2 -
RC3: 'Comment on essd-2025-142', Anonymous Referee #3, 27 Jun 2025
reply
This paper utilizes an improved SFAC method to correct the VNP46A2 NTL product, specifically addressing the VZA effect and mismatches. These corrections enhance the stability of the NTL time series while preserving critical anomalies caused by events such as natural disasters or holidays. The overall structure of the manuscript is clear; however, it lacks explanation of some key procedures and a deeper discussion of results. Therefore, my recommendation is a major revision. My specific comments are as follows:
- On page 5, line 125, the phrase “Grow a square with... is less than 1” is unclear. What exactly does “less than 1” refer to in this context
- On page 5, line 127, the authors mention that in highly urbanized areas, overlapping convex hulls may emerge. To address this, a manual adjustment method is employed. However, the manuscript does not specify the criteria for this manual adjustment, nor does it describe the resulting delineation of urban areas after resolution.
- On page 6, line 149, the data used to validate the correction results is from Tianjin City, covering the period from January 25 to February 21, 2022. Clearly, relying on a single region and a one-month span is not robust. Is there a way to increase the volume of validation data? If acquiring more data from SDGSAT is not feasible, could alternative validation methods be considered?
- On page 8, line 182, the authors claim that “it is reasonable to assume that the annual lowest light intensity is close to the NTLfix.” However, the rationale behind this assumption is not provided. Why wouldn’t the annual mean light intensity be a better approximation to NTLfix?
- On page 9, line 204, the authors divide the VZA into 16 groups. How exactly was this division made? Was the 0–60° range evenly partitioned into 16 intervals? Although the grouping is said to be based on the 16-day revisit cycle, in practice, very similar VZA values can occur within the same cycle. For example, in a single cycle, if one day has a VZA of 54° and another day 58°, would these be grouped together?
- On page 9, line 227, the authors apply an inverse distance weighting to non-central pixels in a 3×3 window. However, in such a window, the distances to the center pixel are identical for all non-central pixels, resulting in identical weights. What then is the significance of applying this weight?
- On page 11, line 270, the authors select the Liuyang Firework Festival as a case study to test whether their method suppresses genuine NTL anomalies. However, aerosols from fireworks can significantly affect NTL intensity, especially during windless nights. Have the authors considered this potential confounding factor?
- In Figure 5, the selected flyover region seems unsuitable for validation purposes, as it is surrounded by many tall buildings within a 500-meter radius, which could introduce substantial VZA effects.
- In Figure 5, the IR values derived from the original VNP46A2 data and from the HDNTL product are almost identical. Does this imply that the proposed method is ineffective in this case? This may support the previous concern regarding the selection of this flyover site.
- In the Beirut port explosion case shown in Figure 9, why does the HDNTL product not show the NTL intensity on the day of the explosion as the lowest of the year? The authors should provide an explanation for this discrepancy.
Citation: https://doi.org/10.5194/essd-2025-142-RC3
Data sets
A high-quality daily nighttime light (HDNTL) dataset for global 600+ cities (2012–2024) Zixuan Pei et al. https://doi.org/10.5281/zenodo.14992989
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
314 | 94 | 15 | 423 | 14 | 10 |
- HTML: 314
- PDF: 94
- XML: 15
- Total: 423
- BibTeX: 14
- EndNote: 10
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1