A high-quality daily nighttime light (HDNTL) dataset for global 600+ cities (2012–2024)

Pei, Zixuan; Zhu, Xiaolin; Hu, Yang; Chen, Jin; Tan, Xiaoyue

doi:https://doi.org/10.5194/essd-2025-142

Preprints

https://doi.org/10.5194/essd-2025-142

Preprints

07 May 2025

| 07 May 2025

Status: this preprint is currently under review for the journal ESSD.

A high-quality daily nighttime light (HDNTL) dataset for global 600+ cities (2012–2024)

Zixuan Pei, Xiaolin Zhu, Yang Hu, Jin Chen, and Xiaoyue Tan

Abstract. Nighttime light (NTL) data at daily scales presents an innovative foundation for monitoring human activities, offering vast potential across various research domains such as urban planning and management, disaster monitoring, and energy consumption. The VNP46A2 dataset, sourced from NPP/VIIRS, has been providing globally corrected daily NTL data since 2012. However, persistent challenges, such as fluctuations in daily NTL series due to spatial mismatch and angular effects, as well as missing data holes, have significantly impacted the accuracy and comprehensiveness of extracting daily NTL changes. To address these challenges, a dataset production framework focusing on error correction, interpolation, and validation was developed. This framework led to the creation of a high-quality daily NTL dataset from 2012 to 2024, named HDNTL, which specifically targets 653 cities with populations predictably exceeding one million in 2025. A comparative analysis with the VNP46A2 dataset revealed promising results in spatial mismatch correction for two sample areas – the airport and flyover (angular effect can be ignored). These areas exhibited reduced fluctuations in HDNTL time series and maintained or strengthened weekly periodicity, which reflects traffic flow dynamics. Furthermore, the correction of angular effects across various urban building landscapes demonstrated sound improvements, mitigating angular effects in different directions and reducing periodicity from the angular impacts. The spatiotemporal interpolation of missing data holes has high similarity with the reference data, as indicated by an R² of 0.98, and it increased the valid pixels of all cities by 15.12%. The HDNTL dataset exhibited enhanced consistency with high-resolution SDGSAT-1 data regarding the NTL change rate and alignment with ground truth data of power outages, showcasing superior performance in short-event detection. Overall, the HDNTL dataset effectively mitigates instability in daily series caused by spatial mismatch and angular effects observed in VNP46A2, improving data comparability across time and space dimensions. This dataset enhances the ability of the NTL to reflect the ground events, providing a more accurate reference for daily-scale nighttime light research. Additionally, the dataset production framework facilitates easy updates from future VNP46A2 products to HDNTL. The HDNTL is openly available at https://doi.org/10.5281/zenodo.14992989 (Pei et al., 2025).

Received: 10 Mar 2025 – Discussion started: 07 May 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Zixuan Pei, Xiaolin Zhu, Yang Hu, Jin Chen, and Xiaoyue Tan

Status: open (until 01 Aug 2025)

Post a comment Subscribe to comment alert

RC1:
'Comment on essd-2025-142', Anonymous Referee #1, 21 May 2025 reply
This study proposed a method to correct the angular effects, spatial mismatch, and small holes in daily NTL data and generated a wonderful product of high-quality daily NTL data covering more than 650 cities across the world. This a very valuable dataset, and the generation procedures and validation assessments are reasonable and solid. I think it should be published after a revision.
Comments:
In Line 97-98, the authors mentioned that the knowledge of the local building environment is important for identifying the sources of light. I am not so sure why the authors paid attention to this? It looks like there are no associations with the HDNTL generation.

In Line 174, what does the "fixed area" mean? An area with stable light emissions?

Adding a scale bar in Fig. 4 and 5 can help us tell if the current cases are solid or not. For example, in Fig. 5, if a 500x500m pixel is much larger than a flyover, the flyover’s surrounding light emissions could affect the validation.

In Fig. 10, what do (a) and (b) refer to?

I found that all pixels in the 2021-6-9 NTL data of Fuzhou, Fujian, China are 0, as I known Fuzhou is rainy that day. So I am wondering if the method proposed in this study can correct the daily NTL data on a cloudy or rainy day, especially during a continues cloudy/rainy days?

What is the maximum/optimal size of holes that the spatiotemporal interpolation can deal with?

Reply
Citation: https://doi.org/10.5194/essd-2025-142-RC1
RC2:
'Comment on essd-2025-142', Anonymous Referee #2, 08 Jun 2025 reply
This paper presents a valuable contribution to the field of nighttime light remote sensing by addressing the limitations of the VNP46A2 dataset and producing a high-quality daily nighttime light (HDNTL) dataset for 653 cities. The proposed framework for error correction, interpolation, and validation demonstrates significant improvements in data accuracy and reliability. The results are well-supported by comparisons with other datasets and case studies.
I suggest the authors should provide a more detailed explanation for choosing 3 times the standard deviation as the threshold for identifying NTL variation (𝑁𝑇𝐿𝑣𝑎𝑟𝑖𝑎𝑡𝑖𝑜𝑛). This threshold lacks a clear justification based on the data characteristics or statistical principles.

The use of mean NTL radiance for angular effect correction may introduces some uncertainty. Maybe the authors can consider exploring methods such as fitting a model to the relationship between NTL pixels for each VZA. This could provide a more accurate and robust correction, particularly for complex urban environments.

The paper only uses one flyover site to validate the effectiveness of spatial mismatch correction. It is recommended to include additional examples to strengthen the evidence of the correction’s generalizability.

The sudden increase in RMSE for the temporal window size of 4 in Figure 7 requires further investigation and explanation.

The paper mentions the “mutation signal recognition threshold” without providing a clear definition.

Reply
Citation: https://doi.org/10.5194/essd-2025-142-RC2
RC3:
'Comment on essd-2025-142', Anonymous Referee #3, 27 Jun 2025 reply
This paper utilizes an improved SFAC method to correct the VNP46A2 NTL product, specifically addressing the VZA effect and mismatches. These corrections enhance the stability of the NTL time series while preserving critical anomalies caused by events such as natural disasters or holidays. The overall structure of the manuscript is clear; however, it lacks explanation of some key procedures and a deeper discussion of results. Therefore, my recommendation is a major revision. My specific comments are as follows:
On page 5, line 125, the phrase “Grow a square with... is less than 1” is unclear. What exactly does “less than 1” refer to in this context

On page 5, line 127, the authors mention that in highly urbanized areas, overlapping convex hulls may emerge. To address this, a manual adjustment method is employed. However, the manuscript does not specify the criteria for this manual adjustment, nor does it describe the resulting delineation of urban areas after resolution.

On page 6, line 149, the data used to validate the correction results is from Tianjin City, covering the period from January 25 to February 21, 2022. Clearly, relying on a single region and a one-month span is not robust. Is there a way to increase the volume of validation data? If acquiring more data from SDGSAT is not feasible, could alternative validation methods be considered?

On page 8, line 182, the authors claim that “it is reasonable to assume that the annual lowest light intensity is close to the NTL_fix.” However, the rationale behind this assumption is not provided. Why wouldn’t the annual mean light intensity be a better approximation to NTL_fix?

On page 9, line 204, the authors divide the VZA into 16 groups. How exactly was this division made? Was the 0–60° range evenly partitioned into 16 intervals? Although the grouping is said to be based on the 16-day revisit cycle, in practice, very similar VZA values can occur within the same cycle. For example, in a single cycle, if one day has a VZA of 54° and another day 58°, would these be grouped together?

On page 9, line 227, the authors apply an inverse distance weighting to non-central pixels in a 3×3 window. However, in such a window, the distances to the center pixel are identical for all non-central pixels, resulting in identical weights. What then is the significance of applying this weight?

On page 11, line 270, the authors select the Liuyang Firework Festival as a case study to test whether their method suppresses genuine NTL anomalies. However, aerosols from fireworks can significantly affect NTL intensity, especially during windless nights. Have the authors considered this potential confounding factor?

In Figure 5, the selected flyover region seems unsuitable for validation purposes, as it is surrounded by many tall buildings within a 500-meter radius, which could introduce substantial VZA effects.

In Figure 5, the IR values derived from the original VNP46A2 data and from the HDNTL product are almost identical. Does this imply that the proposed method is ineffective in this case? This may support the previous concern regarding the selection of this flyover site.

In the Beirut port explosion case shown in Figure 9, why does the HDNTL product not show the NTL intensity on the day of the explosion as the lowest of the year? The authors should provide an explanation for this discrepancy.

Reply
Citation: https://doi.org/10.5194/essd-2025-142-RC3

Zixuan Pei, Xiaolin Zhu, Yang Hu, Jin Chen, and Xiaoyue Tan

Data sets

A high-quality daily nighttime light (HDNTL) dataset for global 600+ cities (2012–2024) Zixuan Pei et al. https://doi.org/10.5281/zenodo.14992989

Zixuan Pei, Xiaolin Zhu, Yang Hu, Jin Chen, and Xiaoyue Tan

Viewed

Total article views: 423 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
314	94	15	423	14	10

HTML: 314
PDF: 94
XML: 15
Total: 423
BibTeX: 14
EndNote: 10

Views and downloads (calculated since 07 May 2025)

Month	HTML	PDF	XML	Total
May 2025	197	57	8	262
Jun 2025	117	37	7	161

Cumulative views and downloads (calculated since 07 May 2025)

Month	HTML	PDF	XML	Total
May 2025	197	57	8	262
Jun 2025	117	37	7	161

Viewed (geographical distribution)

Total article views: 408 (including HTML, PDF, and XML) Thereof 408 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 01 Jul 2025

Short summary

Nighttime light (NTL) data aids urban planning and disaster monitoring. Our study introduces HDNTL, a daily NTL dataset for 653 major cities (population >1 M by 2025) from 2012–2024, enhancing NASA's VNP46A2 accuracy by correcting spatial mismatches, angular effects, and filling small missing holes via spatiotemporal interpolation. Validated against high-resolution NTL datasets and ground-truth records, HDNTL enables precise monitoring of nighttime human activities.


Total:	0
HTML:	0
PDF:	0
XML:	0