Preprints
https://doi.org/10.5194/essd-2024-111
https://doi.org/10.5194/essd-2024-111
15 May 2024
 | 15 May 2024
Status: this preprint is currently under review for the journal ESSD.

A 10 km daily-level ultraviolet radiation predicting dataset based on machine learning models in China from 2005 to 2020

Yichen Jiang, Su Shi, Xinyue Li, Chang Xu, Haidong Kan, Bo Hu, and Xia Meng

Abstract. Ultraviolet (UV) radiation is closely related to health, but limited measurements hindered further investigation of its health effects in China. Machine learning algorithm has been widely used in predicting environmental factors with high accuracy, but limited studies have done for UV radiation. This study aimed to develop UV radiation prediction model based on random forest method, and predict UV radiation at daily level and 10 km resolution in mainland China in 2005–2020. A random forest model was employed to predict UV radiation by integrating ground UV radiation measurements from monitoring stations and multiple predictors, such as UV radiation data from satellite. Missing data of satellite-based UV radiation was filled by three-day moving average method. The model's performance was evaluated through multiple cross-validation (CV) methods. The overall R2 (root mean square error, RMSE) between measured and predicted UV radiation from model development and model 10-fold CV was 0.97 (15.64 W m-2) and 0.83 (37.44 W m-2) at daily level, respectively. The model with OMI EDD performed higher predicting accuracy than the one without it. Based on predictions of UV radiation at daily level and 10 km spatial resolution and nearly 100 % spatiotemporal coverage, we found UV radiation increased by 4.20 % while PM2.5 levels decreased by 48.51 % and O3 levels rose by 22.70 % in 2013–2020, suggesting a potential correlation among these environmental factors. Uneven spatial distribution of UV radiation was found to be associated with factors such as latitude, elevation, meteorological factors and seasons. The eastern areas of China posed higher risk with both high population density and UV radiation intensity. Based on machine learning algorithm, this study generated a gridded dataset characterized by relatively high precision and extensive spatiotemporal coverage of UV radiation, which demonstrates the spatiotemporal variability of UV radiation levels in China and can facilitate health-related research in the future. This dataset is currently freely available at https://doi.org/10.5281/zenodo.10884591 (Jiang et al., 2024).

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Yichen Jiang, Su Shi, Xinyue Li, Chang Xu, Haidong Kan, Bo Hu, and Xia Meng

Status: open (until 17 Jul 2024)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on essd-2024-111', Anonymous Referee #1, 27 Jun 2024 reply
Yichen Jiang, Su Shi, Xinyue Li, Chang Xu, Haidong Kan, Bo Hu, and Xia Meng

Data sets

A database of 10 km Ultraviolet Radiation Product over mainland China: 2005-2020 Yichen Jiang, Su Shi, Xinyue Li, Chang Xu, Haidong Kan, Bo Hu, and Xia Meng https://doi.org/10.5281/zenodo.10884590

Yichen Jiang, Su Shi, Xinyue Li, Chang Xu, Haidong Kan, Bo Hu, and Xia Meng

Viewed

Total article views: 275 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
236 27 12 275 11 9
  • HTML: 236
  • PDF: 27
  • XML: 12
  • Total: 275
  • BibTeX: 11
  • EndNote: 9
Views and downloads (calculated since 15 May 2024)
Cumulative views and downloads (calculated since 15 May 2024)

Viewed (geographical distribution)

Total article views: 270 (including HTML, PDF, and XML) Thereof 270 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 28 Jun 2024
Download
Short summary
Limited UV measurements hindered further investigation of its health effects. This study used a machine learning algorithm to predict UV radiation at daily level and 10 km resolution with high accuracy in mainland China in 2005–2020. Then, uneven spatial distribution and population exposure risks as well as increased temporal trend of UV radiation were found in China. The long-term and high-quality UV dataset could further facilitate health-related research in the future.
Altmetrics