Articles | Volume 13, issue 6
https://doi.org/10.5194/essd-13-3013-2021
© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.
AQ-Bench: a benchmark dataset for machine learning on global air quality metrics
Clara Betancourt
Jülich Supercomputing Centre, Jülich Research Centre, Wilhelm-Johnen-Straße, 52425 Jülich, Germany
Timo Stomberg
Jülich Supercomputing Centre, Jülich Research Centre, Wilhelm-Johnen-Straße, 52425 Jülich, Germany
Institute of Geodesy and Geoinformation, University of Bonn, Nußallee 17, 53115 Bonn, Germany
Ribana Roscher
Institute of Geodesy and Geoinformation, University of Bonn, Nußallee 17, 53115 Bonn, Germany
Martin G. Schultz
CORRESPONDING AUTHOR
Jülich Supercomputing Centre, Jülich Research Centre, Wilhelm-Johnen-Straße, 52425 Jülich, Germany
Scarlet Stadtler
Jülich Supercomputing Centre, Jülich Research Centre, Wilhelm-Johnen-Straße, 52425 Jülich, Germany
Viewed
Total article views: 8,000 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 14 Jan 2021)
| HTML | PDF | XML | Total | BibTeX | EndNote |
|---|---|---|---|---|---|
| 5,914 | 1,925 | 161 | 8,000 | 181 | 187 |
Total article views: 6,003 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 24 Jun 2021)
| HTML | PDF | XML | Total | BibTeX | EndNote |
|---|---|---|---|---|---|
| 4,846 | 1,017 | 140 | 6,003 | 164 | 168 |
Total article views: 1,997 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 14 Jan 2021)
| HTML | PDF | XML | Total | BibTeX | EndNote |
|---|---|---|---|---|---|
| 1,068 | 908 | 21 | 1,997 | 17 | 19 |
Viewed (geographical distribution)
Total article views: 8,000 (including HTML, PDF, and XML)
Thereof 7,528 with geography defined
and 472 with unknown origin.
Total article views: 6,003 (including HTML, PDF, and XML)
Thereof 5,765 with geography defined
and 238 with unknown origin.
Total article views: 1,997 (including HTML, PDF, and XML)
Thereof 1,763 with geography defined
and 234 with unknown origin.
Cited
19 citations as recorded by Crossref.
- Challenges and Benchmark Datasets for Machine Learning in the Atmospheric Sciences: Definition, Status, and Outlook P. Dueben et al.
- Proper Weather Forecasting Internet of Things Sensor Framework with Machine Learning A. Turukmane & S. Pande
- Applications of Machine Learning and Artificial Intelligence in Tropospheric Ozone Research S. Hickman et al.
- Representing chemical history in ozone time-series predictions – a model experiment study building on the MLAir (v1.5) deep learning framework F. Kleinert et al.
- Enhancing the Prediction of Multiple Ozone Metrics Using Genetic Algorithm-Based Feature Selection for the Multi-Target Regression of the Environmental AQ-Bench Dataset N. Jailani & G. Mara
- Exploring the potential of machine learning for simulations of urban ozone variability N. Ojha et al.
- Augmenting the real-time rainfall forecast skills over Odisha using deep learning technique O. Sharma et al.
- Explainable Machine Learning Reveals Capabilities, Redundancy, and Limitations of a Geospatial Air Quality Benchmark Dataset S. Stadtler et al.
- Improving rainfall forecast at the district scale over the eastern Indian region using deep neural network D. Trivedi et al.
- Integrating geospatial indicators and machine learning for ecosystem health assessment: a case study of Sylhet Sadar, Bangladesh S. Lubna & M. Kabir
- Importance of ozone precursors information in modelling urban surface ozone variability using machine learning algorithm V. Balamurugan et al.
- A multi-task learning model for global soil moisture prediction based on adaptive weight allocation Y. Li et al.
- Global, high-resolution mapping of tropospheric ozone – explainable machine learning and impact of uncertainties C. Betancourt et al.
- Feature selection for global tropospheric ozone prediction based on the BO-XGBoost-RFE algorithm B. Zhang et al.
- Graph Machine Learning for Improved Imputation of Missing Tropospheric Ozone Data C. Betancourt et al.
- Interactions between atmospheric composition and climate change – progress in understanding and future opportunities from AerChemMIP, PDRMIP, and RFMIP S. Fiedler et al.
- Addressing the Coupled Optimization of Feature Selection and Hyperparameter Tuning Using a TPE-Driven XGBoost-RFE Framework N. Jailani & G. Mara
- Advancing air pollution forecasting: a review of physical, statistical, and machine learning methods A. Rawat et al.
- LandBench 1.0: A benchmark dataset and evaluation metrics for data-driven land surface variables prediction Q. Li et al.
Latest update: 18 May 2026
Short summary
With the AQ-Bench dataset, we contribute to shared data usage and machine learning methods in the field of environmental science. The dataset contains air quality data and metadata from more than 5500 air quality observation stations all over the world. It offers a low-threshold entry point to machine learning on a real-world environmental dataset. AQ-Bench thus provides a blueprint for environmental benchmark datasets.