Preprints
https://doi.org/10.5194/essd-2024-206
https://doi.org/10.5194/essd-2024-206
04 Jul 2024
 | 04 Jul 2024
Status: this preprint is currently under review for the journal ESSD.

A Sentinel-2 Machine Learning Dataset for Tree Species Classification in Germany

Maximilian Freudenberg, Sebastian Schnell, and Paul Magdon

Abstract. We present a machine learning dataset for tree species classification in Sentinel-2 satellite image time series of bottom of atmosphere reflectance. The dataset is based on the German national forest inventory of 2012, as well as analysis ready satellite imagery computed using the FORCE processing pipeline. From the national forest inventory data, we extracted the tree positions, filtered 387 775 trees in the upper canopy layer and automatically extracted the corresponding bottom of atmosphere reflectance time series from Sentinel-2 L2A images. These time series are labeled with the corresponding tree species, which allows pixel-wise classification tasks. Furthermore, we provide auxiliary information such as the approximate tree position, the year of possible disturbance events or the diameter at breast height. Temporally, the dataset spans the years from July 2015 to end of October 2022 with ca. 75.3 million data points for trees of 51 species and species groups, as well as 13.8 million observations for non-tree background. Spatially, it covers entire Germany. The dataset is available under following DOI (Freudenberg et al., 2024): https://doi.org/10.3220/DATA20240402122351-0

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Maximilian Freudenberg, Sebastian Schnell, and Paul Magdon

Status: open (until 21 Aug 2024)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Maximilian Freudenberg, Sebastian Schnell, and Paul Magdon

Data sets

Sentinel-2 machine learning dataset for tree species classification in Germany Maximilian Freudenberg, Sebastian Schenll, Paul Magdon https://doi.org/10.3220/DATA20240402122351-0

Maximilian Freudenberg, Sebastian Schnell, and Paul Magdon

Viewed

Total article views: 160 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
118 33 9 160 6 4
  • HTML: 118
  • PDF: 33
  • XML: 9
  • Total: 160
  • BibTeX: 6
  • EndNote: 4
Views and downloads (calculated since 04 Jul 2024)
Cumulative views and downloads (calculated since 04 Jul 2024)

Viewed (geographical distribution)

Total article views: 158 (including HTML, PDF, and XML) Thereof 158 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 15 Jul 2024
Download
Short summary
Classifying tree species in satellite images is an important task for environmental monitoring and forest management. Here we present a dataset containing Sentinel-2 satellite pixel time series of individual trees, intended for training machine learning models. The dataset was created by merging information from the German national forest inventory in 2012 with satellite data. It sparsely covers entire Germany for the years 2015 to 2022 and comprises 51 species and species groups.
Altmetrics