Articles | Volume 14, issue 7
Data description paper
05 Jul 2022
Data description paper |  | 05 Jul 2022

Individual tree point clouds and tree measurements from multi-platform laser scanning in German forests

Hannah Weiser, Jannika Schäfer, Lukas Winiwarter, Nina Krašovec, Fabian E. Fassnacht, and Bernhard Höfle

Laser scanning from different acquisition platforms enables the collection of 3D point clouds from different perspectives and with varying resolutions. These point clouds allow us to retrieve detailed information on the individual tree and forest structure. We conducted airborne laser scanning (ALS), uncrewed aerial vehicle (UAV)-borne laser scanning (ULS) and terrestrial laser scanning (TLS) in two German mixed forests with species typical of central Europe. We provide the spatially overlapping, georeferenced point clouds for 12 forest plots. As a result of individual tree extraction, we furthermore present a comprehensive database of tree point clouds and corresponding tree metrics. Tree metrics were derived from the point clouds and, for half of the plots, also measured in the field. Our dataset may be used for the creation of 3D tree models for radiative transfer modeling or lidar simulation studies or to fit allometric equations between point cloud metrics and forest inventory variables. It can further serve as a benchmark dataset for different algorithms and machine learning tasks, in particular automated individual tree segmentation, tree species classification or forest inventory metric prediction. The dataset and supplementary metadata are available for download, hosted by the PANGAEA data publisher at (Weiser et al.2022a).

1 Introduction

Airborne and terrestrial laser scanning enables the collection of detailed 3D information on forest structure. This information can be used to estimate important tree and forest stand attributes including tree and stand height (Pearse et al.2019; Wang et al.2019), wood volume (Dassot et al.2012; Disney et al.2018), biomass (Disney et al.2018; Næsset et al.2011), canopy cover (Arumäe and Lang2018; Smith et al.2009), tree density (Pearse et al.2019), diameter at breast height (DBH) (Bruggisser et al.2020; Liu et al.2018) and basal area (Pearse et al.2019). Laser scanning data from airplanes and helicopters have been widely applied in the forest sector and are operationally used for forest inventories in several countries (Montaghi et al.2013; Scion2021; White et al.2017) using the area-based approach (Bouvier et al.2015; Holopainen et al.2014; Næsset2002). In the area-based approach, wall-to-wall laser scanning data are combined with information collected in field plots to estimate forest attributes across the complete area covered by the laser scanning survey. Besides the area-based approach, individual tree-based approaches in which first single trees are detected from the point clouds and then attributes are estimated for the individual trees have also received attention (Latifi et al.2015; Maltamo et al.2004). Terrestrial laser scanning data with stationary or mobile sensors can be used to obtain detailed information on individual trees which typically cannot be directly derived from airborne surveys. Corresponding attributes include DBH and tree stem taper information but also information on small-scale understory variation which may be important for characterizing local fuel conditions or habitat properties. Finally, laser scanning with uncrewed aerial vehicles (UAVs) represents a compromise between airborne and terrestrial laser scanning. UAV-borne laser scanning (ULS) allows for data collection over larger areas compared to TLS but cannot reach the fine spatial resolution of TLS data. Due to the lower flying height, ULS achieves higher point densities and a better penetration of the canopy than ALS but covers smaller areas due to constraints in flying time and speed.

Over the last years, notable progress has been made in both sensor systems and workflows to derive forest attributes from laser scanning data collected by various platforms (Calders et al.2020; Maltamo et al.2014; Morsdorf et al.2017; Roussel et al.2020; Wallace et al.2014). At the same time, there is still space for improvements, for example, when it comes to (1) understanding and comparing the potential of tree delineation algorithms or (2) deriving certain attributes such as tree species from laser scanning point clouds. Furthermore, (3) there are still open questions related to the influence of acquisition settings on the model performances in the area-based approach. For issues (1) and (2), obtaining reliable reference data in the field is often extremely challenging and time-intensive, and preparing openly available high-quality benchmark datasets may contribute to making further advancement in the research field. For issue (3), cost is often a limiting factor since in many cases it is impossible to acquire numerous ALS flights in a research project to examine the influence of acquisition parameters due to the high costs of the flights. In this case, a combination of synthetic forest stands and laser scanning simulators may be one way forward by enabling extensive sensitivity analyses (Disney et al.2010; Roberts et al.2020).

One key element for creating synthetic forest stands, enabling laser scanning simulations, is detailed and realistic 3D tree models, which can be reconstructed from 3D point clouds obtained with terrestrial, UAV-borne or airborne laser scanning. Virtual forest scenes can be built from such 3D tree models and used as input to complex radiative transfer models (e.g., DART; Gastellu-Etchegorry et al.2015) or light detection and ranging (lidar) simulators (e.g., HELIOS++; Winiwarter et al.2022). Such simulations enable an improved understanding of the relation between forests’ structural elements and electromagnetic radiation (Disney et al.2010; Weiser et al.2021; Widlowski et al.2015). This is particularly relevant for improving remote-sensing-assisted forest inventory approaches. Three-dimensional tree models derived from laser scans of real trees have also been used to develop structural tree allometries to, e.g., non-destructively estimate the above-ground biomass of tree individuals (Calders et al.2015). Further, 3D tree models can be used to parameterize ecophysiological models (Sinoquet et al.2001) to quantify shading effects and evapotranspiration properties of urban trees. Other applications of detailed tree models include their integration in 3D visualizations as, e.g., used in urban landscape planning projects or in computer games (Bournez et al.2017).

(Semi-)automated methods to extract tree models from TLS point clouds are readily available (e.g., SimpleTree/SimpleForest, Hackenberg et al.2015; TreeQSM, Raumonen et al.2013), but data collection and extraction of high-quality tree point clouds is still very time-consuming because it requires manual interaction and control. This generally limits the acquisition and processing of very large numbers of trees in single projects and thus underlines the importance of publishing such datasets as open data for shared usage.

The dataset presented here is one of the biggest open-access collections of multi-platform tree point clouds with matching inventory data of common tree species of central European forests. We present 1491 trees which were identified in point clouds acquired in 12 forest plots in southwest Germany. A total of 249 trees were extracted from all three types of point clouds, ALS, ULS and TLS. Another 1031 trees were extracted from both ALS and ULS point clouds. A total of 1168 trees were extracted from two different ULS datasets, acquired under leaf-on conditions and leaf-off conditions (Fig. 8). The maximum time period between laser scanning acquisitions under leaf-on conditions is 70 d. Single tree metrics, including species, height, DBH and crown metrics, were derived from the point clouds and, for half of the plots (1060 trees), also measured in the field. Thus, one or more point clouds and corresponding metrics are provided for each tree. In addition to the 3D coordinates, each point of the point cloud has attributes including reflectance, echo number, waveform deviation and time of recording. The dataset further exists of the larger area forest point clouds of the acquisitions and the flight trajectory or scan position information.

Figure 1 displays the workflow of the data acquisition and processing steps.

Figure 1Workflow for the generation of the dataset, starting with the laser scanning data acquisition and forest inventory measurements and resulting in single-tree point clouds and corresponding point-cloud-derived and field-measured metrics. This figure has been designed using resources from (last access: 3 August 2021; Smashicons, Freepik).

2 Methods

In this section, we introduce the study site, give an overview of the measurements taken in the field and document the lidar acquisitions from the three different platforms (airborne, UAV-borne and terrestrial) which were conducted in 2019 and 2020 under leaf-on and leaf-off canopy conditions. We describe how we extracted single-tree point clouds from the lidar datasets to (a) match them to the field measurements and (b) derive individual tree metrics directly from the point clouds.

2.1 Study site

The two study sites, the Bretten municipal forest (490036′′ N, 84135′′ E) and the Hardtwald forest in Karlsruhe-Waldstadt (490204′′ N, 82540′′ E), are located in a region of temperate forests in the federal state of Baden-Württemberg, Germany. Data acquisitions cover three forest plots in the Karlsruhe forest and nine in the Bretten municipal forest, totaling 12 forest plots (Fig. 2). The Bretten site is characterized by the hilly landscape of the Kraichgau, and the Karlsruhe site is on flat terrain of the Upper Rhine Plain. The main tree species in the managed forest stands are Scots pine (Pinus sylvestris L.), European beech (Fagus sylvatica L.), common oak (Quercus robur L.), sessile oak (Quercus petraea (Matt.) Liebl.), red oak (Quercus rubra L.), European hornbeam (Carpinus betulus L.), Douglas fir (Pseudotsuga menziesii (Mirb.) Franco) and Norway spruce (Picea abies (L.) H. Karst). Most stands have multiple layers, a mixed species composition and a dense forest cover. Karlsruhe plots are dominated by Scots pine, red oak and beech. Bretten plots are more diverse and characterized by beech, spruce, Douglas fir, oaks (mostly sessile oak) and European hornbeam. We provide a summary of forest plot characteristics in Table  in the Appendix, including the number of trees, basal area (m2 ha−1), the percentage of dead trees and the median tree height (m), also separated by tree species.

Figure 2Map of the forest plots in Karlsruhe (left) and Bretten (right) in the federal state of Baden-Württemberg, Germany. The extent of the extracted ALS data is outlined in black, and areas for which field measurements were conducted are outlined in purple (“Full inventory”). All plots except BR04 are covered by the ULS acquisitions. TLS was performed in selected locations within the plots. Coordinates are given in the projected coordinate system ETRS89/UTM zone 32N (EPSG:25832). Map data: © OpenStreetMap contributors 2021. Distributed under the Open Data Commons Open Database License (ODbL) v1.0. Administrative boundaries: © EuroGeographics (2021).

2.2 Laser scanning data acquisition and co-registration of the multi-modal datasets

2.2.1 Airborne laser scanning (ALS)

MILAN Geoservice GmbH was commissioned by the Institute of Geography and Geoecology (IFGG) of Karlsruhe Institute of Technology (KIT) to carry out the ALS acquisitions. The flights were conducted on 5 July 2019 under leaf-on conditions with a RIEGL VQ-780i (RIEGL Laser Measurement Systems2019) sensor mounted on an aircraft of type Cessna C207. Data were acquired in 38 parallel flight strips, plus two orthogonal strips, in Karlsruhe, covering an area of around 65 km2. In Bretten, data were acquired in 25 parallel flight strips, plus two orthogonal strips, covering an area of around 32 km2. Detailed sensor and acquisition parameters are listed in Table 1. The data were georeferenced and transformed into the coordinate reference system ETRS89/UTM zone 32 (EPSG:25832) with ellipsoidal height (GRS80). Full-waveform data are available from the authors on reasonable request.

Table 1Airborne laser scanning (ALS), UAV-borne laser scanning (ULS) and terrestrial laser scanning (TLS) sensor and acquisition parameters. Sensor specifications are according to the data sheets (RIEGL Laser Measurement Systems2017, 2019, 2020c).

Download Print Version | Download XLSX

For quality control, seven small built-up areas surrounding the area of interest were recorded by the 3DGeo Research Group of Heidelberg University using ULS, for which georeferencing was carried out as described in the next section.

Flight strips were merged and the point clouds were clipped to the 12 forest plots, using a generously sized buffer to cover the other datasets (ULS, TLS and field measurements). This resulted in extracted ALS point clouds of 300 m× 300 m, 200 m× 200 m or 140 m× 140 m extent (Fig. 2), depending on the spatial coverage of the TLS and ULS datasets. Pulse density grids with a cell size of 1 m were computed for each point cloud. Pulse density, averaged over all 12 plot point clouds, was 72.5 pts m−2 (standard deviation: 31.0 pts m−2).

2.2.2 UAV-borne laser scanning (ULS)

UAV-borne laser scanning point clouds were acquired with a RIEGL miniVUX-1UAV (RIEGL Laser Measurement Systems2020c) sensor mounted on a DJI Matrice 600 Pro hexacopter (DJI2018). Flights were conducted in August and September 2019, in December 2019, and in March and April 2020 to obtain data of trees under both leaf-on and leaf-off conditions. Table 1 shows the general settings used in all 21 flights. For plots KA10 and KA11 in Karlsruhe, only leaf-on data were acquired in summer. Two overlapping double grid patterns, offset by 45 in orientation, were employed for all flights (Fig. 3). In the 2019 flights, adaptive banked turns were flown, while in the 2020 flights, the UAV was operated in stop and turn mode, i.e., stopped in each corner to turn without inclining about the roll axis. Flying altitude, speed, scan frequency, and flight line distance varied across the different flight campaigns and are listed in Table 2. Note that for the 2019 flights of plots BR03 and BR08, the two parts of the double cross were flown at different speeds.

Figure 3UAV-borne laser scanning (ULS) and airborne laser scanning (ALS) flight trajectories of plot KA10. Background: orthophoto of the forest plot, acquired on 6 September 2019.


Table 2UAV-borne laser scanning acquisition parameters per flight and resulting approximate pulse densities, quantified on extracted point clouds of 1 ha, thereby excluding points on the flight line edges. For altitude above ground, a mean value is given, with the standard deviation in parentheses.

Download Print Version | Download XLSX

ULS data were processed using the software RiPROCESS (RIEGL Laser Measurement Systems2020a, version 1.4.2). GNSS reference measurements were retrieved as a virtual base station from SAPOS Baden-Württemberg (, last access: 6 May 2020) and were used to post-process the trajectory using Applanix PosPac MMS (Applanix Corporation2018, version 8.3). The trajectory and the point cloud data from the scanner were imported to RiPROCESS. At import, the point cloud data were filtered for reflectance between −22 and +10dB, scan angle ± 90 off nadir, and ranges between 5 and 150 m to filter outliers from the data. The point cloud data were converted from the scanner (SDCImport) and then transformed from local polar to world coordinates (ETRS89/UTM Zone 32N) using the flight trajectory (RiWORLD). For the subsequent strip adjustment, only points with a scan angle ± 45 off nadir were considered because larger off-nadir angles constitute almost only within-canopy returns, which we presumed to be less accurate and/or less suitable for strip adjustment. Finally, each flight strip (with the full scan angle range of ± 90) was exported in LAZ format including intensity information (signal amplitude and reflectance) and waveform information (pulse shape deviation). Using LAStools (Rapidlasso GmbH2020, version 200509), the single flight strips were merged, and the information on the flight lines was encoded in the LAS field “Point Source ID”.

Pulse density grids were computed for extracted 1 ha point clouds of the 12 plots (i.e., excluding the edges of the flight lines) with a cell size of 1m. The mean pulse densities derived for these grids are shown in Table 2.

2.2.3 Terrestrial laser scanning (TLS)

TLS acquisitions were performed at selected spots within the 12 forest plots. Our focus was on the most common species in our study site. In total, we conducted 31 TLS surveys on 20 d between June and September 2019 under leaf-on conditions. In each survey, a group of selected trees was scanned from five to eight scan positions (SPs), regularly distributed around the target trees. Depending on the size of the scan area, the terrain and the forest density, the number and the exact positions of the SPs were adjusted. The scans were acquired with a RIEGL VZ-400 terrestrial laser scanner, which has a wavelength of 1550 nm and a beam divergence of 0.3 mrad (full angle, measured at the 1/e2 points; RIEGL Laser Measurement Systems2017, Table 1). Scans were performed with a pulse repetition frequency (PRF) of 300 kHz and vertical and horizontal angle increments of 0.017 (0.3 mrad). This results in 3 mm point spacing at 10m range for a single scan and ensures non-overlapping footprints. Only in three acquisitions on 3 and 4 June 2019 was the vertical resolution erroneously set to 0.099 (1.73 mrad). Each scan had a vertical field of view of 100 (−40 to +60 from the horizontal plane) and a horizontal field of view of 60 to 100 facing the target trees. If the distance between scanner and target trees was low in relation to the tree height, we performed an additional tilted scan at respective SPs to increase the vertical extent of the scan. Five cylindrical reflective targets mounted on tripods were positioned in the area to be scanned. They serve as tie points in the tie-point-based co-registration of multiple scans. Further circular reflectors were pinned onto trees of interest to later facilitate their identification in the point cloud.

For each campaign, RTK GNSS measurements from one scan position, referred to as the main stable position, and one tie point were used for initial georeferencing of the respective scan. Remaining scans were registered to the main stable position in the software RiSCAN PRO (RIEGL Laser Measurement Systems2020b, versions 2.8.0, 2.8.2, 2.9.0) using the cylindrical and circular reflective targets in the scene. After the tie-point-based registration, a fine alignment was performed using the Multi Station Adjustment (MSA) function of RiSCAN PRO, which uses a variant of the iterative closest point (ICP) algorithm based on planar areas in the point cloud. Because of the weak GNSS signal below the leaf canopy, the positional measurements taken in the forest only allowed for coarse georeferencing of the TLS point clouds. Therefore, TLS point clouds were registered to the ULS point clouds by tree matching and subsequent ICP-alignment. Because stems are sampled better in the leaf-off point clouds due to the higher penetrability of the defoliated crowns, we used the ULS leaf-off point clouds for stem detection where available. TLS point clouds were downsampled for the stem detection using the RiSCAN PRO octree filter with a voxel size of 0.05 m. We calculated normal vectors, linearity and 3D point densities in corresponding TLS and ULS point clouds using OPALS (Pfeifer et al.2014, version 2.3.1). By filtering for these three attributes, stem points were extracted and then segmented. To derive the final stem positions, each segment was projected to the horizontal plane and the center of gravity was computed. For each pair of overlapping point clouds, TLS-derived tree positions were manually matched to ULS-derived tree positions, and the resulting rigid transformation was applied to the TLS point cloud. The height offset was estimated by selecting corresponding points in a vertical profile in both point clouds. After correcting the height offset, a fine alignment between TLS and ULS point clouds was performed with the ICP implementation of OPALS. A visual assessment of the alignment between ALS, ULS and TLS is presented in Fig. 4.

Figure 4Point cloud visualizations showing the different point density and coverage for each platform. (a) Section of point clouds of a beech tree in plot BR05. The section has a depth of 1m. (b) Point clouds of oaks in plot BR01. The trees are 16 to 18m tall. Point clouds are labeled by their tree ID in the dataset.


2.3 Tree point cloud extraction

We used different procedures to extract single-tree point clouds from the laser scanning data (Fig. 5). The full TLS point clouds were automatically segmented into individual tree point clouds (Sect. 2.3.1). For the trees of interest in the center of the acquired scene, the segmentation was improved by manual editing. ULS and ALS tree point clouds were delineated manually in the point clouds. Because our multi-platform laser scanning point clouds are georeferenced and spatially overlapping, it was possible to extract trees from one point cloud using tree point clouds already extracted from another platform or acquisition. This way, we only had to delineate each tree in one of the overlapping point clouds. Table 5 and Fig. 8 give an overview of the number of tree point clouds that are available for each data source, i.e., for each platform and canopy condition.

Figure 5The different ways of single tree extraction. TLS point clouds were extracted with a combination of automatic segmentation and manual correction. ULS and ALS point clouds were partly segmented manually and partly extracted using a higher-resolution point cloud from another platform (i.e., TLS and ULS).


2.3.1 Automatic tree segmentation from TLS point clouds and manual post-processing

A total of 264 trees were extracted from the TLS point clouds. We first automatically segmented each TLS plot and then refined the segmentation result for our selected trees by manually editing the point clouds.

Automatic segmentation was performed with Computree software (Computree Group2022, version 5.0.054b). The workflow consists of an Euclidean clustering approach combined with a competitive Dijkstra algorithm and relies largely on the SimpleTree plugin (Hackenberg2017, Beta version 4.33.06). After importing the point cloud of the forest scene, a digital terrain model (DTM) is computed. Next, a slice of the point cloud is extracted between 1.0 and 1.6 m above the DTM. The sliced point cloud is spatially clustered to derive one seed cluster per tree stem. Each point in a stem seed is assigned a cluster ID and an initial distance of 0. A competitive Dijkstra algorithm is run, where each vegetation point is connected to one of the clusters and receives the according cluster ID. A graph is built from the connected points. If there are multiple possible paths between a seed point and a target point, the shortest path is chosen.

The result of this automatic approach is error-prone, i.e., Dijkstra's algorithm cannot guarantee the assignment of each point to the correct tree cluster. This is due to overlapping tree crowns and occlusions in the crown. Furthermore, errors in the clustering of stems seeds can lead to false labeling of points.

We applied a manual correction procedure. Selected tree point clouds were compared to surrounding segmented trees to identify erroneous or missing points and correct them manually. This was done in two stages by independent editors using the interactive segmentation tool of CloudCompare (CloudCompare2019, version 2.10.2). Human experts can take into account the morphology of the crowns in ways which are barely feasible automatically. A combined approach of automatic segmentation and manual correction is therefore a good solution to precisely delineate trees.

2.3.2 Manual tree extraction from ALS and ULS point clouds

From ALS and ULS point clouds, we delineated trees manually in the point cloud using the interactive segmentation tool of CloudCompare, or we used already manually delineated trees as templates for automatic extraction (Sect. 2.3.3).

2.3.3 Automatic tree extraction from ALS and ULS point clouds using extracted template tree point clouds

Because our laser scanning point clouds are spatially overlapping and co-registered, we were able to use tree point clouds already extracted from one dataset to query points from another dataset. For each point in the source tree point cloud, we queried all nearest neighbors within a given search radius in the larger area target point cloud to extract the same tree in the target point cloud. This was done with a k-nearest-neighbor (kNN) search using a k-d tree. We ensured that the source point clouds were of equal resolution or higher resolution than the target point clouds to avoid extrapolation. For instance, we extracted ULS point clouds using TLS source point clouds but not using ALS source point clouds because this could lead to missing points if, e.g., the stem was not completely captured in the ALS point cloud. When using TLS point clouds as source point clouds, we used search radii of 0.3, 0.5 and 1.0 m in the nearest-neighbor search. We then visually examined the extracted ALS and ULS point clouds and chose the search radius achieving the best trade-off between errors of omission and errors of commission. For the extraction using ULS template point clouds, we always used a search radius of 1.0 m.

2.3.4 Tree positions

We define the tree position as the intersection of the tree stem center line with the terrain. To derive the tree positions, we first computed digital terrain models (DTMs) from the ALS point clouds. We then cut a slice through the bottom of the tree point cloud, preferably the TLS or ULS point clouds, if available. We determined the xy position of the stem as the mean of the x and y coordinates of all points in the slice. At this position, we queried the elevation of the DTM to retrieve the matching z value. For quality control, the derived positions were visually investigated. In some cases, they had to be corrected either because of DTM errors or because the point cloud did not have enough stem points. If not enough stem points were present to determine the tree stem center at ground level, the xy positions of the lowest visible stem point were used.

2.4 Tree measurements

Tree measurements were recorded in the field and/or derived from the point clouds. Table 3 gives an overview of the metrics and the data source they were derived from.

Table 3Tree measurements and the source(s) they were recorded or computed from. Tree height was only measured from TLS point clouds for understory trees, for which tree height measurements from ALS or ULS would not be reliable. Tree height was only recorded in FM for smaller trees, where the top of the crown was visible from the ground (Sect. 2.4.1). ALS is airborne laser scanning, ULS is UAV-borne laser scanning, TLS is terrestrial laser scanning and FM is field measurements. All height measurements refer to height above the ground level at the stem position. For each tree, there can be multiple values for one tree metric if they were derived from different sources.

Download Print Version | Download XLSX

2.4.1 Field measurements

Field measurements were conducted in six plots of 1 ha (Fig. 2): BR01, BR03 and BR05 in Bretten and KA09, KA10 and KA11 in Karlsruhe. For each tree, the diameter at breast height (DBH) was measured with a measuring tape at 1.3 m above ground. To determine the crown diameter (CD), the diameter of the estimated dripline (i.e., the line right under the outer circumference of the crown) was measured in two orthogonal directions using a Haglöf Vertex-IV hypsometer. Mean CD was then calculated as the mean of these two distance measures. We define crown base height (CBH) as the height of the lowest branch with a minimum length of 1 m and crown base height green (CBH green) as the height of the lowest green (living) branch with a minimum length of 1 m. We selected the branches fulfilling these conditions by visual estimation and then measured their heights above ground using the hypsometer. Since measuring tree height with the hypsometer is difficult in dense forest stands where high tree tops are barely visible, tree height was only measured for about 10 % of the trees. These were mostly understory trees, where the crown was in direct view. As additional attributes, tree species and state (alive or dead) were recorded. For trees outside of the six field measurement plots, only species and state were determined. Field measurements and single-tree point clouds were matched based on extracted tree positions, the distinct crown shape and branch architecture, and the local neighborhood of the trees. For this, additional field surveys were conducted, in which 3D views of the laser scanning point clouds were used to identify the corresponding trees in the field.

2.4.2 Tree metrics derived from point clouds

In addition to field measurements, tree metrics were estimated from the laser scanning point clouds. For the computation of the metrics, the height values of each tree point cloud were height-normalized by subtracting the z coordinate of the tree position (representing the terrain height at the location of the stem) from all z coordinates in the point cloud.

DBH was estimated only from TLS tree point clouds. Due to the terrestrial view point and close range, many returns are recorded from the stems, allowing for accurate DBH estimation. A slice of the tree point cloud was extracted at a height between 1.28 and 1.32 m above ground. In case the stem was partly occluded, a thicker slice was extracted or the vertical position of the slice was adapted to achieve better results. We projected the slice onto the xy plane and fit a circle to the points using the random sample consensus (RANSAC) algorithm to derive the diameter using 1000 random samples. For leaning trees, errors may arise from the projection of points to the xy plane instead of a plane perpendicular to the stem. However, since most of our TLS trees were upright, this error is negligible. Some tree stems have an elliptical rather than a circular stem shape. In these cases (44 of 264 TLS point clouds), an ellipse was fitted to the projected points of the stem slice using least squares (as implemented by Nicky van Foreest:, last access: 12 July 2020). The DBH was then determined as the arithmetic mean of the major and the minor axes of the ellipse. For all trees, the quality of the fit was controlled visually. All figures of the circle/ellipse fitting are included in the Supplement together with a list of trees for which (a) a different slice height was used, (b) the visual assessment suggests low accuracy and (c) TLS-derived DBH values deviate more than 10 % from the values measured in the field.

Tree height was derived as the z coordinate of the highest tree point, normalized by the elevation of the tree position, i.e., the base of the tree.

CBH was determined by dividing the point cloud into height sections of 0.1 m and calculating the maximum xy distance between any two points in the section. Starting from the bottom, we iterate over the height sections. Once the maximum horizontal point distance exceeded a threshold of DBH + 1 m (if no DBH measurement was available, a default of 0.5 m was used), the center of the height section is defined as the CBH (Fig. 6) and the iteration is terminated. Because of the fixed section height, CBH estimates have decimeter resolution.

The tree crown is defined by all points above the CBH. Crown projection area (CPA) was calculated in two different ways:

  • a.

    as the area of the 2D (xy) convex hull of all crown points using the Quickhull algorithm (Barber et al.1996)

  • b.

    as the area of the 2D (xy) concave hull of all crown points using the k-nearest-neighbor approach by Moreira and Yasmina Santos (2007) as implemented by Craig (2017).

Lastly, mean CD was computed as the mean of the largest diameter and the largest perpendicular diameter of the concave hull (Fig. 6).

Figure 6Overview of single tree measurements (a) recorded in the field and (b) derived from the point clouds. Numbers correspond to Table 3. Icon on the left (“man looking up”) by DonBLC, (last access: 3 August 2021). Picture of diameter tape from (last access: 3 August 2021).

3 Results

The data records are split into two parts:

  • a.

    the full TLS and ULS point clouds and extracted ALS subsets covering the ULS, TLS and field measurement plots (see Fig. 2)

  • b.

    the extracted single-tree point clouds and respective measured and point-cloud-derived tree metrics.

The folder structure is visualized in Fig. 7.

Figure 7Folder structure for a single forest plot. PlotID is the ID of the forest plot, e.g., BR01; SpeciesID is the ID of the tree species, e.g., AcePse; see Table 4; TreeID is the unique (in combination with PlotID and SpeciesID) identifier for the tree; YYYY-MM-DD is the day of acquisition; qq is the quality indicator, e.g., q2; on/off is the canopy condition; and SSSm is the side length of the ALS forest plot in meters, e.g., 300 m. The ALS folder contains only one point cloud per plot. Multiple point clouds may be within the ULS folder due to different acquisition times and in the TLS folder due to different acquisition times and locations (in case of several acquisitions in the same plot on the same day, a suffix is added to distinguish the acquisitions, i.e., “_A”, “_B”, etc.). The single_trees folder always contains several subfolders, one for each tree.


3.1 LAS point format

All point clouds are stored in ASPRS LAS 1.4 (ASPRS2013) or ASPRS LAS 1.2 format (ASPRS2008), compressed (lossless) to LAZ. The field “Point source ID” holds the IDs of the flight lines or scan positions that the points originated from. All LAZ files contain the extra bytes amplitude [dB], reflectance [dB] and (pulse shape) deviation [ ] as exported by RIEGL systems. The amplitude is calibrated as the ratio of the optical power of the reflected light to the detection threshold of the instrument and depends on the distance of the target to the scanner (Pfennigbauer and Ullrich2010). The reflectance, however, is distance-independent and determined by the properties of the target that was hit by the ray. Reflectance is given as the ratio of the amplitude of the hit target to the amplitude of the target the instrument was calibrated with, i.e., a diffuse white target (Pfennigbauer and Ullrich2010). The pulse shape deviation indicates the degree to which the shape of the received pulse deviates from a device-specific reference pulse shape.

3.2 Full forest plot or acquisition records

In order to also capture the neighborhood context of the single tree, understory and terrain information, we provide the larger-area georeferenced point clouds of each platform. These include 31 TLS point clouds stored in LAZ files. For each acquisition, we provide a tab-separated .txt file containing the coordinates of the scan positions. A total of 20 LAZ files are available containing the ULS point clouds acquired under leaf-on and leaf-off conditions as summarized in Table 2. Note that the ULS leaf-on data for plots BR03 and BR08 are included in one single LAZ file, and no ULS data are available for plot BR04. For each acquisition, the trajectory is provided as a tab-separated .txt file, containing continuous records of the location and orientation of the sensor in the air. The “Time [s]” in the trajectory files corresponds with the “GPSTime” in the respective LAZ point cloud files and represents seconds since beginning of the respective GPS week (ASPRS2013). Finally, we provide 12 LAZ files with extracted ALS point clouds, one for each plot (Fig. 2). ALS point clouds of plots BR01, BR03, BR05, KA09, KA10 and KA11 are 300 m× 300 m in size and encompass the 100 m× 100 m plots in which field measurements were recorded. ALS point clouds of plots BR02 and BR06 are 200 m× 200 m and ALS point clouds of BR04, BR07, BR08 and SP02 are 140 m× 140 m in size. For these six plots (BR02, BR04, BR06, BR07, BR08 and SP92), no field measurements are available. We provide the trajectory points for the respective parts of the flight strips covering the area in tab-separated .txt files. All ALS point clouds spatially overlap with the TLS and ULS acquisitions conducted in the respective plots.

All point clouds and the coordinates of the scan positions and trajectories are in the same coordinate reference system: ETRS89/UTM zone 32N; EPSG:25832; ellipsoidal height, GRS80.

3.3 Single tree point clouds and metrics

Point clouds and point-cloud-derived metrics are provided for 1491 individual trees of in total 22 different species. An overview of tree species and the respective number of trees is given in Table 4. Field-measured metrics are available for 1060 of the trees. Tree point clouds acquired from all three platforms, i.e., ALS, TLS and ULS (leaf-on), are provided for 249 trees. A total of 1168 tree point clouds were extracted both from ULS leaf-on and ULS leaf-off data. A summary of all available data types per tree is given in Table 5 and Fig. 8. Figure 4 shows the different characteristics of the point clouds acquired from the different platforms and in the different canopy conditions. When comparing the TLS leaf-on point clouds with the ULS leaf-off point clouds, it is clearly visible that the branches hang down more in the summer when they are full of leaves than in the fall or winter when they are free of foliage.

Table 4Number of trees of each species in the dataset.

Download Print Version | Download XLSX

Table 5Number of trees per plot and data source. Note that most trees were measured from different platforms and at different times. ULS leaf-off data are available for 1173 trees. Of these, 133 trees were measured in autumn 2019, 537 trees were measured in spring 2020 and 503 trees were measured both in autumn 2019 and spring 2020.

Download Print Version | Download XLSX

Figure 8Venn diagrams showing the number of trees covered by the different acquisitions.


There are multiple reasons for the different available data types per tree:

  • TLS acquisitions did not cover the entire ALS/ULS acquisitions but only selected locations, so only 246 TLS point clouds of trees are available.

  • No ULS acquisition was performed in plot BR04.

  • ALS point clouds of understory trees with very few points were discarded. This concerned only silver firs in plot SP02 (AbiAlb_SP02_01 to AbiAlb_SP02_05), which had between 11 and 247 ALS points.

  • Some trees were only extracted manually from ALS point clouds, and these were not used for automatic extraction due to the lower resolution compared to ULS and TLS point clouds, which would cause information loss (e.g., missing parts of the stem).

Tree properties are provided in two tab-separated .txt files:

  • General tree characteristics such as the species and the tree position in geographic coordinates and in UTM coordinates are listed in a tab-separated .txt file with suffix “_general”.

  • Tree metrics are provided in a tab-separated .txt file with suffix “_metrics”. There is one row for each measurement source (e.g., field inventory measurement (FI) and ALS) and one column for each tree metric.

We furthermore provide all tree metadata in GeoJSON files, including geometry (position), properties (field measurements and point cloud metrics) and data (point cloud data filenames and metadata). These files can be loaded into common GIS software to visualize the tree positions and access the data.

4 Quality assessment

In the following, we describe how we assessed different aspects of data quality for our dataset. Table 6 summarizes all quality indicators related to point cloud alignment and georeferencing.

4.1 Airborne and UAV-borne laser scanning (ALS and ULS)

4.1.1 Positional accuracy

Positional accuracy of the ULS data was determined using roof-shaped wooden targets of 1 m length and 0.6 m width (for one side of the roof). The two planes of the roofs were recorded with four points each using a Leica Viva GS10/GS15 RTK GNSS unit tied into the SAPOS correction network (, last access: 24 June 2022). A visual inspection of the wooden target point clouds and the GNSS reference measurements showed good alignment, with differences in the range, that was expected due to the limited scanner and RTK accuracies (Fig. 9).

Figure 9Roof-shaped wooden targets used for assessing the positional accuracy of the ULS point cloud. (a) Setup in the field. The two planes of each roof are 1 m by 0.6 m in size. The red squares in both images mark the points where GNSS measurements are taken. (b) View in a point cloud viewer. The ULS point cloud is colored by z coordinate, and the GNSS measurements are shown as large red squares. The georeferencing quality is considered good if the ULS points on the roofs agree closely with the GNSS reference measurements, as shown in this example.

Absolute altitude accuracy of ALS data was determined by MILAN Geoservices GmbH using reference tiles recorded with ULS by the 3DGeo Research Group of Heidelberg University. The height of selected reference points was compared to the median of ALS points within a radius of 0.2 m (Bretten) and 0.3 m (Karlsruhe). The mean altitude differences (standard deviations) were −0.003m (±0.023m), 0.006 m (±0.016m), −0.016m (±0.026m), −0.028m and (±0.029m) for the four tiles in Bretten and −0.016m (±0.017m) for the tile in Karlsruhe. The quality of these reference tiles was, in turn, ensured by the same method as for the ULS data in general; see above.

Positional (horizontal) accuracy of the ALS data was controlled by MILAN Geoservice GmbH using building edges (i.e., stable surfaces). A building layout derived from TLS data, which was acquired by the 3DGeo Research Group and georeferenced with RTK GNSS, was overlaid with the building layout derived by digitizing the classified ALS point cloud to investigate the positional accuracy.

A further independent assessment of positional and altitude accuracy was conducted by the 3DGeo Research Group Heidelberg by comparing points sampled by ULS and ALS on roofs and other stable surfaces in control areas. Figures of the visual assessment are included in Appendix B in the metadata document published along with the dataset on the data repository PANGAEA (, Weiser et al.2022b). To obtain quantitative measures, we computed cloud-to-cloud distances between these ALS and ULS point clouds of the stable areas (six in Karlsruhe and three in Bretten). Resulting mean distances were between 0.03 and 0.06 m, with standard deviations between 0.01 and 0.023 m. This confirmed the high accuracy of the alignment of the datasets.

4.1.2 Strip alignment

The internal alignment of ALS and ULS flight strips was investigated by the 3DGeo Research Group by quantifying the height differences between the flight strips with OPALS (version 2.3.2). For ALS, strip differences were quantified for the extracted plot point clouds (with the extents as shown in Fig. 2). The ULS point clouds were cropped to the 1 ha plots for the assessment of the strip alignment to exclude points at the flight line edges.

For both datasets, single strip point clouds were first filtered for last returns. Second, grids of the standard deviation of the interpolated height (σZ) and of the distances between grid points and the center of gravity of data points (eccentricity) were computed to derive a mask of areas suited for the quality check. For the quantification of strip differences, areas with σZ 0.1 m and eccentricity  0.08 m were masked. This concerns most areas with high vegetation, where strip differences cannot be quantified reliably.

For the ALS point clouds, median absolute strip differences were below 5 mm for 10 of 12 plots. The maximum median value was 11 mm. σ(MAD) was between 17 and 51 mm1. For ULS leaf-on point clouds, median absolute differences were up to 14 mm (mean: up to 55 mm). For ULS leaf-off point clouds, median absolute strip differences were only up to 2 mm (mean: below 20 mm for 10 acquisitions, 51 and 62 mm for the remaining two acquisitions), suggesting that the large values in the leaf-on differences are a remaining influence from vegetation. Histograms and maps of strip differences for ALS and ULS data are included in Appendix A in the metadata document published along with the dataset on the data repository PANGAEA (Weiser et al.2022b).

4.2 Terrestrial laser scanning (TLS)

4.2.1 Positional accuracy and alignment to ULS point clouds

TLS point clouds were georeferenced by coarsely registering them to the ULS data via tree stem matching, followed by a fine alignment with the ICP algorithm. Before running the ICP, we created downsampled TLS point clouds with the octree filter in RiSCAN PRO with voxel sizes of 0.05 m. For each voxel of the octree, a new point is created by taking the center of gravity for all the points within each voxel. We performed an ICP adjustment with five iterations, keeping the ULS point cloud fixed while transforming the TLS point cloud. With these settings, we achieved mean point-to-plane distances of less than 2 mm, with standard deviations between 120 and 140 mm.

For 13 of the 31 point clouds, we still observed significant shifts in a visual assessment and repeated the ICP with revised settings: we used the ULS point cloud as moving, preferably the one acquired under leaf-off canopy condition (if available) due to the better representation of the stems, and the TLS point clouds as fixed point clouds. This means that the subset of points for which correspondences are established and to which planes are fitted to subsequently minimize point-to-plane distances are selected from the TLS point cloud. This is likely to achieve better results because the TLS point cloud has higher point density (allowing for better plane fit) and higher accuracy. We used the full-resolution TLS point clouds and in some cases filtered for points below 15 or 10 m to exclude crown points. For the tree crowns, the coverage of TLS and ULS point clouds is very different, and there are many moving objects (branches and leaves). We increased the number of iterations to up to 10. We then applied the inverse output transformation matrix to the TLS point cloud to ensure that the final georeferencing is derived from the ULS data. This resulted in mean point-to-plane distances of below 4 mm, with standard deviations of 50–100 mm. More importantly, the visual alignment improved considerably.

Because of the lack of bigger planar areas in the scene, the influence of moving branches and leaves, the different viewing geometries and resulting occlusions of the ULS and TLS point clouds, and the sensor accuracy limits, it is expected that no sub-millimeter point-to-plane distances were achieved, and the standard deviation is comparably high. The final alignment is sufficient for our main intention, i.e., the extraction of trees from ULS and ALS point clouds using template TLS point clouds. A visual impression of the quality of the alignment between the datasets acquired from the different platforms is given in Fig. 4. The visual assessment and the low mean distances suggest no remaining systematic shifts.

4.2.2 Internal co-registration of single scan positions

Co-registration quality of scan positions relative to the main stable position for each plot was assessed in two stages. In the first stage we investigated differences computed in the least-squares algorithm of the Multi Station Adjustment performed in RiSCAN PRO. The overall average of the standard deviation of registration error (RMSE) was 5 mm, with a maximum of 8 mm. In the second stage, a visual assessment was conducted by projecting a narrow section of the point cloud onto a plane and coloring the points by their scan position. We used stem slices for assessing horizontal registration errors and cylindrical reflective targets for assessing vertical registration errors. Horizontal stem slices were manually extracted for selected trees with an approximate section width of 0.2 m. To minimize the influence of wind effects, stem slices were cut out from low heights above ground where tree stems are very stable. This was especially relevant for younger trees, where tree stems are smaller in diameter and therefore more prone to movement. No significant vertical registration errors were present in any of the plots. In most of the acquisitions (26 of 31), no or very small (<10mm) horizontal registration errors were observed. In five acquisitions, the maximum horizontal registration error was estimated to be between 10 and 20 mm. Appendix C in the metadata document published along with the data (Weiser et al.2022b) shows the assessment of horizontal and vertical TLS registration errors.

Weiser et al.2022bWeiser et al. (2022b)Weiser et al. (2022b)Weiser et al.2022b

Table 6Summary of the different indicators for georeferencing and alignment quality. SD is the standard deviation.

Download Print Version | Download XLSX

4.3 Quality of single-tree point clouds

Tree point clouds were assigned a quality tag from highest (q1) to lowest (q6). Reasons for lower grades include

  • segmentation errors, e.g., additional branches in the point cloud;

  • extraction errors, e.g., additional points from neighboring trees in the point cloud because of the fixed search radius used for the automatic kNN-based extraction;

  • missing parts/occlusion of tree parts;

  • wind effects (in case of TLS);

  • scan position or flight strip alignment errors.

The quality tags were subjectively assigned by different operators. For most trees that were extracted automatically using the kNN algorithm, the quality was set to one lower than the source point cloud. The quality tag therefore provides a rough idea of the point cloud quality but is not guaranteed to be comparable across the entire dataset.

4.4 Quality of the tree metrics

The quality of the tree metrics computed from the tree point clouds depends on the accuracy of the tree point cloud segmentation and the point density of the tree point cloud. Error of commission in the segmentation may cause higher tree height, mean CD and CPA values, and lower CBH values, while error of omission or a low point density may cause the opposite. In particular, ALS tree point clouds of understory trees have a low point density, and thus the computed metrics might differ from the field-measured metrics. As field measurements were conducted from the ground, field-measured height, mean CD and CBH of high trees have higher uncertainty than those of smaller trees. It is difficult to consider either the field measurements or the point-cloud-derived metrics as true reference. By comparing field-measured and point-cloud-derived metrics, we can nevertheless get an idea of the error ranges and potential biases.

Tree metrics derived from different sources are highly correlated (except CBH). The Pearson correlation coefficient (ρ) of DBH measured in the field and DBH computed from the TLS tree point clouds is 0.98, and the root-mean-square error (RMSE) is only 3.5 cm. The correlation of tree heights derived from different laser scanning point clouds is 1.00, and the RMSE is 0.3–0.4 m. This is lower than the search radius of 0.5 m, which was mostly used for the automatic tree extraction. The deviation of field-measured tree height to point cloud tree height is higher (RMSE = 2.5–2.7 m, ρ=0.96). Field-measured tree heights might be less accurate, especially in cases of high trees with flat crowns, where the tree top was not visible from the ground. The RMSE of mean CD is 0.7–1.2 m between different point cloud metrics (ρ= 0.97–0.98) and 1.5–1.6 m (ρ= 0.88–0.90) between field measurements and point cloud metrics. Absolute crown diameters are between 2 and 22.1 m, with a median value of 7.7 m (values based on field measurements). Higher deviations in CD between field measurements and point cloud metrics occur for trees with an elongated horizontal crown extent, where the direction of the two orthogonal diameter measurements is more important than for crowns with nearly circular circumference. The CD values computed from point clouds might be overestimated for leaning trees because the part of the stem which is not covered by the crown may still be included in the crown extent if it is above the estimated CBH. Field-measured diameters might be of lower accuracy for high tree crowns where the outer crown circumference was difficult to estimate. The definition and determination of CBH are difficult, both in the field and in the point cloud, resulting in high deviation of CBH values derived from different data sources. The RMSE of CBH computed from different laser scanning point clouds is 4.8–6.9 m (ρ= 0.66–0.76). On the one hand, the point-cloud-derived CBH is sensitive to segmentation errors, especially in the case of the automatic tree extraction from template point clouds. If understory points are wrongly assigned to a tree point cloud, the CBH may be underestimated. On the other hand, the lowest branches may not be sampled well in ULS and especially ALS point clouds; hence the CBH may be overestimated. The field-measured CBH accuracy is negatively affected by an over- or underestimation of branch lengths resulting in a wrong selection of the lowest branch defining the CBH. The RMSE between field-measured and point-cloud-derived CBH is 4.0–5.1 m, and ρ is 0.59–0.72 m. Figure 10 shows scatter plots of field-measured compared to point-cloud-derived DBH, tree height, mean CD and CBH.

Figure 10Scatter plots of metrics computed from tree point clouds and measured in the field. Pearson correlation, root-mean-square error and the equation of a linear model fit to the data are given. DBH is the diameter at breast height, H is the tree height, CBH is the crown base height and CD is the crown diameter.


5 Usage notes

There are different methods to clean point clouds prior to further processing, e.g., applying the statistical outlier removal (SOR, Rusu et al.2008) as implemented, e.g., in the Point Cloud Library (PCL; Rusu and Cousins2011) or in CloudCompare. When working with the TLS data, we recommend filtering the point cloud by pulse shape deviation (as defined by RIEGL). This deviation can give an idea of the reliability of the range measurement, so excluding points with high deviation can improve the overall quality of the point cloud (Pfennigbauer and Ullrich2010). We found a value of 50 to be a suitable threshold.

Because we define the tree position as the position of the stem at ground level, this position (in ETRS89/UTM 32N coordinates; EPSG:25832, ellipsoidal height, GRS80) can be used to normalize the height values of the point clouds to derive heights above ground.

Point clouds can be visualized and processed using the open-source software CloudCompare, other lidar software like OPALS (Pfeifer et al.2014) or LAStools (Rapidlasso GmbH2020), or custom code.

We created the Python software package pytreedb (Höfle et al.2022), which provides a simple database interface and REST API to perform queries on a database of tree objects. pytreedb allows trees to be selected using different filters, maps of the selection results to be viewed and data to be exported in different formats (LAZ point clouds, GeoJSON files, CSV files).

Software for the generation of tree models from TLS tree point clouds includes TreeQSM (Raumonen2020; Raumonen et al.2013), SimpleForest (Hackenberg et al.2015; Hackenberg2021), PlantScan3D (Boudon2021) and AdTree (Du et al.2019; Nan et al.2021). These tools reconstruct the woody structure of the trees and therefore require input point clouds without leaf points. Leaves may be removed from TLS point clouds using spectral and geometric features, as proposed in multiple studies (Krishna Moorthy et al.2020; Vicari et al.2019; Wang et al.2020; Yun et al.2016; Zhou et al.2019) and implemented in different software packages (TLSeparation, Vicari2021; LeWoS, Wang2020). Structural tree reconstruction algorithms may also perform well on some of the ULS leaf-off point clouds, for which no leaf removal is required. Trees may furthermore be reconstructed using voxel-based approaches (Weiser et al.2021).

To perform simulation studies with the reconstructed 3D forest scenes, e.g., to conduct sensitivity analyses, radiative transfer models such as DART (Gastellu-Etchegorry et al.2004), librat (Calders et al.2013; Lewis and Muller1992; Lewis1999) or Rayspread (Widlowski et al.2006) may be used. DART also allows lidar to be simulated. Furthermore, there are specialized lidar simulators such as HELIOS++ (Winiwarter et al.2022, 2021). The point clouds presented here can be used as scene parts for HELIOS++ after converting LAZ files to ASCII files, e.g., using LAStools:

las2txt -i <pointcloud.laz>
          -o <>
          -oparse xyz

and using the xyzloader of HELIOS++ (, last access: 24 June 2022). Alternatively, the trees can be loaded as DetailedVoxels (, last access: 24 June 2022), e.g., after deriving voxel-based plant area density estimates using the software AMAPVox (Vincent et al.2017; Weiser et al.2021,, last access: 24 June 2022).

Our dataset can furthermore be used as a benchmark and testing dataset for tree segmentation and species classification algorithm. This was shown by Fu et al. (2022), who used the TLS point clouds of this dataset for testing the accuracy of their individual tree segmentation algorithm. It may be used to explore the differences between the point cloud measurements from different acquisition platforms and the tree and forestry metrics that can be derived from them. Our dataset can be valuable for developing and testing algorithms on the retrieval of forestry parameters on the single tree or the plot level. Furthermore, open forest datasets like this can serve as reference datasets for the calibration and validation of spaceborne data products like biomass maps from satellite images or from spaceborne lidar data such as the NASA's Global Ecosystem Dynamics Investigation (GEDI; Dubayah et al.2020).

6 Code and data availability

Our dataset is published with PANGAEA and available at (Weiser et al.2022a). For each forest plot, one zip folder is provided, containing the forest plot point clouds with trajectory or scan position information and the single-tree point clouds and measurements. Point clouds of the full-coverage airborne laser scanning acquisitions (covering around 65 km2 in Karlsruhe and 32 km2 in Bretten) and associated full-waveform data are not part of the PANGAEA dataset but are available from the authors on reasonable request.

Python code used for processing the dataset is openly available on GitHub (, Weiser2022). The code uses the libraries NumPy, pandas, laspy, pykdtree, SciPy, jsonschema and Matplotlib. The scripts for georeferencing use the Python bindings of OPALS (Pfeifer et al.2014) modules and were tested with versions 2.3.1 and 2.3.2. Please contact the authors for guidance on using the code.

7 Conclusions

Our dataset encompasses terrestrial and airborne laser scanning point clouds acquired from three different platforms: a static tripod, an uncrewed aerial vehicle (UAV) and a fixed-wing aircraft. We thus obtain georeferenced point cloud representations of the overlapping forest areas at different resolutions and from different viewpoints. UAV-borne laser scanning (ULS) point clouds were furthermore recorded in different seasons, under leaf-on and leaf-off canopy conditions. From the full-coverage point clouds of the plot acquisitions, which are also provided within this publication, we identified and extracted individual single trees, resulting in 4205 point clouds of 1491 individual trees of 22 different species. Most of these trees were captured by airborne and UAV-borne laser scanning. For 249 trees, high-resolution terrestrial laser scanning point clouds are additionally available. We recorded tree measurements in the field, which we compare to and complement with point-cloud-derived tree metrics. This results in single tree forest inventory data with one to five 3D point cloud representations of each tree. The full aerial point clouds, the single-tree point clouds, and the measured and point-cloud-derived tree metrics have high reuse value for numerous applications. We identified the following main areas for such applications: (i) development of allometric equations between 3D tree structures and forest inventory variables; (ii) application as training data for machine learning applications such as tree instance segmentation, tree species classification or tree metric computation; and (iii) creation of detailed tree models for lidar simulations or for computer rendering.

Appendix A

Table A1Forest characteristics of full inventory plots of 1ha. Values are given for all trees in the plot (bold) and for trees of each species separately. For the number of trees and the basal area, both absolute (abs.) and relative (rel.) values are provided. It should be noted that point clouds were not extracted for every tree in these plots.

Download XLSX


The supplement related to this article is available online at:

Author contributions

BH and FEF conceived the study. LW, JS, NK and HW collected and analyzed the data. FEF, JS and HW drafted the manuscript. All authors reviewed the manuscript.

Competing interests

The contact author has declared that none of the authors has any competing interests.


Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


We acknowledge Christian Seitz, Marian Schimka, Katharina Anders, Paula Kuss, Veit Ulrich, Vivien Zahs, Felix Schiefer, Elham Shafeian, Denis Debroize, Annika Denner, Michael Ewald, Lioba Martin, Helen Hake, Tobias Steinert, Vanessa Rittlinger, Fabio Bothner, Louisa Lücking, Janina Schüssler, Angelo Mayer, Katharina Ruge, Johannes Brand, Lea Schraml, Naia Haltmeier, Carolin Klonner, Daria Baete, Thorben Schrempp, Lisa-Maricia Schwarz and Andressa Soarez Braz for their assistance with data acquisition and processing.

Financial support

This research has been supported by the Deutsche Forschungsgemeinschaft (grant no. 411263134).

Review statement

This paper was edited by Sibylle K. Hassler and reviewed by Louise Terryn and one anonymous referee.


Applanix Corporation: POSPAC MMS 8, (last access: 24 June 2022), 2018. a

Arumäe, T. and Lang, M.: Estimation of Canopy Cover in Dense Mixed-Species Forests Using Airborne Lidar Data, Eur. J. Remote Sens., 51, 132–141,, 2018. a

ASPRS: LAS Specification Version 1.2, (last access: 27 July 2021), 2008. a

ASPRS: LAS Specification Version 1.4 – R13, (last access: 27 July 2021), 2013. a, b

Barber, C. B., Dobkin, D. P., and Huhdanpaa, H.: The Quickhull algorithm for convex hulls, ACM T. Math. Software, 22, 469–483, 1996. a

Boudon, F.: PlantScan3D, GitHub [code], (last access: 24 June 2022), 2021. a

Bournez, E., Landes, T., Saudreau, M., Kastendeuch, P., and Najjar, G.: From TLS Point Clouds to 3D Models of Trees: a Comparison of Existing Algorithms for 3D Tree Reconstruction, Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2/W3, 113–120,, 2017. a

Bouvier, M., Durrieu, S., Fournier, R. A., and Renaud, J.-P.: Generalizing Predictive Models of Forest Inventory Attributes Using an Area-Based Approach with Airborne LiDAR Data, Remote Sens. Environ., 156, 322–334,, 2015. a

Bruggisser, M., Hollaus, M., Otepka, J., and Pfeifer, N.: Influence of ULS acquisition characteristics on tree stem parameter estimation, ISPRS J. Photogramm., 168, 28–40,, 2020. a

Calders, K., Lewis, P., Disney, M., Verbesselt, J., and Herold, M.: Investigating assumptions of crown archetypes for modelling LiDAR returns, Remote Sens. Environ., 134, 39–49,, 2013. a

Calders, K., Newnham, G., Burt, A., Murphy, S., Raumonen, P., Herold, M., Culvenor, D., Avitabile, V., Disney, M., Armston, J., and Kaasalainen, M.: Nondestructive estimates of above-ground biomass using terrestrial laser scanning, Methods Ecol. Evol., 6, 198–208,, 2015. a

Calders, K., Adams, J., Armston, J., Bartholomeus, H., Bauwens, S., Bentley, L. P., Chave, J., Danson, F. M., Demol, M., Disney, M., Gaulton, R., Krishna Moorthy, S. M., Levick, S. R., Saarinen, N., Schaaf, C., Stovall, A., Terryn, L., Wilkes, P., and Verbeeck, H.: Terrestrial laser scanning in forest ecology: Expanding the horizon, Remote Sens. Environ., 251, 112102,, 2020. a

CloudCompare: CloudCompare, version 2.10.2, GitHub [code],, last access: 3 April 2019. a

Computree Group: Computree,, last access: 24 June 2022. a

Craig, A.: The Concave Hull of a Set of Points, CodeProject [code], (last access: 1 December 2020), 2017. a

Dassot, M., Colin, A., Santenoise, P., Fournier, M., and Constant, T.: Terrestrial laser scanning for measuring the solid wood volume, including branches, of adult standing trees in the forest environment, Comput. Electron. Agr., 89, 86–93,, 2012. a

Disney, M. I., Kalogirou, V., Lewis, P., Prieto-Blanco, A., Hancock, S., and Pfeifer, M.: Simulating the impact of discrete-return lidar system and survey characteristics over young conifer and broadleaf forests, Remote Sens. Environ., 114, 1546–1560,, 2010. a, b

Disney, M. I., Boni Vicari, M., Burt, A., Calders, K., Lewis, S. L., Raumonen, P., and Wilkes, P.: Weighing trees with lasers: advances, challenges and opportunities, Interface Focus, 8, 20170048,, 2018. a, b

DJI: MATRICE 600 PRO – User Manual, pro/1208EN/Matrice_600_Pro_User_Manual_v1.0_EN_1208.pdf (last access: 17 May 2021), 2018. a

Du, S., Lindenbergh, R., Ledoux, H., Stoter, J., and Nan, L.: AdTree: Accurate, Detailed, and Automatic Modelling of Laser-Scanned Trees, Remote Sensing, 11, 2074,, 2019. a

Dubayah, R., Blair, J. B., Goetz, S., Fatoyinbo, L., Hansen, M., Healey, S., Hofton, M., Hurtt, G., Kellner, J., Luthcke, S., Armston, J., Tang, H., Duncanson, L., Hancock, S., Jantz, P., Marselis, S., Patterson, P. L., Qi, W., and Silva, C.: The Global Ecosystem Dynamics Investigation: High-resolution laser ranging of the Earth's forests and topography, Sci. Remote Sens., 1, 100002,, 2020. a

Fu, H., Li, H., Dong, Y., Xu, F., and Chen, F.: Segmenting Individual Tree from TLS Point Clouds Using Improved DBSCAN, Forests, 13, 566,, 2022. a

Gastellu-Etchegorry, J. P., Martin, E., and Gascon, F.: DART: a 3D model for simulating satellite images and studying surface radiation budget, Int. J. Remote Sens., 25, 73–96,, 2004. a

Gastellu-Etchegorry, J.-P., Yin, T., Lauret, N., Cajgfinger, T., Gregoire, T., Grau, E., Feret, J.-B., Lopes, M., Guilleux, J., Dedieu, G., Malenovský, Z., Cook, B. D., Morton, D., Rubio, J., Durrieu, S., Cazanave, G., Martin, E., and Ristorcelli, T.: Discrete Anisotropic Radiative Transfer (DART 5) for Modeling Airborne and Satellite Spectroradiometer and LIDAR Acquisitions of Natural and Urban Landscapes, Remote Sensing, 7, 1667–1701,, 2015. a

Hackenberg, J.: SimpleTree Plugin, version Beta 4.33.06, (last access: 21 May 2019), 2017. a

Hackenberg, J.: SimpleForest: A tree modelling software, (last access: 24 June 2022), 2021. a

Hackenberg, J., Spiecker, H., Calders, K., Disney, M., and Raumonen, P.: SimpleTree – An Efficient Open Source Tool to Build Tree Models from TLS Clouds, Forests, 6, 4245–4294,, 2015. a, b

Höfle, B., Qu, J., Winiwarter, L., Weiser, H., Zahs, V., Schäfer, J., and Fassnacht, F. E.: pytreedb: library for point clouds of tree vegetation objects, GitHub [code],, last access: 24 June 2022. a

Holopainen, M., Vastaranta, M., and Hyyppä, J.: Outlook for the Next Generation's Precision Forestry in Finland, Forests, 5, 1682–1694,, 2014. a

Krishna Moorthy, S. M., Calders, K., Vicari, M. B., and Verbeeck, H.: Improved Supervised Learning-Based Approach for Leaf and Wood Classification From LiDAR Point Clouds of Forests, IEEE T. Geosci. Remote Sens., 58, 3057–3070,, 2020. a

Latifi, H., Fassnacht, F. E., Müller, J., Tharani, A., Dech, S., and Heurich, M.: Forest inventories by LiDAR data: A comparison of single tree segmentation and metric-based methods for inventories of a heterogeneous temperate forest, Int. J. Appl. Earth Obs., 42, 162–174,, 2015. a

Lewis, P.: Three-dimensional plant modelling for remote sensing simulation studies using the Botanical Plant Modelling System, Agronomie, 19, 185–210,, 1999. a

Lewis, P. and Muller, J.-P.: The Advanced Radiometric Ray-Tracer (ARARAT) for plant canopy reflectance simulation, Int. Arch. Photgramm. Rem. Sens., (Commission VII(B7)) 29, 26–34, (last access: 24 June 2022), 1992. a

Liu, G., Wang, J., Dong, P., Chen, Y., and Liu, Z.: Estimating Individual Tree Height and Diameter at Breast Height (DBH) from Terrestrial Laser Scanning (TLS) Data at Plot Level, Forests, 9, 398,, 2018. a

Maltamo, M., Eerikäinen, K., Pitkänen, J., Hyyppä, J., and Vehmas, M.: Estimation of timber volume and stem density based on scanning laser altimetry and expected tree size distribution functions, Remote Sens. Environ., 90, 319–330,, 2004. a

Maltamo, M., Næsset, E., and Vauhkonen, J. (Eds.): Forestry Applications of Airborne Laser Scanning, Vol. 27, Springer Netherlands, Dordrecht, ISBN 978-94-017-8663-8, 2014. a

Montaghi, A., Corona, P., Dalponte, M., Gianelle, D., Chirici, G., and Olsson, H.: Airborne laser scanning of forest resources: An overview of research in Italy as a commentary case study, Int. J. Appl. Earth Obs., 23, 288–300,, 2013. a

Moreira, A. and Yasmina Santos, M.: Concave hull: A k-nearest neighbours approach for the computation of the region occupied by a set of points, in: Proceedings of the Second International Conference on Computer Graphics Theory and Applications – Volume 2: GRAPP, 61–68, INSTICC, SciTePress,, 2007. a

Morsdorf, F., Eck, C., Zgraggen, C., Imbach, B., Schneider, F. D., and Kükenbrink, D.: UAV-based LiDAR acquisition for the derivation of high-resolution forest and ground information, Leading Edge, 36, 566–570,, 2017. a

Nan, L., Messal, L., Du, S., and Yang, Z.: AdTree, GitHub [code], (last access: 24 June 2022), 2021. a

Næsset, E.: Predicting Forest Stand Characteristics with Airborne Scanning Laser Using a Practical Two-Stage Procedure and Field Data, Remote Sens. Environ., 80, 88–99,, 2002. a

Næsset, E., Gobakken, T., Solberg, S., Gregoire, T. G., Nelson, R., Ståhl, G., and Weydahl, D.: Model-Assisted Regional Forest Biomass Estimation Using LiDAR and InSAR as Auxiliary Data: A Case Study from a Boreal Forest Area, Remote Sens. Environ., 115, 3599–3614,, 2011. a

Pearse, G. D., Watt, M. S., Dash, J. P., Stone, C., and Caccamo, G.: Comparison of Models Describing Forest Inventory Attributes Using Standard and Voxel-Based Lidar Predictors across a Range of Pulse Densities, Int. J. Appl. Earth Obs., 78, 341–351,, 2019. a, b, c

Pfeifer, N., Mandlburger, G., Otepka, J., and Karel, W.: OPALS – A framework for Airborne Laser Scanning data analysis, Comput. Environ. Urban, 45, 125–136,, 2014. a, b, c

Pfennigbauer, M. and Ullrich, A.: Improving quality of laser scanning data acquisition through calibrated amplitude and pulse deviation measurement, in: Laser Radar Technology and Applications XV, edited by: Turner, M. D. and Kamerman, G. W., Vol. 7684, 463–472, International Society for Optics and Photonics, SPIE,, 2010. a, b, c

Rapidlasso GmbH: LAStools – efficient LiDAR processing software (version 200509, unlicensed),, last access: 9 June 2020. a, b

Raumonen, P.: TreeQSM: Reconstruction of quantitative structure models of trees from point cloud data, Zenodo [code],, 2020. a

Raumonen, P., Kaasalainen, M., Åkerblom, M., Kaasalainen, S., Kaartinen, H., Vastaranta, M., Holopainen, M., Disney, M., and Lewis, P.: Fast Automatic Precision Tree Models from Terrestrial Laser Scanner Data, Remote Sens., 5, 491–520,, 2013. a, b

RIEGL Laser Measurement Systems: RIEGL VZ-400, Data Sheet, (last access: 10 June 2022), 2017. a, b

RIEGL Laser Measurement Systems: RIEGL VQ-780i, Data Sheet, (last access: 10 June 2021), 2019. a, b

RIEGL Laser Measurement Systems: Data Processing Software RiPROCESS for RIEGL Scan Data, (last access: 23 March 2021), 2020a. a

RIEGL Laser Measurement Systems: Operating & Processing Software RiSCAN PRO for RIEGL 3D Laser Scanners, (last access: 4 July 2022), 2020b. a

RIEGL Laser Measurement Systems: RIEGL miniVUX-1UAV, Data Sheet, (last access: 10 June 2022), 2020c. a, b

Roberts, O., Bunting, P., Hardy, A., and McInerney, D.: Sensitivity Analysis of the DART Model for Forest Mensuration with Airborne Laser Scanning, Remote Sens., 12, 247,, 2020. a

Roussel, J.-R., Auty, D., Coops, N. C., Tompalski, P., Goodbody, T. R., Meador, A. S., Bourdon, J.-F., de Boissieu, F., and Achim, A.: lidR: An R package for analysis of Airborne Laser Scanning (ALS) data, Remote Sens. Environ., 251, 112061,, 2020. a

Rusu, R. B. and Cousins, S.: 3D is here: Point Cloud Library (PCL), in: IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China,, 2011. a

Rusu, R. B., Marton, Z. C., Blodow, N., Dolha, M., and Beetz, M.: Towards 3D Point cloud based object maps for household environments, Robot. Auton. Syst., 56, 927–941,, 2008. a

Scion: Forest Science, Brochure, (last access: 25 April 2022), 2021. a

Sinoquet, H., Le Roux, X., Adam, B., Ameglio, T., and Daudet, F. A.: RATP: a model for simulating the spatial distribution of radiation absorption, transpiration and photosynthesis within canopies: application to an isolated tree crown, Plant Cell Environ., 24, 395–406,, 2001. a

Smith, A. M. S., Falkowski, M. J., Hudak, A. T., Evans, J. S., Robinson, A. P., and Steele, C. M.: A Cross-Comparison of Field, Spectral, and Lidar Estimates of Forest Canopy Cover, Can. J. Remote Sens., 35, 447–459,, 2009. a

Vicari, M. B.: TLSeparation, GitHub [code], (last access: 24 June 2022), 2021. a

Vicari, M. B., Disney, M., Wilkes, P., Burt, A., Calders, K., and Woodgate, W.: Leaf and wood classification framework for terrestrial LiDAR point clouds, Methods Ecol. Evol., 10, 680–694,, 2019. a

Vincent, G., Antin, C., Laurans, M., Heurtebize, J., Durrieu, S., Lavalley, C., and Dauzat, J.: Mapping plant area index of tropical evergreen forest by airborne laser scanning. A cross-validation study using LAI2200 optical sensor, Remote Sens. Environ., 198, 254–266,, 2017. a

Wallace, L., Musk, R., and Lucieer, A.: An Assessment of the Repeatability of Automatic Forest Inventory Metrics Derived From UAV-Borne Laser Scanning Data, IEEE T. Geosci. Remote Sens., 52, 7160–7169,, 2014. a

Wang, D.: LeWoS, Zenodo [code],, 2020. a

Wang, D., Momo Takoudjou, S., and Casella, E.: LeWoS: A universal leaf-wood classification method to facilitate the 3D modelling of large tropical trees using terrestrial LiDAR, Methods Ecol. Evol., 11, 376–389,, 2020. a

Wang, Y., Lehtomäki, M., Liang, X., Pyörälä, J., Kukko, A., Jaakkola, A., Liu, J., Feng, Z., Chen, R., and Hyyppä, J.: Is field-measured tree height as reliable as believed – A comparison study of tree height estimates from field measurement, airborne laser scanning and terrestrial laser scanning in a boreal forest, ISPRS J. Photogramm., 147, 132–145,, 2019. a

Weiser, H.: 3dgeo-heidelberg/syssifoss: Version 1.0.0, Zenodo [code],, 2022. a

Weiser, H., Winiwarter, L., Anders, K., Fassnacht, F. E., and Höfle, B.: Opaque voxel-based tree models for virtual laser scanning in forestry applications, Remote Sens. Environ., 265, 112641,, 2021. a, b, c

Weiser, H., Schäfer, J., Winiwarter, L., Krašovec, N., Seitz, C., Schimka, M., Anders, K., Baete, D., Braz, A. S., Brand, J., Debroize, D., Kuss, P., Martin, L. L., Mayer, A., Schrempp, T., Schwarz, L.-M., Ulrich, V., Fassnacht, F. E., and Höfle, B.: Terrestrial, UAV-borne, and airborne laser scanning point clouds of central European forest plots, Germany, with extracted individual trees and manual forest inventory measurements, PANGAEA [data set],, 2022a. a, b

Weiser, H., Schäfer, J., Winiwarter, L., Krašovec, N., Seitz, C., Schimka, M., Anders, K., Baete, D., Braz, A. S., Brand, J., Debroize, D., Kuss, P., Martin, L. L., Mayer, A., Schrempp, T., Schwarz, L.-M., Ulrich, V., Fassnacht, F. E., and Höfle, B.: Terrestrial, UAV-borne, and airborne laser scanning point clouds of central European forest plots, Germany, with extracted individual trees and manual forest inventory measurements – Metadata Documentation, (last access: 24 June 2022), 2022b. a, b, c, d, e, f, g

White, J. C., Tompalski, P., Vastaranta, M., Wulder, M. A., Saarinen, N., Stepper, C., and Coops, N. C.: A model development and application guide for generating an enhanced forest inventory using airborne laser scanning data and an area-based approach, CWFC Information Report FI-X-018, Natural Resources Canada, Victoria, BC, Canada, (last access: 24 June 2022), 2017. a

Widlowski, J.-L., Lavergne, T., Pinty, B., Verstraete, M., and Gobron, N.: Rayspread: A Virtual Laboratory for Rapid BRF Simulations Over 3-D Plant Canopies, in: Computational Methods in Transport, edited by: Graziani, F., 211–231, Springer, Berlin, Heidelberg, 211–231,, 2006. a

Widlowski, J.-L., Mio, C., Disney, M., Adams, J., Andredakis, I., Atzberger, C., Brennan, J., Busetto, L., Chelle, M., Ceccherini, G., Colombo, R., Côté, J.-F., Eenmäe, A., Essery, R., Gastellu-Etchegorry, J.-P., Gobron, N., Grau, E., Haverd, V., Homolová, L., Huang, H., Hunt, L., Kobayashi, H., Koetz, B., Kuusk, A., Kuusk, J., Lang, M., Lewis, P. E., Lovell, J. L., Malenovský, Z., Meroni, M., Morsdorf, F., Mõttus, M., Ni-Meister, W., Pinty, B., Rautiainen, M., Schlerf, M., Somers, B., Stuckens, J., Verstraete, M. M., Yang, W., Zhao, F., and Zenone, T.: The fourth phase of the radiative transfer model intercomparison (RAMI) exercise: Actual canopy scenarios and conformity testing, Remote Sens. Environ., 169, 418–437,, 2015.  a

Winiwarter, L., Pena, A. M. E., Weiser, H., Anders, K., Sanches, J. M., Searle, M., and Höfle, B.: 3dgeo-heidelberg/helios, Zenodo [code],, 2021. a

Winiwarter, L., Esmorís Pena, A. M., Weiser, H., Anders, K., Martínez Sánchez, J., Searle, M., and Höfle, B.: Virtual laser scanning with HELIOS++: A novel take on ray tracing-based simulation of topographic full-waveform 3D laser scanning, Remote Sens. Environ., 269, 112772,, 2022. a, b

Yun, T., An, F., Li, W., Sun, Y., Cao, L., and Xue, L.: A Novel Approach for Retrieving Tree Leaf Area from Ground-Based LiDAR, Remote Sens., 8, 942,, 2016. a

Zhou, J., Wei, H., Zhou, G., and Song, L.: Separating Leaf and Wood Points in Terrestrial Laser Scanning Data Using Multiple Optimal Scales, Sensors, 19, 1852,, 2019. a


σ(MAD) is the median absolute deviation to median (MAD) scaled by 1.4826 to obtain a robust estimator for the standard deviation σ under the assumption of a normal distribution; (last access: 24 June 2022).

Short summary
3D point clouds, acquired by laser scanning, allow us to retrieve information about forest structure and individual tree properties. We conducted airborne, UAV-borne and terrestrial laser scanning in German mixed forests, resulting in overlapping point clouds with different characteristics. From these, we generated a comprehensive database of individual tree point clouds and corresponding tree metrics. Our dataset may serve as a benchmark dataset for algorithms in forestry research.
Final-revised paper