Articles | Volume 16, issue 9
https://doi.org/10.5194/essd-16-4161-2024
https://doi.org/10.5194/essd-16-4161-2024
Data description paper
 | 
13 Sep 2024
Data description paper |  | 13 Sep 2024

A field-based thickness measurement dataset of fallout pyroclastic deposits in the peri-volcanic areas of Campania (Italy): statistical combination of different predictions for spatial estimation of thickness

Pooria Ebrahimi, Fabio Matano, Vincenzo Amato, Raffaele Mattera, and Germana Scepi
Abstract

Determining the spatial thickness (z) of in situ and reworked fallout pyroclastic deposits plays a key role in volcanological studies and in shedding light on geomorphological and hydrogeological processes in peri-volcanic areas. However, this is a challenging line of research because (1) field-based measurements are expensive and time-consuming, (2) the ash might have been dispersed in the atmosphere by several volcanic eruptions, and (3) wind characteristics during an eruptive event and soil-forming and/or denudation processes after ash deposition on the ground surface affect the expected spatial distribution of these deposits. This article tries to bridge this knowledge gap by applying statistical techniques for making representative spatial thickness predictions to be used for the analysis of geomorphic processes at the catchment and sub-catchment scales. First, we compiled a field-based thickness measurement dataset (https://doi.org/10.5281/zenodo.8399487; Matano et al., 2023) of fallout pyroclastic deposits in the territories of several municipalities in Campania, southern Italy. Second, 18 predictor variables were derived mainly from digital elevation models and satellite images and were assigned to each measurement point. Third, the stepwise regression (STPW) model and random forest (RF) machine learning technique are used for thickness modeling. Fourth, the estimations are compared with those of three models that already exist in the literature. Finally, the statistical combination of different predictions is implemented to develop a less biased model for estimating pyroclastic thickness. The results show that the prediction accuracy of RF (RMSE <82.46 and MAE <48.36) is better than that of existing models in the literature. Moreover, statistical combination of the predictions obtained from the above-mentioned models through a least absolute deviation (LAD) combination approach leads to the most representative thickness estimation (MAE <45.12) in the study area. The maps for the values estimated by RF and LAD (as the best single model and combination approach, respectively) illustrate that the spatial patterns did not change significantly, but the estimations by LAD are smaller. This combined approach can help in estimating the thickness of fallout pyroclastic deposits in other volcanic regions and in managing geohazards in areas covered with loose pyroclastic materials.

1 Introduction

A significant quantity of ash is dispersed in the atmosphere during an explosive volcanic eruption and is deposited over a large area of ground surface following wind transportation. The spatial thickness of the ash layer typically decreases with distance from the eruptive vent (e.g., see Perrotta and Scarpati, 2003; Bourne et al., 2010; Lowe, 2011; Brown et al., 2012; Caron et al., 2012; Costa et al., 2012; Albert et al., 2019; Eychenne and Engwell, 2023) and noticeably influences geomorphological and hydrological processes such as landscape evolution, hillslope hydrology, erosion, and slope stability because the geotechnical and hydraulic properties of the unconsolidated ash layer usually differ from the underlying bedrock and soil. A deep understanding of the thickness of fallout pyroclastic deposits could, therefore, help address geohazard management and many related socioeconomic concerns. It is challenging to estimate the spatial thickness of fallout pyroclastic deposits because there might be more than one eruptive event, the ash dispersal pattern is influenced by the changes in wind characteristics (e.g., the speed and direction of the wind) during a single eruption, and soil-forming and denudation processes continuously influence the expected spatial thickness due to different slope exposures and geometry. Only costly and time-consuming detailed field-based measurements may allow us to effectively map the thickness variations of fallout pyroclastic deposits over a limited area (for examples, see Matano et al., 2016; Cuomo et al., 2021). The spatial thickness of fallout pyroclastic deposits under the influence of hillslope processes, accordingly, remains a knowledge gap (P. De Vita et al., 2006).

Estimating the residual regolith has been a common practice (e.g., Saulnier et al., 1997; Saco et al., 2006; Tesfa et al., 2009; Segoni et al., 2013), and the implemented models performed better when developed based on independent variables and when applied to a specific site or in limited areas (Del Soldato et al., 2018; Matano et al., 2016). Conventional approaches for predicting pyroclastic thickness primarily rely on geological data, but the significant improvements in the availability of remote sensing data, along with the recent advances in recording the depositional history of fallout pyroclastic deposits, present a unique opportunity to enhance prediction accuracy. Moreover, the statistical literature shows that better results can be achieved by combining estimations derived from different models, which has not been adopted for the objectives of this article to date.

This article explores the integration of a wide range of predictor variables, mainly derived from digital elevation model (DEM) and satellite multispectral images, with machine learning techniques (i.e., stepwise regression and random forest). These approaches identify the most relevant variables and capture non-linear relationships between the predictor variables and pyroclastic thickness values in order to improve prediction accuracy. Combination schemes were then applied to the predictions of these methods and to the ones derived from classical approaches, as slope angle pyroclastic thickness (SAPT; P. De Vita et al., 2006), geomorphological pyroclastic thickness (GPT; Del Soldato et al., 2016), and slope exponential pyroclastic thickness (SEPT; Del Soldato et al., 2018) methods. Finally, the predictions are validated by field-based measurements of fallout pyroclastic deposits to empirically demonstrate that combining the results of different models provides better thickness predictions for the fallout pyroclastic deposits.

The different sections of this article are briefly introduced here to provide better insight into content of this article. Section 2 introduces the study area, while Sect. 3 explains the data collection in the field, along with the methodology for preparing the predictor variables and for predicting the thickness of fallout pyroclastic deposits. In the next section, a detailed description of the field-based thickness measurement dataset and of the predictor variables is provided. Section 5 discusses the results and highlights the advantages of using statistical combination in predicting the thickness of fallout pyroclastic deposits. The concluding remarks and suggestions for future work are presented in the last section.

2 Study area

The area of interest encompasses Campania and the immediate surroundings in southern Italy (Fig. 1). It is bounded on the west by the Tyrrhenian Sea and on the east by the Apennine hilly–mountainous inner land, with an altitude of up to 2050 m a.s.l. at Mt. Miletto. The area has a Mediterranean climate with hot, dry summers and moderately cool rainy winters. The mean annual temperature is about 10 °C in the mountainous areas and roughly 18 °C along the coast. The mean annual rainfall ranges from 700 mm in the eastern part of the region to 1800 mm in the central part of the Apennine mountains (Ducci and Tranfaglia, 2008).

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f01

Figure 1Location of the study area in southern Italy.

The geological units of the Apennine mountains are formed by Triassic to Early Miocene carbonate platform limestones and pelagic basin calcareous pelitic sequences. They are strongly deformed, mainly thrusted eastward (Bonardi et al., 2009; Doglioni, 1991; Patacca et al., 1990) and uncomformably covered by Middle Miocene to Pliocene thrust-top basin fillings, formed by siliciclastic sequences (mostly clay, sandstone, and conglomerate; Di Nocera et al., 2006).

The Quaternary extension in the hinterland and the axial sectors of Campania caused several fluvio-lacustrine intramontane-basin openings (Ferranti and Oldow, 2005; Amato et al., 2018; Boncio et al., 2022). The NW–SE- and NE–SW-striking faults delimit the strongly subsiding coastal basins (e.g., Volturno, Campania, Sarno, and Sele plains) along the Tyrrhenian belt where the volcanic complexes of Somma–Vesuvius, Phlegrean Fields, Ischia, and Roccamonfina occur (Fig. 1). The volcanoes are active (except for Roccamonfina) and have erupted at least once in the last 1000 years (Rosi and Sbrana, 1987; Santacroce, 1987). Diffuse degassing, fumaroles, and hot springs are observed around and in the submerged sectors of the volcanoes (Rosi and Sbrana, 1987; Chiodini et al., 2001; de Lorenzo et al., 2001). The above-mentioned volcanic complexes are briefly introduced below:

  • The Somma–Vesuvius volcanic complex lies over a large sedimentary plain, prevalently filled by pyroclastic deposits. In this volcanic complex, the older Mt. Somma stratovolcano was cut by an eccentric polyphasic caldera and by the Vesuvius stratocone (Sbrana et al., 2020). Four main Plinian eruptions (Cioni et al., 2003; Santacroce et al., 2008; Sulpizio et al., 2010a, b; Mele et al., 2011; Sevink et al., 2011; Doronzo et al., 2022) and several interplinian eruptions (Andronico and Cioni, 2002; Cioni et al., 2015; Sulpizio et al., 2005, 2007; Bertagnini et al., 2006) have been linked to the volcanic activities of Somma–Vesuvius.

  • The Phlegrean Fields consist of several volcanoes in a large caldera west of Naples, characterized by many eruptions with a large and very large volcanic explosivity index (VEI; Newhall and Self, 1982) (Fig. 2). Volcanic activity in the Phlegrean Fields began prior to 80 ka (Pappalardo et al., 1999; Scarpati et al., 2014), and the caldera collapses occurred during the eruptions of Campanian Ignimbrite (ca. 39 ka; Deino et al., 2004; De Vivo et al., 2001), Masseria del Monte Tuff (29 ka; Albert et al., 2019), and Neapolitan Yellow Tuff (15 ka; Orsi et al., 1996; Perrotta et al., 2006; Vitale and Isaia, 2014). The post-15 ka activity was well described by Di Vito et al. (1999), Isaia et al. (2009), and Smith et al. (2011).

  • Ischia Island is the emergent part of a volcanic edifice in the Gulf of Naples, whose activity started before 150 ka. The island is composed of volcanic rocks (mostly trachyte and phonolite) formed by effusive and explosive eruptions, epiclastic deposits, and subordinate terrigenous sediments (S. De Vita et al., 2006).

  • The Roccamonfina volcanic complex was active between 550 and 150 ka in the Garigliano River rift valley. It was affected by an intense Plinian activity revealed by very large craters. The central caldera is the result of the eruptive explosions at 353 ± 5 ka, while the latest stage of activity featured the edification of the central shoshonitic domes at 150 ka (Giannetti, 2001; Rouchon et al., 2008).

The fallout pyroclastic deposits considered in this article are mainly related to the Somma–Vesuvius and Phlegrean Fields volcanoes (Fig. 2) because the thickness of the Ischia tephra is not considerable on the mainland, and the old Roccamonfina deposits have mostly eroded outside the volcanic edifice. Therefore, only the volcanic history of Somma–Vesuvius and Phlegrean Fields will be further described in this section.

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f02

Figure 2Main explosive eruptions of the Somma–Vesuvius and Phlegrean Fields volcanoes. The major explosive events are in red. The pedomarker A and the pedomarker B refer to paleosol layers developed during eruptive quiescence.

Download

2.1 The volcanic history of the Phlegrean Fields

The most important volcanic activities in the Phlegrean Fields are the Campanian Ignimbrite (CI: 39 ka; De Vivo et al., 2001) and the Neapolitan Yellow Tuff eruptions (15 ka; Orsi et al., 1992, 1996; Wohletz et al., 1995; Deino et al., 2004). The former is the most powerful volcanic event to have ever occurred in the Mediterranean area (Barberi et al., 1978; Fisher et al., 1993; Orsi et al., 1996; Rosi et al., 1988, 1996; Civetta et al., 1997; De Vivo et al., 2001; Cappelletti et al., 2003; Engwell et al., 2014; Scarpati and Perrotta, 2016; Smith et al., 2016) and emplaced thick sequences of fallout deposits and pyroclastic density currents of mostly trachytic composition (Giaccio et al., 2008; Costa et al., 2022). The dispersed ash during the first eruptive phase was transported by wind towards the east (Rosi et al., 1999; Perrotta and Scarpati, 2003; Scarpati and Perrotta, 2016). The pyroclastic density currents were subsequently emplaced over an area of 7000 km2 and surmounted ridges more than 1000 m high (Barberi et al., 1978; Fisher et al., 1993). The CI distal outcrops are mostly represented by a massive, gray ignimbrite (Barberi et al., 1978; Fisher et al., 1993; Scarpati et al., 2014), distributed beyond  80 km from the vent (Smith et al., 2016).

The post-15 ka activity of the Phlegrean Fields was concentrated in three epochs separated by two quiescent periods (Fig. 2; Di Vito et al., 1999; Smith et al., 2011; Di Renzo et al., 2011, and references therein) and terminated with the Monte Nuovo eruption in 1538 CE (Guidoboni and Ciuccarelli, 2011; Di Vito et al., 2016, and references therein). The first epoch (15 to  9.5 ka) was characterized by several explosive events, of which the Pomici Principali eruption was the most energetic (Lirer et al., 1987; Di Vito et al., 1999). This epoch was followed by a quiescent period when a thick paleosol layer, pedomarker A, was developed. The second epoch (8.6–8.2 ka; Di Vito et al., 1999) was distinguished by only a few episodes of low-magnitude eruptions, mainly in the NE Campanian Plain. After formation of the pedomarker B in a prolonged volcanic quiescence, the last epoch of intense volcanic activity began between 4.4 and 3.8 ka (Di Vito et al., 1999). The third epoch was characterized by several explosive events, of which the Agnano–Monte Spina eruption (4.4 ka; De Vita et al., 1999; Dellino et al., 2021) was the most powerful. This epoch was followed by a prolongate quiescent period and then the Monte Nuovo eruption (1538 CE; Di Vito et al., 1987; Piochi et al., 2005). Since 1960, fumarolic and hydrothermal activities with episodes of bradyseism have mainly occurred in the Phlegrean Fields (Cannatelli et al., 2020).

2.2 The volcanic history of Somma–Vesuvius

The Somma–Vesuvius volcanic activity is characterized by four major Plinian eruptions (i.e., Pomici di Base or “Sarno” at ca. 22 ka, Mercato or “Ottaviano” at ca. 9.0 ka, Avellino at 3.9 ka, and Pompeii at 79 CE) and several low-intensity interplinian eruptions (Fig. 2). Pomici di Base (Andronico et al., 1995; Santacroce et al., 2008) was the oldest caldera-forming event, which was followed by notably variable interplinian activities, alternating low-magnitude eccentric flank eruptions, quiescent phases, and subplinian events (such as the Greenish Pumice eruption at  19 ka; Santacroce and Sbrana, 2003; Santacroce et al., 2008). The products of the Mercato eruption (Rolandi et al., 1993a; Mele et al., 2011) that occurred about 13,000 years later were separated from those of the Avellino eruption (Rolandi et al., 1993b; Sulpizio et al., 2010a, b; Sevink et al., 2011) by a thick paleosol layer (Di Vito et al., 1999). The low-intensity eruptions of AP1–AP6 (3.5–2.3 ka; Andronico et al., 2002; Santacroce et al., 2008; Passariello et al., 1999, 2010; Di Vito et al., 2019) preceded the eruption of Pompeii, which has been well described by many authors (from Sigurdsson et al., 1985, to Doronzo et al., 2022, and references therein).

The Vesuvius cone was formed by the most recent period of volcanic activity, characterized by a complex alternation of periods of activity with various explosive characters and quiescent phases (Andronico et al., 1995) suddenly interrupted by the Pollena eruption (472 CE; Rolandi et al., 2004). A Middle Age period of variable activity was then started, with alternating lava effusions, moderately explosive eruptions, and mild periods (Rolandi et al., 1998), before a subplinian eruption in 1631 CE (Bertagnini et al., 2006). After this event, the volcano entered a state of semi-persistent mild activity, with minor lava effusions and short quiescent periods. Each period of repose was preceded by relatively powerful explosive and effusive polyphase eruptions (Arrighi et al., 2001), like the last two in 1908 and 1944.

3 Materials and methods

The data and methods used in this study are summarized in Table 2 and Fig. 3 and are further described in this section.

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f03

Figure 3A flowchart showing how the dataset of field-based thickness measurements (Matano et al., 2023) is used to predict the thickness of the fallout pyroclastic deposits. GPT: geomorphological pyroclastic thickness, SAPT: slope angle pyroclastic thickness, SEPT: slope exponential pyroclastic thickness, STPW: stepwise regression, RF: random forest, SA: simple average, MV: minimum variance, OLS: ordinary least squares, LAD: least absolute deviation.

Download

3.1 Dataset of field-based thickness measurements

A dataset of 6671 field-based thickness measurements (Matano et al., 2023) has been collected during the field surveys and investigations conducted over the last few decades for scientific and technical studies in the study area (Fig. 1). The measurements explain the distance between the topographic surface and the upper limit of the consolidated basement; we refer to “total” measurement when the instruments could measure the whole distance and “partial” measurement when the limitations of the implemented instruments led to only partial measurements of the distance. Further details about the stratigraphy of the pyroclastic deposits and the possible presence of non-lithified paleosols are not considered as they are beyond the scope of this article. The following methods have been applied for measuring the thickness of the unconsolidated pyroclastic materials on the bedrock:

  • Probing tests (PRBs). An iron rod (1.8 cm in diameter and up to 306 cm in length) was driven into the ground by hand or by a 0.03 kN hammer to measure the depth of the underlying consolidated bedrock, indicating the thickness of the fallout pyroclastic deposits as well. Each measurement represents the arithmetic mean of two or three measured values within a circle with a radius of 1–2 m to minimize the error associated with the local factors such as the presence of cobbles, boulders, roots, colluvium, and pumice layers.

  • Penetration tests. Two types of penetration tests were implemented in this study. The dynamic cone penetration tests (DPT–DL030) were performed by driving an iron rod with a cross-sectional area of 10 cm2 into the ground by repeatedly raising a 0.3 kN weight for 20 cm and then dropping it. These tests refer to the in situ continuous measurement of rock and/or soil resistance to penetration up to 14 m depth, which could also be an indirect measure for the thickness of the fallout pyroclastic deposits when the probe fails to penetrate. The data collected by this method are in accordance with the probing-test results and help interpret the stratigraphy as well. We also used the results of the standard penetration test (SPT), which is a common in situ dynamic test for determining the geotechnical properties of subsurface soil such as relative density and shear strength parameters.

  • Boreholes (BHs). The borehole stratigraphic data were used for collecting the thickness of fallout pyroclastic deposits.

  • Hand-dug pits (HDPs). These were usually excavated manually (mainly at a size of 20 × 20 × 200 cm) near penetration test or geophysical survey sites to collect further information on the stratigraphy of the loose materials over the bedrock.

  • Trenches (TRNs). These were excavated (1 m wide, 3 m long, and 2–3 m deep) at the base of the slopes or along the intermediate morphological shelves using mechanical diggers for direct investigation of the pyroclastic-deposit stratigraphy.

  • Seismic surveys (SSs). Up to 10 m depth, the seismic reflection data of three bursts (direct, reverse, and intermediate) were recorded by 20–24 geophones placed 3–5 m apart in a straight line on the ground surface. The seismic data revealed the geometry and stratigraphy of the ash layer, along with the boundary between the consolidated bedrock and the overlying loose materials.

  • Outcrops (OCPs). The stratigraphy of several outcrops was analyzed across the study area to measure the thickness of the pyroclastic deposits.

For each method, the measurement error and estimated interpretation uncertainty (i.e., estimation of the fallout pyroclastic-deposit thickness) are shown in Table 1. About 1 cm error is expected for direct thickness measurements conducted with outcrops, hand-dug pits, trenches, and boreholes. The error associated with the probing-test results is around 1 cm as well. The measurement error of the penetration tests (i.e., dynamic cone penetration test (DPT–DL030) and standard penetration test) is considered to be 10 cm because the number of blows was counted following the driving of the rod into the ground for 10 cm. In seismic surveys, the error depends on the specific technique and site characteristics, but a measurement error of 100 cm might be a good estimation for the whole study area.

Table 1The expected measurement error and interpretation uncertainty for the methods implemented.

Download Print Version | Download XLSX

The interpretative uncertainty of the measurements is equal to the measurement error (i.e., 1 cm) for direct thickness measurements, but it increases in probing tests, penetration tests, and seismic surveys. It is noteworthy that the results of these tests and/or surveys were calibrated in the field based on the more precise tests conducted nearby (mostly at 1–10 m distance). The weighted average of errors and the uncertainty are under 6 and 19 cm, respectively (Table 1). Therefore, the bias introduced by measurement errors and interpretative uncertainties is irrelevant to the objectives of this article.

To date, partial and total thickness measurements, the method of investigation, the municipal territory to which the measurement points belong, and the geographic coordinates have been recorded for a total of 6671 points (Matano et al., 2023). The spatial distribution of the measurement points is shown in Fig. 4. Matano et al. (2016) and Cuomo et al. (2021) have already used the measurements in the Cervinara and Nocera Inferiore municipal territories for detailed thickness mapping with heuristic methods.

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f04

Figure 4Spatial distribution of field-based thickness measurement points in each municipality: (a) Villa Santa Lucia, (b) Pozzilli, (c) Gioia Sannitica, (d) Nocera Inferiore, (e) Bagnoli Irpino, (f) Vitulano, and (g) Cervinara. The measurement points are subdivided into the training (n=4294) and test (n=1843) subsets. Although symbols of different shapes and colors are used for the subsets, the spatially dense measurement points do not allow for the application of a larger symbol size. In the electronic version of this article, please zoom in on the figure to distinguish between different symbols based on the shape (if required).

3.2 Predictor variables

A list of the potential predictor variables for estimating the thickness of fallout pyroclastic deposits is provided in Table 2. The value for each predictor variable is assigned to the measurement points based on a set of rasters at 30 × 30 m resolution.

Table 2The predictor variables for thickness modeling.

1 The vertical accuracy of the DEM is evaluated by control points, and the overall root mean square error is <3.5 m (Tarquini et al., 2007). 2 Not applicable. 3 https://geodati.gov.it/resource/id/ispra_rm:01Idro250N_DT (last access: 8 January 2024). 4 Operational Land Imager. Landsat 8 OLI data are calibrated to better than 5 % uncertainty in terms of the top-of-atmosphere reflectance and have an absolute geodetic accuracy better than 65 m circular error at 90 % confidence (Ihlen, 2019).

Download Print Version | Download XLSX

3.2.1 Initial thickness (z0) of fallout pyroclastic deposits

This predictor variable represents the overall thickness of fallout pyroclastic deposits emplaced by Late Quaternary explosive eruptions at a given location. In other words, it explains the thickness value that could be estimated at a location if erosional and/or depositional processes do not occur after the associated eruptive events. In fact, the residual thickness of pyroclastic deposits that can be found at a certain location today is the result of the erosional and depositional processes that occurred after the eruptive events.

To obtain the initial thickness (z0) of fallout pyroclastic deposits, the following approach is used: (1) collect the isopach maps of the fallout deposits for the main volcanic eruptions (characterized by a high explosivity index and great eruptive volume) in Campania from the literature (Tables S1 and S2 in the Supplement); (2) georeference and digitize each map; (3) apply an interpolation technique (i.e., Topo to Raster in ArcMap) to add intermediate isopaches in case of a significant gap between them; (4) assign the average value of two isopaches of different thickness to the area between them, except for the area enclosed by only one isopach; (5) combine all shapefiles into one; (6) compute the z0 value of all volcanic eruptions for each feature in the shapefile; and (7) convert the obtained shapefile into a raster with 30 × 30 m resolution and assign the z0 value to each field-based measurement point in the thickness dataset (Sect. 3.1).

The isopach maps of the fallout deposits for the Somma–Vesuvius and Phlegrean Fields main eruptions are listed in Tables S1 and S2, respectively. The Ischia tephra was not considered for z0 calculation because of its insignificant thickness on the mainland. However, the old Roccamonfina tephra (>150 ka) has been almost entirely eroded outside the volcanic edifice, and we have considered the associated isopach map of pyroclastic deposits only in a semi-quantitative way based on the results of Rouchon et al. (2008) and Giannetti (2001).

Isopach maps are commonly used in volcanological studies to estimate the volume of a single eruptive event and to assess volcanic hazard. They are constructed by interpolating thickness data points, which are considered to be reliable as they are directly measured by investigating the stratigraphy of volcanic deposits. To the best of our knowledge, the uncertainty of the published isopach maps for Somma–Vesuvius and Phlegrean Fields (De Vita et al., 1999; Di Vito et al., 2008; Cappelletti et al., 2003; Costa et al., 2009; Isaia et al., 2004; Orsi et al., 2004; Rolandi et al., 1998, 2004, 2008) has not been discussed in the literature. Only Costa et al. (2012) modeled the Campanian Ignimbrite isopach map based on 113 measurements and reported that the results are in agreement with the measured thickness values (relative mean error =0.3 log units). The uncertainty has also not been quantified for the cumulative isopach maps of multiple eruptions generated for studying erosional processes and landslide susceptibility (P. De Vita et al., 2006; De Vita and Nappi, 2013; Del Soldato et al., 2016, 2018).

3.2.2 Variables derived from DEM and satellite imagery

The predictor variables derived from DEM or satellite imagery (11 and 4 variables, respectively) are listed in Table 2 with a definition, a brief description, and the methodology used to obtain them. Originally, the DEM had a 10 × 10 m spatial resolution, while the satellite imagery had a 30 × 30 m spatial resolution. Raster resampling is, therefore, implemented after calculating the variables to obtain a resolution of 30 × 30 m.

Other variables such as the distance to the hydrographic network and the distance to the source (i.e., eruptive vent) are also considered to be predictor variables in this study. In the latter, several inferred eruptive vents reported by Di Vito et al. (2008), along with the Vesuvius crater and Roccamonfina caldera, were considered to take into account different ash-producing eruptions and to reduce the associated uncertainty as far as possible. Further information is provided in Table 2.

3.3 Methods for thickness modeling

3.3.1 Previous studies

To date, four approaches have been proposed for modeling the thickness (z) of the fallout pyroclastic deposits. The slope angle pyroclastic thickness (SAPT) model estimates z by linking the initial thickness (z0) of fallout pyroclastic deposits that have erupted from the volcanos with the slope angle (P. De Vita et al., 2006; De Vita and Nappi, 2013). In this model, some thresholds for slope angle were derived by field measurements in Mt. Sarno and Mt. Lattari. The geomorphologically indexed soil thickness (GIST) model is an empirical model that combines morphometric, geomorphological, and geological features (Catani et al., 2010) for estimating soil thickness in areas where bedrock weathering is the main soil-forming process (Mercogliano et al., 2013; Segoni et al., 2013), but it is applied to the areas covered by the fallout pyroclastic deposits as well (Rossi et al., 2013). In this article, the GIST model was not implemented because the fallout pyroclastic deposits are of allochthonous origin, and bedrock lithology does not control their thickness (P. De Vita et al., 2006; Del Soldato et al., 2018). Del Soldato et al. (2016) proposed the geomorphological pyroclastic thickness (GPT) model as a combination of the SAPT and GIST models. Comparing the performances of the GIST, SAPT, and GPT models indicated that z is mainly controlled by z0 and slope gradient. Therefore, the slope exponential pyroclastic thickness (SEPT) model was developed based on these two parameters (Del Soldato et al., 2018).

3.3.2 Proposed methods: random forest

For spatial modeling, a wide range of machine learning techniques are available, including logistic regression analysis, random forest (RF), support vector machines, and artificial neural networks. Among these techniques, RF showed the best performance for classification and prediction. It is a shallow ensemble-learning algorithm that can be tuned with few parameters (Liu et al., 2023, and references therein). The principles of decision trees and bagging are implemented for building random forests. Bagging applies bootstrap sampling of the training data for building decision trees and aggregates the predictions across all the trees, which reduces the overall variance and improves the predictive performance. The RF uses a random subset of variables at each split while growing a decision tree during the bagging process to generate a more diverse set of trees, which helps lessen tree correlation beyond bagged trees and noticeably increases the predictive power.

After splitting a given dataset randomly into training and test subsets (Fig. 4), the RF regression modeling could be applied as follows: (1) generate an RF model using the training subset, (2) calculate the variable importance for the established model, (3) apply the constructed RF model to the test subset and evaluate the results, and (4) implement the trained RF model for making predictions in the unknown locations.

The R packages rsample (Frick et al., 2022) and ranger (Wright and Ziegler, 2017) are used for data splitting and modeling, respectively. We considered 70 % of the whole dataset to be the training subset and considered the rest to be the test subset in data splitting (Fig. 4). For training a model, different values are assigned to each hyperparameter, including the number of variables to possibly split at each node (mtry), the minimal node size at which to split (min.node.size), the sample with and without replacement (replace), and the fraction of observations to sample (sample.fraction). A data frame from all possible combinations of mtry, min.node.size, replace, and sample.fraction was then generated; the RF model was trained for each combination; and the best one in terms of root mean square error (RMSE) and mean absolute error (MAE) was selected (see Sect. 3.3.5 for more information). The optimum number of trees (num.trees) was finally investigated by running the RF model for 50 different num.trees values between 0 and 1000. Using the determined hyperparameters, the RF model is trained for making predictions; the performance of the model is evaluated; and the importance of the variables is calculated by (1) the Gini index (Fig. 10b), which indicates the number of times a variable is responsible for a split and the impact of that split divided by the number of trees, and (2) the permutation importance (Fig. 10c), which calculates the prediction accuracy in the out-of-bag observations and recomputes the prediction accuracy after eliminating any association between the variable of interest and the outcome by permuting the values of the variable under evaluation. The difference between the two accuracy values is the permutation importance for the given variable from a single tree. The average of the importance values for all the trees in an RF then gives the RF permutation importance of this variable.

3.3.3 Proposed methods: stepwise regression

Multiple linear regression is used to analyze the relationship between a single response variable (dependent variable) and two or more independent variables (predictor variables). Assuming we store P predictor variables (p=1,,P) for N locations (i=1,,N) in a matrix X{xi,p}, we could simply predict the thickness z{zi} using multiple linear regression:

(1) z = β X + ε ,

with β=[β1,,βP] being the vector of the regression coefficients and ε being a vector of i.i.d. (independent and identically distributed) error terms. However, not all the P variables are necessarily relevant for making predictions, and more accurate predictions may be obtained by a subset X̃x̃i,p,p=1,,P̃;P̃<P of predictor variables. Then, the final model can be written as follows:

(2) z = δ X ̃ + η ,

with δ=[δ1,,δP̃] being the vector of the selected best P̃<P variables and η being the new vector of error terms.

Different methods such as forward selection, backward elimination, and stepwise regression (STPW) are used for this aim. All these methods are based on a series of automated steps (Taylor and Tibshirani, 2015). A forward-selection approach initially assumes no predictor variable and adds the most statistically significant variable, one by one, until no more variables remain. On the contrary, the backward-elimination approach initially includes all predictor variables and then eliminates the least statistically significant variables one by one. However, the STPW method is a combination of forward selection and backward elimination. As with forward selection, the procedure starts with no variables and adds variables using a pre-specified criterion. At every step, the procedure also considers the statistical consequences of dropping the previously included variables. The STPW method is applied in this article with the R package StepReg (Li et al., 2020).

3.3.4 Proposed methods: combination approaches

The RF, STPW, GPT, SAPT, and SEPT models possess their own strengths and weaknesses. Previous studies showed that a combination of the predictions obtained with different methods allows for more accurate estimations (e.g., in the case of time series, see Eliott and Timmermann, 2004; Chan and Pauwels, 2018). Therefore, combination-based predictions are commonly used in many applicative fields (e.g., Cui et al., 2021; Nti et al., 2020; Wong et al., 2007; Yang, 2018).

One of the most common approaches for combining predictions is the stacking ensemble (Ganaie et al., 2022). It trains different models on the same dataset and generates predictions that become the input of a superior model (known as a second-level model; see Ribeiro and dos Santos Coelho, 2020). The fundamental concept behind stacking is that the optimal combination of the predictions of different models achieves better predictive performance compared to those obtained with single models.

Let us define, for each ith location, a vector consisting of K(k=1,,K) alternative predictive models z^i=[z^i,1z^i,2,,z^i,k,,z^i,K] which can be obtained considering some inputs as shown in Fig. 4. Then, a final prediction at the ith location can be defined according to

(3) z ̃ i = f ( z ^ i , ω ) ,

with ω=[ω1,ω2,,ωk,,ωK] being the vector of K weights associated with the K different competing spatial predictive models. In other words, the stacking ensemble z̃i is a function of the K predictions with different base models for the same location z^i. In particular, we assume a linear function:

(4) z ̃ i = ω z ^ i = k = 1 K ω k z ^ i , k .

In this framework, an important issue is the selection of the combination weights ω. For this aim, we can use subjective or objective weighting systems based on either expert evaluations or some statistical criteria. In this paper, we compare the performance of four objective weighting systems and choose the best one.

As the first approach, we consider the simple average (SA) combination, in which the K competing models are weighted equally; i.e., ωSA=[1K,,1K]. Despite its simplicity, this approach empirically provides a better performance compared to more sophisticated alternatives (Hsiao and Wan, 2014). Another commonly adopted approach for the optimal selection of combination weights is based on the variance minimization criterion (MV; see Bates and Granger, 1969; Newbold and Granger, 1974). Given a set of K competing predictive models, the weights are chosen by minimizing the variance of the prediction errors:

(5) min ω ω Σ e ω , with ι ω = 1 ,

with ι=1,1,,1 being a vector of ones and Σe being the K×K covariance matrix associated with the prediction errors of the K competing models. The optimal solution to this minimization problem is given by

(6) ω MV = Σ e - 1 ι ι Σ e - 1 ι ,

where Σe-1 is the inverse of the covariance matrix, also known as the precision matrix.

The third approach implemented for choosing combination weights ω is based on ordinary least squares (OLS) regression (Granger and Ramanathan, 1984), where ω can be chosen by considering the following linear regression:

(7) z i = ω 0 + k = 1 K ω k z ^ i , k + ε i ,

with ω0 being the constant term, ωk being the generic kth weight associated with the kth competing model, and εi being an i.i.d. error term. According to OLS combination, the weight vector ω=[ω1,,ωK] is obtained by solving the following minimization problem:

(8) min ω 1 , , ω K i = 1 N z i - k = 1 K ω k z ^ i , k 2 .

The OLS approach requires computing the weights in a training subset and using the selected ones in a test subset. It has the advantage of generating unbiased combined predictions without the need to investigate the bias for the individual models. This weighting approach is, however, sensitive to outliers. To address this issue, previous studies proposed the least absolute deviation (LAD) combination approach based on the minimization of the absolute loss function (Nowotarski et al., 2014):

(9) min ω 1 , , ω K i = 1 N z i - k = 1 K ω k z ^ i , k .

3.3.5 Accuracy evaluation

To evaluate and compare the performance of the K predictive models and their combinations, the root mean square error (RMSE) and the mean absolute error (MAE) are computed. Let us first define the prediction error ei,k of the kth model (including combinations of the models) for the observed ith location:

(10) e i , k = z i - z ^ i , k .

The RMSEk and MAEk are defined as follows:

(11)RMSEk=1Ni=1Nei,k2=1Ni=1Nzi-z^i,k2,(12)MAEk=1Ni=1Nei,k=1Ni=1Nzi-z^i,k.

Notice that the MAE loss is less affected by outliers than RMSE, and we prefer the models with lower MAE in case of ambiguity. The difference in the accuracy of the two competing models might also be due to randomness, especially for small differences. Therefore, several equal predictive accuracy (EPA) tests (Diebold and Mariano, 2002) were used in this article for comparing the K competing models and their combinations and for highlighting the statistically significant improvements in accuracy. Specifically, given two competing models k and k, we define a generic loss function g(⋅) of the prediction errors g(ei,k) and gei,k. In our case, we consider squared and absolute losses (RMSE and MAE, respectively). Let us define the loss differential vector d=[d1,d2,,dN], where, for each generic ith location,

(13) d i = g e i , k - g e i , k .

Under the null hypothesis, the vector d has zero mean, and the two competing models k and k have the same predictive accuracy. Under the alternative hypothesis, the two models are statistically different, and the best model is the one associated with the lowest statistical loss. In practice, the EPA test can be simply applied by regressing the loss differential vector d with a constant vector ι of ones and by conducting inference with robust standard errors to account for possible heteroskedasticity.

4 Description of the field-based thickness dataset and related predictor variables

We first explain the process of creating a subset from the dataset of Matano et al. (2023) (n=6671) to achieve our research objectives. Briefly, 41 measurements belonging to the Forio and Procida municipalities are excluded because they are not located on the mainland. The subset is populated with 18 predictor variables and is visualized in Figs. 5–8 to give an idea of the available data for detailed elaboration. It is noteworthy that the partial thickness measurements (n=493) are then culled, and the remaining 6137 points (Fig. 2) are considered for thickness modeling in the following section. The field-based thickness measurements range between 0 and 1450 cm. Most of the measurements refer to total thickness, representing the thickness of fallout pyroclastic deposits from the ground surface to the underlying bedrock (Fig. 5). The median of partial thickness values is 3 times greater than that of total thickness (approximately 200 and 60 cm, respectively). This is mainly due to the application of the probing tests and the hand-dug pits in the areas with thick fallout pyroclastic deposits, where the thickness is greater than the maximum survey depth reached by these two measurement methods: 300 and 200 cm respectively. The average and the range of thickness values mainly show the limitations of the measurement methods. The probing test is the leading methodology implemented in field surveys, and the measured values usually range from 10 to 300 cm. The recorded values are often <10 cm in outcrops and 200–800 cm in the measurements conducted with SPT and boreholes. The greatest thickness in the dataset is determined by seismic surveys (Fig. 5d).

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f05

Figure 5An overview of the field-based thickness measurements of the fallout pyroclastic deposit in study area: (a) proportion of total and partial measurements; (b) proportion of the measurements regarding the methodology; (c) proportion of the measurements in each municipality; and (d) variation of thickness considering the whole dataset, total and partial measurements, measurement methods, and municipalities. DS: the dataset, OCP: the outcrop, PRB: the probing test, TRN: the trench, HDP: the hand-dug pit, DPT: the dynamic cone penetration tests, BH: the borehole, SS: the seismic survey, SPT: the standard penetration test, NI: Nocera Inferiore, BI: Bagnoli Irpino, CER: Cervinara, VIT: Vitulano, GS: Gioia Sannitica, POZ: Pozzilli, and VSL: Villa Santa Lucia. It is noteworthy that the partial measurements are excluded before modeling thickness in this study.

Download

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f06

Figure 6Range of the values for predictor variables referred to in the whole dataset (DS) and in the total and partial measurements. DistHydroNet: the distance to hydrographic network, DistSource: the distance to the source, FlowAccum: the flow accumulation, FlowDirect: the flow direction, PlanCur: the plan curvature, and ProfileCur: the profile curvature. It is noteworthy that the partial measurements are excluded before thickness modeling in this study.

Download

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f07

Figure 7Range of values for the predictor variables referred to in the whole dataset (DS) and in the different measurement methods (from PRB to BH). See caption of Fig. 5 for the abbreviations. It is noteworthy that the partial measurements (n=493) are excluded before thickness modeling in this study.

Download

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f08

Figure 8Range of values for the predictor variables referred to in the whole dataset (DS) and in the municipalities (from CER to POZ). See caption of Fig. 5 for the abbreviations. It is noteworthy that the partial measurements (n=493) are excluded before thickness modeling in this study.

Download

The surveys were mainly conducted in the Cervinara, Nocera Inferiore, and Vitulano territories (43 %, 24 %, and 24 %, respectively; Fig. 5c). The median thickness values are above 60 cm in Nocera Inferiore and Cervinara, while they fluctuate around 35 cm in the other municipalities (Fig. 5d). The values of predictor variables assigned to the total and partial measurement points are almost similar, but the minor differences have some interesting interpretations (Fig. 6). Compared to the stations for total measurements, the median values in Fig. 6 show that thickness is partially recorded in the measurement points with lower altitudes, distances to the source, and slope degrees but higher distances to the hydrographic network (Fig. 6a, d, e, and m). In addition, the distance of these stations to the hydrographic network is greater than that of the total thickness measurement points (Fig. 6d). This last aspect confirms that the bedrock is usually not reached by investigation and that the thickness is therefore greater as one moves away from the hydrographic network, with the torrential erosive intensity being lower. The similarity between Fig. 5d (i.e., the three boxplots on the left) and Fig. 6r probably reveals that z0 is a good indicator of z.

The measurement points are also categorized in terms of the measurement methods (Fig. 7) to underline the category-based variation of the predictor variables. The insignificant variation of the variables for the stations investigated using borehole stratigraphic data explains the few field-based observations under this category. Standard penetration tests and seismic surveys are the preferred methods in the low-altitude locations far from the hydrographic network and near the source (Fig. 7a, d, and e). On the contrary, the thickness of the fallout pyroclastic deposits is investigated via trenches and outcrops at the measurement points farthest from the source (Fig. 7e). The lowest computed z0 values are related to these stations as well (Fig. 7r). Curvature, plan curvature, profile curvature, and topographic wetness index (TWI) show very similar distributions regardless of the methodology (Fig. 7c, k, l, and q). The ranges of the predictor values are compared for different municipalities (Fig. 8) to provide some spatial information on the variables. For instance, Bagnoli Irpino and Villa Santa Lucia occur at the highest and lowest altitudes, respectively (Fig. 8a). The latter has the lowest distance to the hydrographic network, along with the minimum calculated z0 and measured z values (Figs. 8d, r, and 5d, respectively). Cervinara and Nocera Inferiore are the closest municipalities to the source (Fig. 8e), where the greatest z0 values are calculated (Fig. 8r), above 65 % of the measurements are performed (Fig. 5c), and the highest z values are recorded (the right panel of Fig. 5d). Taking into account Figs. 1 and 8r, significantly higher fallout pyroclastic deposits were emplaced by volcanic activities in the eastern sector of the study area. The discussion in this section indicates a relatively high level of heterogeneity in the variables. The presence of many data outliers can also be visually verified in Figs. 5–8. These characteristics of the data suggest a complex relationship between the predictor variables and the thickness of fallout pyroclastic deposits, which can be better modeled by means of machine learning techniques rather than by the standard models employed in the literature.

5 The estimated thickness of fallout pyroclastic deposits: results

In this article, we proposed applying RF and STPW approaches that train a model on the training subset and make predictions on the test subset. The dataset of 6137 measurement points was, therefore, randomly divided into training (n=4294) and test (n=1843) subsets (Fig. 4), and the same subsets were used in all predictive models to evaluate the performance of the competing models. For the training data, the thickness is estimated by GPT, SAPT, SEPT, STPW, and RF models. The predictions for different models are then combined and the associated errors are investigated. Finally, the prediction errors are computed for both the training and test subsets, and the predictive accuracy tests are implemented for evaluating differences in estimations. This section explains the results in detail.

5.1 Training stepwise regression and random forest

The STPW model is used for selecting the best subset of variables in terms of explicative power for the pyroclastic thickness deposit. Given an initial set of 18 independent variables, only 8 relevant ones are chosen by the STPW model for making predictions (Table 3): distance to the hydrographic network, distance to the source, altitude, z0, aspect, plan curvature, modified secondary soil-adjusted vegetation index (MSAVI2), and normalized clay index (NCI) (for a detailed description of the variables, see Table 2). All these variables are very relevant in determining the thickness of fallout pyroclastic deposits as they control the erosion–deposition processes. The estimated parameters shown in Table 3 are then used for predicting the thickness values in the test subset.

Table 3List of the variables selected by STPW model for making predictions in the training subset. The estimated parameters, standard errors, and p values are also reported.

Download Print Version | Download XLSX

In order to train random forest model representatively, different values are assigned to mtry, min.node.size, replace, and sample.fraction, and a list of all possible combinations of the hyperparameters (882 in our case) is generated. The random forest is then trained for all combinations, and the optimum value for each hyperparameter (mtry=5, min.node.size = 17, replace = True, and sample.fraction = 0.632) is determined based on the model with the least error. The out-of-bag error is then investigated for different number of trees, and num.trees = 530 is determined for training a model with the least error (Fig. S1a). Figure S1b and c show that altitude, z0, normalized difference vegetation index (NDVI), distance to the hydrographic network, NCI, and topsoil grain size index (TGSI) account for the most important variables in training the model based on both variable importance metrics (i.e., impurity and permutation). Compared to the STPW model, distance to the source is the only variable excluded before RF modeling to avoid unrealistic estimations.

5.2 Predicted thickness values

Table 4 shows the RMSE and MAE for all models applied to both the training and test subsets. The relevant differences in prediction accuracy are evident (e.g., RMSE = 184.52 and MAE = 157.10 for SAPT vs. RMSE = 79.11 and MAE = 46.44 for RF). Considering the single models applied to the training subset, both the STPW and RF models improve the predictive accuracy measures compared with GPT, SAPT, and SEPT. It is worth mentioning that the RF model (RMSE = 79.11 and MAE = 46.44) outperforms other approaches (RMSE >89 and MAE >55) for the training dataset. This suggests that the predictor variables discussed in Sect. 5.1 play a crucial role in estimating the thickness of fallout pyroclastic deposits.

Table 4Prediction accuracy results. Best model in bold. GPT refers to geomorphological pyroclastic thickness, SAPT: slope angle pyroclastic thickness, SEPT: slope exponential pyroclastic thickness, STPW: stepwise regression, RF: random forest, SA: the simple average, MV: the minimum variance, OLS: ordinary least squares, and LAD: least absolute deviation.

Download Print Version | Download XLSX

In the next step, the thickness predictions for the training subset (obtained from the single models) underwent a combination approach using the weighting schemes in Table 5. Except for the SA method that assigns equal weights to the models, all the weighting schemes assign the largest weight to RF, which is the most accurate one among the single models. The accuracy measures for the combination models are shown in the lower part of Table 4. According to the results for the training subset (Table 4), the RMSE shows that the predictions of the SA approach are the worst, while the MV approach provides better accuracy than the RF model. The MAE function indicates that the LAD method improves the accuracy by about 5 % compared to the RF model.

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f09

Figure 9Cumulative probability of the field-based thickness measurements (z) against the estimations by single models and combination approaches (a and b, respectively). In panel (c), only z, the estimations by the best single model, and the estimations by the best combination approach are visualized. However, panel (d) represents the difference between z and the thickness estimated by the best single model and the thickness estimated by the best combination technique. In this figure, z represents thickness, but the subscript refers to the method of estimation. The estimations (n=6137) for both the training and test subsets are visualized. GPT: geomorphological pyroclastic thickness, SAPT: slope angle pyroclastic thickness, SEPT: slope exponential pyroclastic thickness, STPW: stepwise regression, RF: random forest, SA: simple average, MV: the minimum variance, OLS: ordinary least squares, and LAD: least absolute deviation.

Download

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f10

Figure 10Spatial visualization of the predicted thickness of fallout pyroclastic deposits by the random forest as the best single model (a) and by LAD as the best combination approach (b) for the Vitulano municipality. It is worth mentioning that the random forest predictions are classified by the Jenks Natural Breaks algorithm, and the thresholds are implemented for generating the map of LAD predictions. Please see the text for more information about A, B, and C labels, together with the black box. Although symbols of different shapes and colors are used for the classes, the spatially dense measurement points do not allow an application of a larger symbol size. In the electronic version of this article, please zoom in on the figure to distinguish between different symbols based on the shape (if required).

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f11

Figure 11Spatial visualization of the predicted thickness of fallout pyroclastic deposits by the random forest as the best single model (a) and by LAD as the best combination approach (b) for the Cervinara municipality. It is worth mentioning that the random forest predictions are classified by the Jenks Natural Breaks algorithm, and the thresholds are implemented for generating the map of LAD predictions. Please see the text for more information about the black boxes. Although symbols of different shapes and colors are used for the classes, the spatially dense measurement points do not allow an application of a larger symbol size. In the electronic version of this article, please zoom in on the figure to distinguish between different symbols based on the shape (if required).

Table 5The weights for making predictions in the training subset with combination approaches. The abbreviations are as defined in Fig. 3.

Download Print Version | Download XLSX

Then both single models and the alternative combination approaches for the test subset are compared in terms of RMSE and MAE (see the last two columns of Table 4). Among the single models, RF provides the most accurate results (RMSE = 82.46 and MAE = 55.20), but the combination of the RF predictions with those of the other models enhances the accuracy. In particular, the OLS approach reduces the RMSE to 82.42, and the LAD method lowers the MAE to 45.12. Regarding the lower sensitivity of the MAE to data outliers, the LAD combination could be considered to be the most representative model. The LAD combination improved the accuracy by 7.2 % compared to the RF model. The improvement is above 26 % respect to the GPT model, the best single model proposed in the previous studies.

Finally, pairwise EPA tests are applied to account for the uncertainty in the results and to investigate whether the differences in predictive performance are statistically significant in the test subset (Table 6).

The upper part of Table 6 compares RF as the best single model with the other single models and reveals that the differences are statistically significant. In other words, the observed differences in the predictive performance of the models are not explained by the randomness of the data, and the RF model provides the most accurate thickness estimation. The negative RMSE and MAE values indicate that the RF model has a lower average prediction error than the other single models. On the other hand, the lower part of Table 6 shows the pairwise comparison of the RF model with all the combination approaches. A positive RMSE or MAE value suggests that the predictions of the combination approach are more accurate. In terms of squared error loss, the OLS combination approach provides the most accurate predictions compared to the RF model, but it is noteworthy that the RMSE and OLS are both sensitive to data outliers, as explained in Sect. 3.3. Regarding the MAE, statistically significant improvement is observed when combination approaches are applied (except for the SA method; p<0.01). The greatest statistically significant constant value of the LAD method demonstrates that this combination technique is suitable for predicting the thickness of fallout pyroclastic deposits in unmeasured locations. The results are in accordance with those in Table 4.

Table 6Equal predictive accuracy tests, applied to the test subset (n=1843), for comparing the best single model (RF) with the other ones. Under the null hypothesis, the two models provide equal predictive accuracy. SE refers to the heteroskedasticity robust standard error. The abbreviations are as defined in Fig. 3. *** indicates p value <0.01.

Download Print Version | Download XLSX

6 Discussion

In this section, the cumulative probabilities of the estimated thickness (for both the training and test subsets) obtained by various methods are compared in Fig. 9a and b. Regarding the single models (Fig. 9a), significant underestimation of the SEPT model and noticeable overestimation of the SAPT model are evident. These models have the highest RMSE and MAE values among the single models (Table 4). The distributions of the values obtained by the GPT, RF, and STPW models share more similarity with those of field-based measurements. Although the STPW model performs better than RF in predicting the smaller values, the RF model outperforms the single models, which agrees with Table 4. Taking Fig. 9b into account, all combination techniques work more effectively than the LAD approach in the upper 25 % of thickness values. The overall estimations of the LAD approach are, however, more consistent with the field-based measurements, being confirmed by the accuracy measures in Table 4.

The results of the RF model and the LAD approach are also visualized in Fig. 9c for the sake of comparison, representing that the values estimated by the former are greater than those of the latter. The LAD approach outperforms the RF model in a wide range of the field-based thickness values, and it is, therefore, the best model for predicting the thickness of fallout pyroclastic deposits in the study area. Figure 9d demonstrates that the LAD estimations are less biased.

The estimated thickness values of fallout pyroclastic deposits obtained by the RF model and the LAD combination approach in Vitulano, Cervinara, and Nocera Inferiore are visualized in Figs. 10–12 to investigate the differences between the spatial patterns. Although the spatial distribution remained generally unchanged, the legends of the maps reveal that the estimated thickness values obtained by the LAD combination approach decreased, as shown before (Fig. 9c). For instance, in Vitulano (Fig. 10), the estimations range from 10 to 196 cm in the RF model, which is reduced to 1–177 cm in the LAD combination approach. This is also evident in most estimations of the bottom-right box, in which the values are reduced from 66–94 cm to about 32–66 cm, or in the top-left corner of the map: A (47–196 cm) > C (32–66 cm) > B (32–47 cm) in the RF model vs. A (47–94 cm) > C (32–47 cm) > B (1–32 cm) in the LAD combination approach. The same decline could be observed in Cervinara and Nocera Inferiore as well. To facilitate a quick comparison, two boxes are drawn in Figs. 11 and 12 that highlight the sectors with a clear change. A few estimations exceed 113 cm in the lower panel of Fig. 11, which is contrary to the upper panel generated by the RF model in Cervinara.

https://essd.copernicus.org/articles/16/4161/2024/essd-16-4161-2024-f12

Figure 12Spatial visualization of the predicted thickness of fallout pyroclastic deposits by the random forest as the best single model (a) and by the LAD as the best combination approach (b) for the Nocera Inferiore municipality. It is worth mentioning that the RF predictions are classified by the Jenks Natural Breaks algorithm, and the thresholds are implemented for generating the map of LAD predictions. Please see the text for more information about the black boxes. Although symbols of different shapes and colors are used for the classes, the spatially dense measurement points do not allow the application of a larger symbol size. In the electronic version of this article, please zoom in on the figure to distinguish between different symbols based on the shape (if required).

7 Data availability

The field-based thickness measurements of fallout pyroclastic deposits are accessible on Zenodo (Matano et al., 2023; https://doi.org/10.5281/zenodo.8399487).

8 Conclusion and future research direction

A given volcano might have several eruptive events. In an explosive volcanic eruption, the height of the ash plume and the wind characteristics mainly determine the ash dispersal pattern. However, the expected spatial thickness may be continuously altered by the soil-forming and denudation processes. It is, therefore, a daunting task to estimate thickness of fallout pyroclastic deposits that we observe today. The GPT, SAPT, and SEPT models were proposed in previous studies to address this issue, but this article tries to apply other models for the first time to estimate thickness more accurately around the Somma–Vesuvius, Phlegrean Fields, and Roccamonfina volcanoes in Campania, southern Italy.

First, we prepared a database of 6137 field-based thickness measurements with 18 predictor variables. Second, the STPW model and the RF machine learning technique were implemented for thickness modeling, and the results were compared with those of the GPT, SAPT, and SEPT models. The RF estimations (RMSE = 79.11 and MAE = 46.44 for the training subset and RMSE = 82.46 and MAE = 48.36 for the test subset) evidently outperformed the other models (RMSE >89.60 and MAE >55.25 for the training subset and RMSE = 92.35 and MAE >55.20 for the test subset). Third, the SA, LAD, MV, and OLS approaches were considered to combine the predictions of the five above-mentioned single models and to obtain more accurate thickness estimations. It was indicated that the LAD approach returns the best results in terms of MAE. Thus, the estimations with the RF and LAD methods (as a single model and a combination approach, respectively) were less biased in Campania. The thickness values obtained from the RF and LAD in Vitulano, Cervinara, and Nocera Inferiore were applied for spatial analysis, and it was demonstrated that the estimated values of the LAD approach are smaller than those of RF, but the spatial patterns do not change significantly. The results showed that following the methodology in this article and generating a map using the estimations of the LAD combination approach provides the most representative estimations in the study area.

In the future, we will consider a set of more representative predictor variables (if any) and collect a larger field-based thickness measurement dataset for estimating thickness in the unmeasured locations (i.e., out-of-sample predictions from a statistical point of view) more accurately. This would help in generating a regional map of the thickness of fallout pyroclastic deposits in Campania, which plays a key role in hydrogeological and volcanological studies and in managing geohazards in the areas covered with loose pyroclastic materials. Furthermore, we will aim to define the best statistical combinations at local levels by means of cluster-wise techniques.

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/essd-16-4161-2024-supplement.

Author contributions

All the authors wrote the original draft and revised the paper. FM, PE, and VA: geology, geomorphology, and volcanological history; RM, GS, and PE: statistical analyses; FM: field measurements and supervision of the research activity.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The authors acknowledge Paola Petrosino for suggestions about the volcanic history of the study area. Thanks are extended to Massimo Cesarano and Annarita Casaburi for the constructive comments.

Review statement

This paper was edited by Giulio G.R. Iovine and reviewed by Gema Ferndànez-Avilés and one anonymous referee.

References

Albert, P. G., Giaccio, B., Isaia, R., Costa, A., Niespolo, E. M., Nomade, S., Pereira, A., Renne, P. R., Hinchliffe, A., Mark, D. F., Brown, R. J., and Smith, V. C.: Evidence for a large-magnitude eruption from Campi Flegrei caldera (Italy) at 29 ka, Geology, 47, 595–599, https://doi.org/10.1130/G45805.1, 2019. 

Amato, V., Aucelli, P. P. C., Cesarano, M., Filocamo, F., Leone, N., Petrosino, P., Rosskopf, C. M., Valente, E., Casciello, E., Giralt, S., and Jicha, B. R.: Geomorphic response to late Quaternary tectonics in the axial portion of the Southern Apennines (Italy): A case study from the Calore River valley, Earth Surf. Proc. Land., 43, 2463–2480, https://doi.org/10.1002/esp.4390, 2018. 

Andronico, D., Calderoni, G., Cioni, R., Sbrana, A., Sulpizio, R., and Santacroce, R.: Geological map of Somma-Vesuvius volcano, Period. Mineral., 64, 77–78, 1995. 

Andronico, D. and Cioni, R.: Contrasting styles of Mount Vesuvius activity in the period between the Avellino and Pompeii Plinian eruptions, and some implications for assessment of future hazards, B. Volcanol., 64, 372–391, https://doi.org/10.1007/s00445-002-0215-4, 2002. 

Arrighi, S., Principe, C., and Rosi, M.: Violent strombolian and subplinian eruptions at Vesuvius during post-1631 activity, B. Volcanol., 63, 126–150, https://doi.org/10.1007/s004450100130, 2001. 

Barberi, F., Innocenti, F., Lirer, L., Munno, R., Pescatore, T., and Santacroce, R.: The Campanian Ignimbrite: a major prehistoric eruption in the Neapolitan area (Italy), Bulletin Volcanologique, 41, 10–31, https://doi.org/10.1007/BF02597680, 1978. 

Bates, J. M. and Granger, C. W.: The combination of forecasts, J. Oper. Res. Soc., 20, 451–468, 1969. 

Bertagnini, A., Cioni, R., Guidoboni, E., Rosi, M., Neri, A., and Boschi, E.: Eruption early warning at Vesuvius: The A.D. 1631 lesson, Geophys. Res. Lett., 33, L18317, https://doi.org/10.1029/2006GL027297, 2006. 

Beven, K. J. and Kirkby, M. J.: A physically based, variable contributing area model of basin hydrology/Un modèle à base physique de zone d'appel variable de l'hydrologie du bassin versant, Hydrolog. Sci. J., 24, 43–69, https://doi.org/10.1080/02626667909491834, 1979. 

Bonardi, G., Ciarcia, S., Di Nocera, S., Matano, F., Sgrosso, I., and Torre, M.: Carta delle principali unità cinematiche dell'Appennino meridionale, Nota illustrativa. Boll. Soc. Geol. It., 128, 47–60, 2009. 

Boncio, P., Auciello, E., Amato, V., Aucelli, P., Petrosino, P., Tangari, A. C., and Jicha, B. R.: Late Quaternary faulting in the southern Matese (Italy): implications for earthquake potential and slip rate variability in the southern Apennines, Solid Earth, 13, 553–582, https://doi.org/10.5194/se-13-553-2022, 2022. 

Bourne, A. J., Lowe, J. J., Trincardi, F., Asioli, A., Blockley, S. P. E., Wulf, S., Matthews, I. P., Piva, A., and Vigliotti, L.: Distal tephra record for the last ca 105,000 years from core PRAD 1-2 in the central Adriatic Sea: implications for marine tephrostratigraphy, Quaternary Sci. Rev., 29, 3079–3094, https://doi.org/10.1016/j.quascirev.2010.07.021, 2010. 

Brown, R. J., Bonadonna, C., and Durant, A. J.: A review of volcanic ash aggregation, Phys. Chem. Earth Pt. A/B/C, 45, 65–78, https://doi.org/10.1016/j.pce.2011.11.001, 2012. 

Burrough, P. A. and McDonell, R. A.: Principles of Geographical Information Systems, Oxford University Press, Oxford, UK, 350 pp., ISBN 0198233663, 1998. 

Cannatelli, C., Spera, F. J., Bodnar, R. J., Lima, A., and De Vivo, B.: Ground movement (bradyseism) in the Campi Flegrei volcanic area: a review, in: Vesuvius, Campi Flegrei, and Campanian Volcanism, edited by: De Vivo, B., Belkin, H. E., and Rolandi, G., Elsevier, Amsterdam, the Netherlands, 407–433, https://doi.org/10.1016/B978-0-12-816454-9.00015-8, 2020. 

Cappelletti, P., Cerri, G., Colella, A., de'Gennaro, M., Langella, A., Perrotta, A., and Scarpati, C.: Post-eruptive processes in the Campanian Ignimbrite, Miner. Petrol., 79, 79–97, https://doi.org/10.1007/s00710-003-0003-7, 2003. 

Caron, B., Siani, G., Sulpizio, R., Zanchetta, G., Paterne, M., Santacroce, R., Tema, E., and Zanella, E.: Late Pleistocene to Holocene tephrostratigraphic record from the northern Ionian Sea, Mar. Geol., 311, 41–51, https://doi.org/10.1016/j.margeo.2012.04.001, 2012. 

Catani, F., Segoni, S., and Falorni, G.: An empirical geomorphology‐based approach to the spatial prediction of soil thickness at catchment scale, Water Resour. Res., 46, W05508, https://doi.org/10.1029/2008WR007450, 2010. 

Chan, F. and Pauwels, L. L.: Some theoretical results on prediction combinations, International Journal of Predictioning, 34, 64–74, https://doi.org/10.1016/j.ijforecast.2017.08.005, 2018. 

Chiodini, G., Frondini, F., Cardellini, C., Granieri, D., Marini, L., and Ventura, G.: CO2 degassing and energy release at Solfatara volcano, Campi Flegrei, Italy, J. Geophys. Res.-Sol. Ea., 106, 16213–16221, https://doi.org/10.1029/2001JB000246, 2001. 

Cioni, R., Sulpizio, R., and Garruccio, N.: Variability of the eruption dynamics during a subplinian event: The Greenish Pumice eruption of Somma-Vesuvius (Italy), J. Volcanol. Geoth. Res., 124, 89–114, https://doi.org/10.1016/S0377-0273(03)00070-2, 2003. 

Cioni, R., Pistolesi, M., and Rosi, M.: Plinian and Subplinian Eruptions. In: Encyclopedia of Volcanoes, 2nd edn., edited by: Sigurdsson, H., Elsevier Science Publishing, London, UK, https://doi.org/10.1016/B978-0-12-385938-9.00029-8, 2015. 

Civetta, L., Orsi, G., Pappalardo, L., Fisher, R. V., Heiken, G., and Ort, M.: Geochemical zoning, mingling, eruptive dynamics and depositional processes – the Campanian Ignimbrite, Campi Flegrei caldera, Italy. J. Volcanol. Geoth. Res., 75, 183–219, https://doi.org/10.1016/S0377-0273(96)00027-3, 1997. 

Costa, A., Dell'Erba, F., Di Vito, M. A., Isaia, R., Macedonio, G., Orsi, G., and Pfeiffer, T.: Tephra fallout hazard assessment at the Campi Flegrei caldera (Italy), B. Volcanol., 71, 259–273, https://doi.org/10.1007/s00445-008-0220-3, 2009. 

Costa, A., Folch, A., Macedonio, G., Giaccio, B., Isaia, R., and Smith, V. C.: Quantifying volcanic ash dispersal and impact of the Campanian Ignimbrite super-eruption, Geophys. Res. Lett., 39, L10310, https://doi.org/10.1029/2012GL051605, 2012. 

Costa, A., Di Vito, M. A., Ricciardi, G. P., Smith, V. C., and Talamo, P.: The long and intertwined record of humans and the Campi Flegrei volcano (Italy), B. Volcanol., 84, 5, https://doi.org/10.1007/s00445-021-01503-x, 2022. 

Cui, S., Yin, Y., Wang, D., Li, Z., and Wang, Y.: A stacking-based ensemble learning method for earthquake casualty prediction, Appl. Soft Comput., 101, 107038, https://doi.org/10.1016/j.asoc.2020.107038, 2021. 

Cuomo, S., Masi, E. B., Tofani, V., Moscariello, M., Rossi, G., and Matano, F.: Multiseasonal probabilistic slope stability analysis of a large area of unsaturated pyroclastic soils, Landslides, 18, 1259–1274, https://doi.org/10.1007/s10346-020-01561-w, 2021. 

de Lorenzo, S., Gasparini, P., Mongelli, F., and Zollo, A.: Thermal state of the Campi Flegrei caldera inferred from seismic attenuation tomography, J. Geodyn., 32, 467–486, https://doi.org/10.1016/S0264-3707(01)00044-8, 2001. 

De Vita, P. and Nappi, M.: Regional distribution of ash-fall pyroclastic soils for landslide susceptibility assessment in: Landslide Science and Practice, edited by: Margottini, C., Canuti, P., and Sassa, K., Springer, Berlin, Germany, 103–109, https://doi.org/10.1007/978-3-642-31310-3_15, 2013. 

De Vita, P., Agrello, D., and Ambrosino, F.: Landslide susceptibility assessment in ash-fall pyroclastic deposits surrounding Mount Somma-Vesuvius: Application of geophysical surveys for soil thickness mapping, J. Appl. Geophys., 59, 126–139, https://doi.org/10.1016/j.jappgeo.2005.09.001, 2006. 

De Vita, S., Orsi, G., Civetta, L., Carandente, A., D'Antonio, M., Deino, A., Di Cesare, T., Di Vito, M. A., Fisher, R. V., Isaia, R. Marotta, E., Necco, A., Ort, M., Pappalardo, L., Piochi, M., and Southon, J.: The Agnano–Monte Spina eruption (4100 years BP) in the restless Campi Flegrei caldera (Italy), J. Volcanol. Geoth. Res., 91, 269–301, https://doi.org/10.1016/S0377-0273(99)00039-6, 1999. 

De Vita, S., Sansivero, F., Orsi, G., and Marotta, E.: Cyclical slope instability and volcanism related to volcano-tectonism in resurgent calderas: the Ischia island (Italy) case study, Eng. Geol., 86, 148–165, https://doi.org/10.1016/j.enggeo.2006.02.013, 2006. 

De Vivo, B., Rolandi, G., Gans, P. B., Calvert, A., Bohrson, W. A., Spera, F. J., and Belkin, H. E.: New constraints on the pyroclastic eruptive history of the Campanian volcanic Plain (Italy), Miner. Petrol., 73, 47–65, https://doi.org/10.1007/s007100170010, 2001. 

Deino, A. L., Orsi, G., De Vita, S., and Piochi, M.: The age of the Neapolitan Yellow Tuff caldera-forming eruption (Campi Flegrei caldera–Italy) assessed by 40Ar/39Ar dating method. J. Volcanol. Geoth. Res., 133, 157–170, https://doi.org/10.1016/S0377-0273(03)00396-2, 2004. 

Del Soldato, M., Segoni, S., De Vita, P., Pazzi, V., Tofani, V., and Moretti, S.: Thickness model of pyroclastic soils along mountain slopes of Campania (southern Italy), in: Landslides and engineered slopes. Experience, theory and practice. Proceedings of the 12th International Symposium on Landslides, Napoli, Italy, 12–19 June, edited by: Aversa, S., Cascini, L., Picarelli, L., and Scavia C., CRC Press, Boca Raton, FL, 797–804, 2016. 

Del Soldato, M., Pazzi, V., Segoni, S., De Vita, P., Tofani, V., and Moretti, S.: Spatial modeling of pyroclastic cover deposit thickness (depth to bedrock) in peri-volcanic areas of Campania (southern Italy), Earth Surf. Proc. Land., 43, 1757–1767, https://doi.org/10.1002/esp.4350, 2018. 

Dellino, P., Dioguardi, F., Isaia, R., Sulpizio, R., and Mele, D.: The impact of pyroclastic density currents duration on humans: the case of the AD 79 eruption of Vesuvius, Sci. Rep.-UK, 11, 4959, https://doi.org/10.1038/s41598-021-84456-7, 2021. 

Diebold, F. X. and Mariano, R. S.: Comparing predictive accuracy, J. Bus. Econ. Stat., 20, 134–144, 2002. 

Di Nocera, S., Matano, F., Pescatore, T., Pinto, F., Quarantiello, R., Senatore, M., and Torre, M.: Schema geologico del transetto Monti Picentini orientali–Monti della Daunia meridionali: unità stratigrafiche ed evoluzione tettonica del settore esterno dell'Appennino meridionale. Boll. Soc. Geol. Ital., 125, 39–58, 2006. 

Di Renzo, V., Arienzo, I., Civetta, L., D'Antonio, M., Tonarini, S., Di Vito, M. A., and Orsi, G.: The magmatic feeding system of the Campi Flegrei caldera: architecture and temporal evolution, Chem. Geol., 281, 227–241, https://doi.org/10.1016/j.chemgeo.2010.12.010, 2011. 

Di Vito, M., Lirer, L., Mastrolorenzo, G., and Rolandi, G.: The 1538 Monte Nuovo eruption (Campi Flegrei, Italy), B. Volcanol., 49, 608–615, https://doi.org/10.1007/BF01079966, 1987. 

Di Vito, M. A., Isaia, R., Orsi, G., Southon, J. D., De Vita, S., D'Antonio, M., Pappalardo, L., and Piochi, M.: Volcanism and deformation since 12,000 years at the Campi Flegrei caldera (Italy), J. Volcanol. Geoth. Res., 91, 221–246, https://doi.org/10.1016/S0377-0273(99)00037-2, 1999. 

Di Vito, M. A., Sulpizio, R., Zanchetta, G., and D'Orazio, M.: The late Pleistocene pyroclastic deposits of the Campanian Plain: new insights into the explosive activity of Neapolitan volcanoes, J. Volcanol. Geoth. Res., 177, 19–48, https://doi.org/10.1016/j.jvolgeores.2007.11.019, 2008. 

Di Vito, M. A., Acocella, V., Aiello, G., Barra, D., Battaglia, M., Carandente, A., Del Gaudio, C., De Vita, S., Ricciardi, G. P., Ricco, C. Scandone, R., and Terrasi, F.: Magma transfer at Campi Flegrei caldera (Italy) before the 1538 AD eruption, Sci. Rep.-UK, 6, 32245, https://doi.org/10.1038/srep32245, 2016. 

Di Vito, M. A., Talamo, P., de Vita, S., Rucco, I., Zanchetta, G., and Cesarano, M.: Dynamics and effects of the Vesuvius Pomici di Avellino Plinian eruption and related phenomena on the Bronze Age landscape of Campania region (Southern Italy), Quatern. Int., 499, 231–244, https://doi.org/10.1016/j.quaint.2018.03.021, 2019. 

Doglioni, C.: A proposal for the kinematic modelling of W-dipping subductions-possible applications to the Tyrrhenian-Apennines system, Terra Nova, 3, 423–434, https://doi.org/10.1144/SP288.3, 1991. 

Doronzo, D. M., Di Vito, M. A., Arienzo, I., Bini, M., Calusi, B., Cerminara, M., Corradini, S., De Vita, S., Giaccio, B., Gurioli, L., Mannella, G., Ricciardi, G. P., Rucco, I., Sparice, D., Todesco, M., Trasatti, E., and Zanchetta, G.: The 79 CE eruption of Vesuvius: A lesson from the past and the need of a multidisciplinary approach for developments in volcanology, Earth-Sci. Rev., 231, 104072, https://doi.org/10.1016/j.earscirev.2022.104072, 2022. 

Ducci, D. and Tranfaglia, G.: Effects of climate change on groundwater resources in Campania (southern Italy), Geol. Soc. Spec. Publ., 288, 25–38, https://doi.org/10.1144/SP288.3, 2008. 

Elliott, G. and Timmermann, A.: Optimal forecast combinations under general loss functions and forecast error distributions, J. Econometrics, 122, 47–79, https://doi.org/10.1016/j.jeconom.2003.10.019, 2004. 

Engwell, S. L., Sparks, R. S. J., and Carey, S.: Physical characteristics of tephra layers in the deep sea realm: the Campanian Eruption, Geol. Soc. Spec. Publ., 398, 47–64, https://doi.org/10.1144/SP398.7, 2014. 

Eychenne, J. and Engwell, S. L.: The grainsize of volcanic fall deposits: Spatial trends and physical controls, Geol. Soc. Am. Bull., 135, 1844–1858, https://doi.org/10.1130/B36275.1, 2023. 

Ferranti, L. and Oldow, J. S.: Latest Miocene to Quaternary horizontal and vertical displacement rates during simultaneous contraction and extension in the Southern Apennines orogen, Italy, Terra Nova, 17, 209–214, https://doi.org/10.1111/j.1365-3121.2005.00593.x, 2005. 

Fisher, R. V., Orsi, G., Ort, M., and Heiken, G.: Mobility of a large-volume pyroclastic flow – emplacement of the Campanian ignimbrite, Italy, J. Volcanol. Geoth. Res., 56, 205–220, https://doi.org/10.1016/0377-0273(93)90017-L, 1993. 

Frick, H., Chow, F., Kuhn, M., Mahoney, M., Silge, J., and Wickham, H.: rsample: General Resampling Infrastructure, R package version 1.1.1, CRAN [code], https://CRAN.R-project.org/package=rsample (last access: 12 September 2023), 2022. 

Ganaie, M. A., Hu, M., Malik, A. K., Tanveer, M., and Suganthan, P. N.: Ensemble deep learning: A review, Eng. Appl. Artif. Intel., 115, 105151, https://doi.org/10.1016/j.engappai.2022.105151, 2022. 

Giaccio, B., Isaia, R., Fedele, F. G., Di Canzio, E., Hoffecker, J., Ronchitelli, A., Sinitsyn, A. A., Anikovich, M., Lisitsyn, S. N., and Popov, V. V.: The Campanian Ignimbrite and Codola tephra layers: two temporal/stratigraphic markers for the Early Upper Palaeolithic in southern Italy and eastern Europe, J. Volcanol. Geoth. Res., 177, 208–226, https://doi.org/10.1016/j.jvolgeores.2007.10.007, 2008. 

Giannetti, B.: Origin of the calderas and evolution of Roccamonfina volcano (Roman Region, Italy), J. Volcanol. Geoth. Res., 106, 301–319, https://doi.org/10.1016/S0377-0273(00)00259-6, 2001. 

Granger, C. W. and Ramanathan, R.: Improved methods of combining forecasts, J. Forecasting, 3, 197–204, https://doi.org/10.1002/for.3980030207, 1984. 

Greenlee, D. D.: Raster and vector processing for scanned linework, Photogramm. Eng. Rem. S., 53, 1383–1387, 1987. 

Guidoboni, E. and Ciuccarelli, C.: The Campi Flegrei caldera: historical revision and new data on seismic crises, bradyseisms, the Monte Nuovo eruption and ensuing earthquakes (twelfth century 1582 AD), B. Volcanol., 73, 655–677, https://doi.org/10.1007/s00445-010-0430-3, 2011. 

Hofmann-Wellenhof, B., Lichtenegger, H., and Collins, J.: Global positioning system: theory and practice, Springer-Verlag Wien GmbH, Vienna, Austria, https://doi.org/10.1007/978-3-7091-6199-9, 2001. 

Hsiao, C. and Wan, S. K.: Is there an optimal forecasting combination? J. Econometrics, 178, 294–309, https://doi.org/10.1016/j.jeconom.2013.11.003, 2014. 

Ihlen, V.: Landsat 8 Data Users Handbook, US Geological Survey, Sioux Falls, USA, LSDS-1574, Version 5.0, 2019. 

Isaia, R., D'Antonio, M., Dell'Erba, F., Di Vito, M., and Orsi, G.: The Astroni volcano: the only example of closely spaced eruptions in the same vent area during the recent history of the Campi Flegrei caldera (Italy), J. Volcanol. Geoth. Res., 133, 171–192, https://doi.org/10.1016/S0377-0273(03)00397-4, 2004. 

Isaia, R., Marianelli, P., and Sbrana, A.: Caldera unrest prior to intense volcanism in Campi Flegrei (Italy) at 4.0 ka BP: Implications for caldera dynamics and future eruptive scenarios, Geophys. Res. Lett., 36, L21303, https://doi.org/10.1029/2009GL040513, 2009. 

Jensen, J. R.: Introductory Digital Image Processing: A Remote Sensing Perspective, Pearson Education, London, UK, 656 pp., ISBN 9780134058160, 2015. 

Jenson, S. K. and Domingue, J. O.: Extracting topographic structure from digital elevation data for geographic information system analysis, Photogramm. Eng. Rem. S., 54, 1593–1600, 1988. 

Kienast-Brown, S., Libohova, Z., USDA-NRCS, and Boettinger, J.: Digital soil mapping, in: Soil science division staff. Soil survey manual, edited by: Ditzler, C., Scheffe, K., and Monger, H. C., USDA Handbook No. 18, 295–354, 2017. 

Krakiwsky, E. J. and Wells, D. E.: Coordinate Systems In Geodesy, Department of Geodesy and Geomatics Engineering, University of New Brunswick, Lecture Notes No. 16, 115 pp., 1971. 

Lancaster, P. and Salkauskas, K.: Curve and surface fitting: An introduction, Academic Press, London, ISBN 0124360602, 280 pp., 1986. 

Li, J., Lu, X., Cheng, K., Liu, W., and Li, M. J.: StepReg: Stepwise Regression Analysis, R package version 1.4.4, CRAN [code], https://CRAN.R-project.org/package=StepReg (last access: 12 September 2023), 2020. 

Ligas, M. and Banasik, P.: Conversion between Cartesian and geodetic coordinates on a rotational ellipsoid by solving a system of nonlinear equations, Geodesy and Cartography, 60, 145–159, https://doi.org/10.2478/v10277-012-0013-x, 2011. 

Lirer, L., Luongo, G., and Scandone, R.: On the volcanological evolution of Campi Flegrei, Eos T. Am. Geophys. Un., 68, 226–234, https://doi.org/10.1029/EO068i016p00226, 1987. 

Liu, H., Harris, J., Sherlock, R., Behnia, P., Grunsky, E., Naghizadeh, M., Rubingh, K., Tuba, G., Roots, E., and Hill, G.: Mineral prospectivity mapping using machine learning techniques for gold exploration in the Larder Lake area, Ontario, Canada, J. Geochem. Explor., 253, 107279, https://doi.org/10.1016/j.gexplo.2023.107279, 2023. 

Lowe, D. J.: Tephrochronology and its application: a review, Quat. Geochronol., 6, 107–153, https://doi.org/10.1016/j.quageo.2010.08.003, 2011. 

Matano, F., De Chiara, G., Ferlisi, S., and Cascini, L.: Thickness of pyroclastic cover beds: the case study of Mount Albino (Campania region, southern Italy), J. Maps, 12, 79–87, https://doi.org/10.1080/17445647.2016.1158668, 2016. 

Matano, F., Ebrahimi, P., Angelino, A., and Grimaldi, G. M.: Database of pyroclastic cover deposit thickness measurements (PT-Cam) in peri-volcanic areas of Campania (Italy) (1.0), Zenodo [data set], https://doi.org/10.5281/zenodo.8399487, 2023. 

Mele, D., Dellino, P., Sulpizio, R., and Braia, G.: A systematic investigation on the aerodynamics of ash particles, J. Volcanol. Geoth. Res., 203, 1–11, https://doi.org/10.1016/j.jvolgeores.2011.04.004, 2011. 

Mercogliano, P., Segoni, S., Rossi, G., Sikorsky, B., Tofani, V., Schiano, P., Catani, F., and Casagli, N.: Brief communication “A prototype forecasting chain for rainfall induced shallow landslides”, Nat. Hazards Earth Syst. Sci., 13, 771–777, https://doi.org/10.5194/nhess-13-771-2013, 2013. 

Moore, I. D., and Burch, G. J.: Sediment transport capacity of sheet and rill flow: application of unit stream power theory, Water Resour. Res., 22, 1350–1360, https://doi.org/10.1029/WR022i008p01350, 1986. 

Moore, I. D., Grayson, R. B., and Ladson, A. R.: Digital terrain modelling: a review of hydrological, geomorphological, and biological applications, Hydrol. process., 5, 3–30, https://doi.org/10.1002/hyp.3360050103, 1991. 

Newbold, P. and Granger, C. W.: Experience with forecastings univariate time series and the combination of forecasts, J. R. Stat. Soc. Ser. A, 137, 131–146, 1974. 

Newhall, C. G. and Self, S.: The volcanic explosivity index (VEI) an estimate of explosive magnitude for historical volcanism, J. Geophys. Res.-Oceans, 87, 1231–1238, https://doi.org/10.1029/JC087iC02p01231, 1982. 

Nowotarski, J., Raviv, E., Trück, S., and Weron, R.: An empirical comparison of alternative schemes for combining electricity spot price forecasts, Energ. Econ., 46, 395–412, https://doi.org/10.1016/j.eneco.2014.07.014, 2014. 

Nti, I. K., Adekoya, A. F., and Weyori, B. A.: A comprehensive evaluation of ensemble learning for stock-market prediction, Journal of Big Data, 7, 20, https://doi.org/10.1186/s40537-020-00299-5, 2020. 

Orsi, G., D'Antonio, M., de Vita, S., and Gallo, G.: The Neapolitan Yellow Tuff, a large-magnitude trachytic phreatoplinian eruption: eruptive dynamics, magma withdrawal and caldera collapse, J. Volcanol. Geoth. Res., 53, 275–287, https://doi.org/10.1016/0377-0273(92)90086-S, 1992. 

Orsi, G., De Vita, S., and Di Vito, M.: The restless, resurgent Campi Flegrei nested caldera (Italy): constraints on its evolution and configuration, J. Volcanol. Geoth. Res., 74, 179–214, https://doi.org/10.1016/S0377-0273(96)00063-7, 1996. 

Orsi, G., Di Vito, M. A., and Isaia, R.: Volcanic hazard assessment at the restless Campi Flegrei caldera, B. Volcanol., 66, 514–530, https://doi.org/10.1007/s00445-003-0336-4, 2004. 

Pappalardo, L., Civetta, L., D'Antonio, M., Deino, A., Di Vito, M., Orsi, G., Carandente, A., de Vita, S., Isaia, R., and Piochi, M.: Chemical and Sr-isotopical evolution of the Phlegraean magmatic system before the Campanian Ignimbrite and the Neapolitan Yellow Tuff eruptions, J. Volcanol. Geoth. Res., 91, 141–166, https://doi.org/10.1016/S0377-0273(99)00033-5, 1999. 

Passariello, I., Lubritto, C., D'Onofrio, A., Guan, Y., and Terrasi, F.: The Somma–Vesuvius complex and the Phlaegrean Fields caldera: new chronological data of several eruptions of the Copper–Middle Bronze Age period, Nucl. Instrum. Meth. B, 268, 1008–1012, https://doi.org/10.1016/j.nimb.2009.10.085, 1999. 

Passariello, I., Talamo, P., D'Onofrio, A., Barta, P., Lubritto, C., and Terrasi, F.: Contribution of Radiocarbon Dating to the Chronology of Eneolithic in Campania (Italy), Geochronometria, 35, 25–33, https://doi.org/10.2478/v10003-010-0008-2, 2010. 

Patacca, E., Sartori, R., and Scandone, P.: Tyrrhenian basin and Apenninic arcs: kinematic relations since late Tortonian times, Memorie della Società Geologica Italiana, 45, 425–451, 1990. 

Perrotta, A. and Scarpati, C.: Volume partition between the plinian and co-ignimbrite air fall deposits of the Campanian Ignimbrite eruption, Miner. Petrol., 79, 67–78, https://doi.org/10.1007/s00710-003-0002-8, 2003. 

Perrotta, A., Scarpati, C., Luongo, G., and Morra, V.: The Campi Flegrei caldera boundary in the city of Naples, in: Developments in Volcanology, edited by: De Vivo, B., Elsevier, Volume 9, 85–96, https://doi.org/10.1016/S1871-644X(06)80019-7, 2006. 

Piochi, M., Mastrolorenzo, G., and Pappalardo, L.: Magma ascent and eruptive processes from textural and compositional features of Monte Nuovo pyroclastic products, Campi Flegrei, Italy, B. Volcanol., 67, 663–678, https://doi.org/10.1007/s00445-005-0410-1, 2005. 

Qi, J., Chehbouni, A., Huete, A. R., Kerr, Y. H., and Sorooshian, S.: A modified soil adjusted vegetation index, Remote Sens. Environ., 48, 119–126, https://doi.org/10.1016/0034-4257(94)90134-1, 1994. 

Qin, C., Zhu, A. X., Pei, T., Li, B., Zhou, C., and Yang, L.: An adaptive approach to selecting a flow-partition exponent for a multiple-flow-direction algorithm, Int. J. Geogr. Inf. Sci., 21, 443–458, https://doi.org/10.1080/13658810601073240, 2007. 

Ribeiro, M. H. D. M. and dos Santos Coelho, L.: Ensemble approach based on bagging, boosting and stacking for short-term prediction in agribusiness time series, Appl. Soft Comput., 86, 105837, https://doi.org/10.1016/j.asoc.2019.105837, 2020. 

Rolandi, G., Maraffi, S., Petrosino, P., and Lirer, L.: The Ottaviano eruption of Somma-Vesuvio (8000 y B.P.): a magmatic alternating fall and flow-forming eruption, J. Volcanol. Geoth. Res., 58, 43–65, https://doi.org/10.1016/0377-0273(93)90101-V, 1993a. 

Rolandi, G., Mastrolorenzo, G., Barrella, A. M., and Borrelli, A.: The Avellino plinian eruption of Somma-Vesuvius (3760 y.B.P.): the progressive evolution from magmatic to hydromagmatic style, J. Volcanol. Geoth. Res., 58, 67–88, https://doi.org/10.1016/0377-0273(93)90102-W, 1993b. 

Rolandi, G., Petrosino, P., and Mc Geehin, J.: The interplinian activity at Somma–Vesuvius in the last 3500 years, J. Volcanol. Geoth. Res., 82, 19–52, https://doi.org/10.1016/S0377-0273(97)00056-5, 1998. 

Rolandi, G., Munno, R., and Postiglione, I.: The AD 472 eruption of the Somma volcano, J. Volcanol. Geoth. Res., 129, 291–319, https://doi.org/10.1016/S0377-0273(03)00279-8, 2004. 

Rolandi, G., Paone, A., Di Lascio, M., and Stefani, G.: The 79 AD eruption of Somma: The relationship between the date of the eruption and the southeast tephra dispersion, J. Volcanol. Geoth. Res., 169, 87–98, https://doi.org/10.1016/j.jvolgeores.2007.08.020, 2008. 

Rosi, M. and Sbrana, A.: Phlegrean fields, Quaderni de La Ricerca Scientifica, 114, Consiglio Nazionale delle Ricerche, Roma, Italia, 1987. 

Rosi, M., Sbrana, A., and Vezzoli, L.: Correlazioni tefrostratigraphiche di alcuni livelli di Ischia, Procida e Campi Flegrei, Memorie della Societa Geologica Italiana, 41, 1015–1027, 1988. 

Rosi, M., Vezzoli, L., Aleotti, P., and De Censi, M.: Interaction between caldera collapse and eruptive dynamics during the Campanian Ignimbrite eruption, Phlegraean Fields, Italy, B. Volcanol., 57, 541–554, https://doi.org/10.1007/BF00304438, 1996. 

Rosi, M., Vezzoli, L., Castelmenzano, A., and Grieco, G.: Plinian pumice fall deposit of the Campanian Ignimbrite eruption (Phlegraean Fields, Italy), J. Volcanol. Geoth. Res., 91, 179–198, https://doi.org/10.1016/S0377-0273(99)00035-9, 1999. 

Rossi, G., Catani, F., Leoni, L., Segoni, S., and Tofani, V.: HIRESSS: a physically based slope stability simulator for HPC applications, Nat. Hazards Earth Syst. Sci., 13, 151–166, https://doi.org/10.5194/nhess-13-151-2013, 2013. 

Rouchon, V., Gillot, P. Y., Quidelleur, X., Chiesa, S., and Floris, B.: Temporal evolution of the Roccamonfina volcanic complex (Pleistocene), Central Italy, J. Volcanol. Geoth. Res., 177, 500–514, https://doi.org/10.1016/j.jvolgeores.2008.07.016, 2008. 

Saco, P. M., Willgoose, G. R., and Hancock, G. R.: Spatial organization of soil depths using a landform evolution model, J. Geophys. Res.-Earth, 111, F02016, https://doi.org/10.1029/2005JF000351, 2006. 

Santacroce, R.: Somma-Vesuvius, Quaderni de La Ricerca Scientifica, 114, Consiglio Nazionale delle Ricerche, Roma, Italia, 251 pp., 1987. 

Santacroce, R. and Sbrana, A.: Carta geologica del Vesuvio. Scala 1:15,000, Progetto CARG. Sevizio Geologico Nazionale – Consiglio Nazionale delle Ricerche, S.E.L.C.A, Firenze, 2003. 

Santacroce, R., Cioni, R., Marianelli, P., Sbrana, A., Sulpizio, R., Zanchetta, G., Donahue, D. J., and Joron, J. L.: Age and whole rock–glass compositions of proximal pyroclastics from the major explosive eruptions of Somma-Vesuvius: A review as a tool for distal tephrostratigraphy, J. Volcanol. Geoth. Res., 177, 1–18, https://doi.org/10.1016/j.jvolgeores.2008.06.009, 2008. 

Saulnier, G. M., Beven, K., and Obled, C.: Including spatially variable effective soil depths in TOPMODEL, J. Hydrol., 202, 158–172, https://doi.org/10.1016/S0022-1694(97)00059-0, 1997. 

Sbrana, A., Cioni, R., Marianelli, P., Sulpizio, R., Andronico, D., and Pasquini, G.: Volcanic evolution of the Somma-Vesuvius complex (Italy), J. Maps, 16, 137–147, https://doi.org/10.1080/17445647.2019.1706653, 2020. 

Scarpati, C. and Perrotta, A.: Stratigraphy and physical parameters of the Plinian phase of the Campanian Ignimbrite eruption, Geol. Soc. Am. Bull., 128, 1147–1159, https://doi.org/10.1130/B31331.1, 2016. 

Scarpati, C., Sparice, D., and Perrotta, A.: A crystal concentration method for calculating ignimbrite volume from distal ash-fall deposits and a reappraisal of the magnitude of the Campanian Ignimbrite, J. Volcanol. Geoth. Res., 280, 67–75, https://doi.org/10.1016/j.jvolgeores.2014.05.009, 2014. 

Segoni, S., Martelloni, G., and Catani, F.: Different methods to produce distributed soil thickness maps and their impact on the reliability of shallow landslide modeling at catchment scale, in: Landslide Science and Practice, edited by: Margottini, C., Canuti, P., and Sassa, K., Springer, Berlin, 127–133, https://doi.org/10.1007/978-3-642-31310-3_18, 2013. 

Sevink, J., van Bergen, M. J., van der Plicht, J., Feiken, H., Anastasia, C., and Huizinga, A.: Robust date for the Bronze age Avellino eruption (Somma-Vesuvius): 3945 ± 10 calBP (1995 ± 10 calBC). Quaternary Sci. Rev., 30, 1035–1046, https://doi.org/10.1016/j.quascirev.2011.02.001, 2011. 

Sigurdsson, H., Carey, S., Cornell, W., and Pescatore, T.: The eruption of Vesuvius in A.D. 79, Natl. Geogr. Res., 1, 332–387, 1985. 

Smith, V. C., Isaia, R., and Pearce, N. J. G.: Tephrostratigraphy and glass compositions of post-15 kyr Campi Flegrei eruptions: implications for eruption history and chronostratigraphic markers, Quaternary Sci. Rev., 30, 3638–3660, https://doi.org/10.1016/j.quascirev.2011.07.012, 2011. 

Smith, V. C., Isaia, R., Engwell, S. L., and Albert, P. G.: Tephra dispersal during the Campanian Ignimbrite (Italy) eruption: implications for ultra-distal ash transport during the large caldera-forming eruption, B. Volcanol., 78, 45, https://doi.org/10.1007/s00445-016-1037-0, 2016. 

Sulpizio, R., Mele, D., Dellino, P., and La Volpe, L.: A complex, Subplinian-type eruption from low-viscosity, phonolitic to tephri-phonolitic magma: the AD 472 (Pollena) eruption of Somma-Vesuvius, Italy, B. Volcanol., 67, 743–767, https://doi.org/10.1007/s00445-005-0414-x, 2005. 

Sulpizio, R., Mele, D., Dellino, P., and La Volpe, L.: Deposits and physical properties of pyroclastic density currents during complex Subplinian eruptions: the AD 472 (Pollena) eruption of Somma-Vesuvius, Italy, Sedimentology, 54, 607–635, https://doi.org/10.1111/j.1365-3091.2006.00852.x, 2007. 

Sulpizio, R., Bonasia, R., Dellino, P., Mele, D., Di Vito, M. A., and La Volpe, L.: The Pomici di Avellino eruption of Somma–Vesuvius (3.9 ka BP). Part II: sedimentology and physical volcanology of pyroclastic density current deposits, B. Volcanol., 72, 559–577, https://doi.org/10.1007/s00445-009-0340-4, 2010a. 

Sulpizio, R., Cioni, R., Di Vito, M. A., Mele, D., Bonasia, R., and Dellino, P.: The Pomici di Avellino eruption of Somma-Vesuvius (3.9 ka BP). Part I: stratigraphy, compositional variability and eruptive dynamics, B. Volcanol., 72, 539–558, https://doi.org/10.1007/s00445-009-0339-x, 2010b. 

Tarboton, D. G., Bras, R. L., and Rodriguez-Iturbe, I.: On the extraction of channel networks from digital elevation data, Hydrol. Process., 5, 81–100, https://doi.org/10.1002/hyp.3360050107, 1991. 

Tarquini, S., Isola, I., Favalli, M., and Battistini, A.: TINITALY, a digital elevation model of Italy with a 10 m-cell size (Version 1.0), Istituto Nazionale di Geofisica e Vulcanologia (INGV) [data set], https://doi.org/10.13127/TINITALY/1.0, 2007. 

Taylor, J. and Tibshirani, R. J.: Statistical learning and selective inference, P. Natl. Acad. Sci. USA, 112, 7629–7634, https://doi.org/10.1073/pnas.1507583112, 2015. 

Tesfa, T. K., Tarboton, D. G., Chandler, D. G., and McNamara, J. P.: Modeling soil depth from topographic and land cover attributes, Water Resour. Res., 45, W10438, https://doi.org/10.1029/2008WR007474, 2009. 

Vitale, S. and Isaia, R.: Fractures and faults in volcanic rocks (Campi Flegrei, Southern Italy): Insights into volcano tectonic processes, Int. J. Earth Sci., 103, 801–819, https://doi.org/10.1007/s00531-013-0979-0, 2014.  

Wohletz, K., Orsi, G., and De Vita, S.: Eruptive mechanisms of the Neapolitan Yellow Tuff interpreted from stratigraphic, chemical, and granulometric data, J. Volcanol. Geoth. Res., 67, 263–290, https://doi.org/10.1016/0377-0273(95)00002-C, 1995. 

Wong, K. K., Song, H., Witt, S. F., and Wu, D. C.: Tourism predictioning: to combine or not to combine?, Tourism Manage., 28, 1068–1078, https://doi.org/10.1016/j.tourman.2006.08.003, 2007. 

Wright, M. N. and Ziegler, A.: ranger: A fast implementation of random forests for high dimensional data in C++ and R, J. Stat. Softw., 77, 1–17, https://doi.org/10.18637/jss.v077.i01, 2017. 

Xiao, J., Shen, Y., Tateishi, R., and Bayaer, W.: Development of topsoil grain size index for monitoring desertification in arid land using remote sensing, Int. J. Remote Sens., 27, 2411–2422, https://doi.org/10.1080/01431160600554363, 2006. 

Yang, D.: Spatial prediction using kriging ensemble, Sol. Energy, 171, 977–982, https://doi.org/10.1016/j.solener.2018.06.105, 2018. 

Zevenbergen, L. W. and Thorne, C. R.: Quantitative analysis of land surface topography, Earth Surf. Proc. Land., 12, 47–56, https://doi.org/10.1002/esp.3290120107, 1987. 

Download
Short summary
Fallout pyroclastic deposits cover hillslopes after explosive volcanic eruptions and strongly influence landscape evolution, hydrology, erosion, and slope stability processes. Accurate mapping of the spatial-thickness variations of these fallout pyroclastic deposits over large hillslope areas remains a knowledge gap. We attempt to bridge this gap by applying statistical techniques to a field-based thickness measurement dataset of fallout pyroclastic deposits.
Altmetrics
Final-revised paper
Preprint