CASCADE – The Circum-Arctic Sediment CArbon DatabasE

Biogeochemical cycling in the semi-enclosed Arctic Ocean is strongly influenced by land–ocean transport of carbon and other elements and is vulnerable to environmental and climate changes. Sediments of the Arctic Ocean are an important part of biogeochemical cycling in the Arctic and provide the opportunity to study present and historical input and the fate of organic matter (e.g., through permafrost thawing). Comprehensive sedimentary records are required to compare differences between the Arctic regions and to study Arctic biogeochemical budgets. To this end, the Circum-Arctic Sediment CArbon DatabasE (CASCADE) was established to curate data primarily on concentrations of organic carbon (OC) and OC isotopes (δ13C,114C) yet also on total N (TN) as well as terrigenous biomarkers and other sediment geochemical and physical properties. This new database builds on the published literature and earlier unpublished records through an extensive international community collaboration. This paper describes the establishment, structure and current status of CASCADE. The first public version includes OC concentrations in surface sediments at 4244 oceanographic stations including 2317 with TN conPublished by Copernicus Publications. 2562 J. Martens et al.: Circum-Arctic Sediment CArbon DatabasE (CASCADE) centrations, 1555 with δ13C-OC values and 268 with 114C-OC values and 653 records with quantified terrigenous biomarkers (high-molecular-weight n-alkanes, n-alkanoic acids and lignin phenols). CASCADE also includes data from 326 sediment cores, retrieved by shallow box or multi-coring, deep gravity/piston coring, or sea-bottom drilling. The comprehensive dataset reveals large-scale features of both OC content and OC sources between the shelf sea recipients. This offers insight into release of pre-aged terrigenous OC to the East Siberian Arctic shelf and younger terrigenous OC to the Kara Sea. Circum-Arctic sediments thereby reveal patterns of terrestrial OC remobilization and provide clues about thawing of permafrost. CASCADE enables synoptic analysis of OC in Arctic Ocean sediments and facilitates a wide array of future empirical and modeling studies of the Arctic carbon cycle. The database is openly and freely available online (https://doi.org/10.17043/cascade; Martens et al., 2021), is provided in various machine-readable data formats (data tables, GIS shapefile, GIS raster), and also provides ways for contributing data for future CASCADE versions. We will continuously update CASCADE with newly published and contributed data over the foreseeable future as part of the database management of the Bolin Centre for Climate Research at Stockholm University.


Introduction
The Arctic Ocean receives large input of terrestrial organic matter from rivers and coastal erosion, making it a valuable receptor system for studying both large-scale terrestrial carbon remobilization and marine biogeochemistry. Rising temperatures cause multiple changes to the Arctic, including reduced sea-ice cover, accelerated erosion of icerich permafrost shorelines and enhanced river runoff, which changes the input of terrestrial organic matter to the Arctic Ocean (AMAP, 2017). This affects nutrients and the detrital load, the ocean optical field, marine primary productivity, ocean acidification and many other aspects of biogeochemical cycling Vonk and Gustafsson, 2013). On land, climate change causes warming and thaw of terrestrial permafrost (Biskaborn et al., 2019), potentially remobilizing parts of its large dormant pool of OC (1300 Pg; Hugelius et al., 2014) into active carbon cycling. Rising temperatures may thus shift balances in the Arctic carbon cycle by transformation and translocation of previously frozen organic matter, which leads to system hysteresis effects and translocated carbon-climate feedback (e.g., Vonk and Gustafsson, 2013). Couplings between the large permafrost-carbon pools and amplified climate warming in the Arctic represent a potential "tipping point" in the climate system (Lenton, 2012). These perturbations may affect both OC sequestration in the biosphere and release of climateforcing greenhouse gases (e.g., AMAP, 2017; IPCC, 2019) as well as the coupling between permafrost carbon remobilization and ocean acidification across the extensive shelf seas .
Continental shelves cover less than 10 % of the global ocean area but account for the largest part of OC accumulation in marine sediments and thereby provide an excellent archive for both terrestrial carbon input and marine productivity (Hedges et al., 1997). The Arctic Ocean is semienclosed and dominated by its extensive shelves, including the world's largest continental shelf system, the East Siberian Arctic Shelf (ESAS; the Laptev, East Siberian and Russian part of the Chukchi Sea). This further accentuates the particular importance of shelf sediments for carbon cycling in the Arctic Vetrov and Romankevich, 2004). Earlier landmark contributions have provided comprehensive observational perspectives on the distribution of organic matter in marine sediments at the global scale (e.g., Berner, 1982;Romankevich, 1984;Hedges and Keil, 1995). Focusing in greater detail on carbon in the Arctic, the book by Vetrov and Romankevich (2004) Carbon Cycle in the Russian Arctic Seas and the book edited by  The Organic Carbon Cycle in the Arctic Ocean provided the first more comprehensive perspectives on the Arctic land-ocean carbon couplings across various regions. Therein, the authors synthesized the collected knowledge of carbon sources, transformations and burial in Arctic marginal seas and the central Arctic Ocean. These compilations demonstrated substantial regional variations in carbon cycling between different Arctic shelf seas, while also acknowledging the near lack of observational data for key parameters and regions. Substantial progress has been made by individual and region-specific studies since then, with key advances in isotope and organic geochemistry that expand the variety of biogeochemical proxies to trace both sources and organic matter degradation. Stable carbon isotopes (δ 13 C-OC) have been widely used to distinguish between marine and terrigenous sources in Arctic Ocean sediments (e.g., Naidu et al., 1993;Mueller-Lupp et al., 2000;Semiletov et al., 2005) and have since then been greatly supplemented by an expanded use of natural abundance radiocarbon ( 14 C-OC). This has improved source apportionment of OC in bulk sediments across Arctic regions and timescales (e.g., Vonk et al., 2012;Goñi et al., 2013;Martens et al., 2020) and in sediment density fractions (Tesi et al., 2016b), in suspended particulate organic matter (e.g., Vonk et al., 2010Vonk et al., , 2014Karlsson et al., 2016), and at the molecular level (e.g., Drenzek et al., 2007;Gustafsson et al., 2011;Feng et al., 2013). Extensive studies of a wide set of molecular biomarkers (e.g., Fahl and Stein, 1997;Goñi et al., 2000;Belicka et al., 2004;Yunker et al., 2005;van Dongen et al., 2008;Tesi et al., 2014;Sparkes et al., 2015;Bröder et al., 2016) have provided growing insights into OC distribution and fate, particularly for terrigenous organic matter. Access to this growing number of observational data in a readily accessible interactive format would be greatly beneficial to wider system assessments and interpretations of organic matter in the Arctic Ocean.
The overarching objective of this effort is to curate and harmonize all available data on OC in Arctic Ocean sediments in an open and freely available database. The Circum-Arctic Sediment CArbon DatabasE (CASCADE) builds on previously published and unpublished collections holding information on OC and total N (TN) concentrations, as well as OC isotopes (δ 13 C-OC, 14 C-OC) in sediments of all continental shelves and the deep central basins of the Arctic Ocean. Furthermore, CASCADE contains molecular data with an initial focus on terrestrial biomarkers (i.e., high-molecular-weight (HMW) n-alkanes, n-alkanoic acids, lignin phenols) to facilitate studies of terrestrial OC remobilization. The backbone of CASCADE is large data collections, including (i) OC concentrations, δ 13 C / 14 Cisotope data and biomarkers from the informal 2-decadelong Swedish-Russian collaboration network the International Siberian Shelf Study (ISSS; Semiletov and Gustafsson, 2009) (e.g., Guo et al., 2004;Semiletov et al., 2005;van Dongen et al., 2008;Vonk et al., 2012;Tesi et al., 2016a;Bröder et al., 2018;Martens et al., 2019Martens et al., , 2020Muschitiello et al., 2020); (ii) OC concentrations from the Arctic portion of the "Carbon Database" of the Shirshov Institute of Oceanology, Russian Academy of Sciences (Romankevich, 1984;Vetrov and Romankevich, 2004); (iii) previously published databases and online collections (e.g., https://pangaea.de/) with many contributions from German-Russian partnerships and cruises involving the Alfred Wegener Institute, Germany (e.g., Stein et al., 1994;Mueller-Lupp et al., 2000;Xiao et al., 2015); (iv) US and Canadian research (e.g., Naidu et al., 1993Naidu et al., , 2000Goñi et al., 2000Goñi et al., , 2013Grebmeier et al., 2006); and (v) data from various other contributors that are acknowledged in the database. The initial version also includes previously unpublished data, with some generated here in the upstart of CASCADE, to fill gaps for particularly data-lean regions such as the Barents and Kara seas, the Canadian Arctic Archipelago, and the Chukchi Sea.
The aim of the CASCADE effort is to provide a foundation for future studies. These may include large-scale assessments of the carbon cycle, such as characteristics of OC input, and its distribution and fate in the Arctic Ocean. This paper describes the creation and the structure of CASCADE, including a discussion of data availability and quality.  (Obu et al., 2019), rates of coastal erosion (Lantuit et al., 2012) and the latest IBCAO v4 bathymetry . Black lines delineate the extent of the Arctic Ocean shelf seas and each respective watershed on land.

The physical compartments: Arctic shelf seas and interior Arctic Ocean basins
CASCADE includes OC data from the entire Arctic Ocean with special focus on the seven Arctic continental shelf seas ( Fig. 1; Table 1). Accordingly, a distinction is made among the central Arctic Ocean and the following marginal seas: Beaufort Sea, Chukchi Sea, East Siberian Sea, Laptev Sea, Kara Sea, Barents Sea (including White Sea) and the Canadian Arctic Archipelago (we exclude data from Baffin Bay, Foxe Basin and Hudson Bay, as they are outside the circum-Arctic scope of the database). For defining the limits of these Arctic shelf seas, Jakobsson (2002) is followed, which distinguishes the Arctic Ocean constituent seas using hypsometric criteria. Therein, shelf is defined as the seaward extension of the continental margin until the increase in steepness at the shelf break (Jakobsson, 2002). Data for the central Arctic Ocean were treated as one individual unit that covers all area beyond the shelf break and includes the continental slope, rise, deep basins and mid-ocean ridges.

Georeferencing and sampling
The coordinate system used for CASCADE is WGS1984, and coordinates are kept in machine-readable decimal degrees (latitude in • N, longitudes in the −180 to 180 • format) to harmonize the data across all GIS applications. The collection of data from oceanographic stations is the main part of CASCADE and is organized in a table format that contains columns for the station number ("STATION") and geographical coordinates ("LAT"; "LON"). The spatial references also include information about the sediment depth interval that reported data represent ("UPPERDEPTH'; "LOW-ERDEPTH"), where the upper depth is equal to 0 cm in the case of surface sediments. In addition, the table contains a column for water depth ("WATERDEPTH") as reported by the data source. In cases where the water depth was not reported, the water depth was estimated using the latest version (v4) of the bathymetric map of IBCAO  corresponding to the position of the oceanographic station and reported in a separate column ("IB-CAODEPTH"). Furthermore, the name of the expedition and/or ship ("EXPEDITION") and the year when the sample was taken ("YEAR") are reported. For samples where the sampling year was unknown, users may use the year of publication instead.

Surface sediments and sediment cores
The first stage of the CASCADE development focused on maximizing spatial coverage for surface sediments of the seven circum-Arctic shelf sea systems and the central Arctic Ocean. Here, surface sediments are defined as those collected from the water-sediment interface to a depth of maximum 5 cm. Data for surface sediments are provided in a table ("CASCADEsurfsed") as .txt and .xlsx files and in a ready-to-use GIS shapefile format. This database also includes deeper sediments from sediment cores, which represent longer timescales and add a third dimension to the geographical referencing. Types of sediment cores are distinguished in CASCADE such that different biogeochemical processes, acting on three depositional timescales, may be addressed. The three timescales are 1. centennial scale cores (core scale 1) in upper sediments of the Arctic Ocean, e.g., multi-corer, Gemini corer, box corer, van Veen grab sampler, other short gravity corers up to 1 m length; 2. millennial scale cores (core scale 2) of shelf sediments roughly covering the depositional time frame from the late Holocene to the last glacial-interglacial transition, by piston corer, long gravity corer and kasten corer; and 3. glacial cycle scale cores (core scale 3) from the continental slopes or the deeper Arctic Ocean basins covering periods from earlier than the Last Glacial Maximum, including drill coring on the circum-Arctic shelves or deep-sea piston cores.
Downcore data are stored in three separate data tables ("CASCADEcorescale1"; "CASCADEcorescale2"; "CAS-CADEcorescale3") in addition to the surface sediment files, including a column for the sampling depth of core subsamples in centimeters below the sediment surface ("CORE-DEPTH").

Database parameters
CASCADE contains information about the concentration and isotopic and molecular composition of OC in marine Arctic sediments. In addition to (i) OC concentrations (column "OC"), the database includes (ii) concentrations of TN ("TN") and (iii) the gravimetric ratio of OC / TN ("OC / TN"), which may provide additional information about the organic matter source (e.g., Goñi et al., 2005;van Dongen et al., 2008). Furthermore, CASCADE contains data of (iv) δ 13 C-OC ("d13C") as a parameter to distinguish between marine and terrestrial sources (e.g., Fry and Sherr, 1989) and (v) 14 C-OC ("D14C") to assess the presence of aged organic matter released from permafrost deposits (e.g., Gustafsson et al., 2011;Vonk et al., 2012) or from petrogenic sources such as sedimentary rocks (e.g., Yunker et al., 2005;Goñi et al., 2013) in marine sediments. More details about the CASCADE parameters and their units are provided in Table 2. Data of terrigenous biomarkers may facilitate further investigations of terrigenous OC input ( Table 2). The first version of CASCADE compiles total concentrations of nalkanes with high molecular weight (HMW) and C 21 -C 31 carbon atoms ( C 21 -C 31 ; column "HMWALK"), as well as the often separately reported more specific n-alkanes C 27 + C 29 + C 31 ("HMWALK_SPEC"). CASCADE also contains the sum of the HMW n-alkanoic acids C 20 -C 30 ("HMWACID"). Both compound classes stem mostly from terrigenous compartments as they derive from epicuticular leaf waxes of land plants with a typical pattern of dominating odd-numbered homologues for HMW n-alkanes and evennumbered homologues for HMW n-alkanoic acids (Eglinton and Hamilton, 1967). Furthermore, the database holds concentrations of lignin phenols ( syringyl, vanillyl, cinnamyl; "LIGNIN"), which are products from the break-up of the lignin biopolymer, a compound only produced by vascular plants (Hedges and Mann, 1979). These three compound classes are frequently used as tracers of the sources and fate of terrestrial organic matter sequestered in Arctic Ocean sediments (Fahl and Stein, 1997;Goñi et al., 2000;Tesi et al., 2014;Bröder et al., 2016). It is recognized that there are more parameters that could be included, and CASCADE can add further extensions in future versions.

Reference to the original publication
Each data source added to CASCADE is fully cited (in the formatting style of Earth Systems Science Data; ESSD) to maintain a high level of transparency. When applicable, citations also include a digital object identifier (DOI) that is linked to the reference in the primary literature next to each parameter column. Accordingly, the CASCADE data sheet distinguishes between a common reference for OC, TN and OC / TN data ("CN_CITATION") as they are often combined in one measurement and separate references for OC isotopes ("d13C_CITATION"; "D14C_CITATION") and concentrations of biomarkers ("BM_CITATION"). This facilitates registration of multiple measurements based on the same or split sediment sample material for individual oceanographic stations. A full list of references is separately provided on the CASCADE website and in the Supplement of this paper.

Data source and quality
A part of CASCADE builds on previous separate and partly inaccessible databases of OC parameters that key partners of the CASCADE consortium and others have collected over the years. This includes data from the informal Swedish-Russian collaboration network called the International Siberian Shelf Study (ISSS; Semiletov and Gustafs-son, 2009) and the "Carbon" database of the Shirshov Institute of Oceanology. This basis for CASCADE was strengthened by an extensive survey of the peer-reviewed literature and data mining in the grey literature of scientific cruise reports. To facilitate quality assurance criteria by the end users, the database also records metadata (e.g., sampling technique in the field, sample storage) and quality data when available. The quality assurance information for data in CASCADE is as follows.
-Data need to be (geo-)referenceable and located in the target region (i.e., the Arctic Ocean).
-Information about the analysis method is provided by the data source.
-For OC concentrations, values were generated by elemental analyzer (EA) or Rock-Eval pyrolysis and reported as weight-% OC. Total N concentrations and OC / TN ratios are based on EA only.
-For δ 13 C-OC, data stored in CASCADE are based on isotope ratio mass spectrometry (IRMS), often coupled to an EA and calibrated against the PDB/V-PDB analytical standards.
-For 14 C-OC, the measurements of 14 C data are based on mass spectrometry with 14 C data reported as 14 C, with fraction of modern (F m ) or conventional 14 C age in the original publication. We also kept records of the 14 C/AMS lab code of the sample if given.
-Terrigenous biomarker analysis was carried out by solvent extraction (for HMW n-alkanes and n-alkanoic acids) or by alkaline CuO oxidation of the lignin biopolymer (for lignin phenols) of the sediments, followed by wet chemistry purification and quantification using gas chromatography analysis with either flame ionization or mass spectrometry detection.
In addition to the abovementioned information, the aim was also to include information about carbonate removal by acid treatment prior to the measurement of OC, δ 13 C-OC and 14 C-OC. However, details about applied procedures were missing in most cases, and it is therefore assumed that the carbonate fraction was removed from total carbon prior to OC, δ 13 C-OC and 14 C-OC measurements. All meta-information (sampling, storage, analysis) for each CASCADE entry is included in a respective column in the data spreadsheet (Table 2).

Bulk OC and carbon isotopes
Gap filling was performed in surface sediments of regions with particularly poor data density. These efforts thus focused on areas north of western Siberia (Barents and Kara Sea region) and in the Canadian Arctic Archipelago, using archived sample material that was provided by CAS-CADE collaborators. For OC, TN and δ 13 C-OC analysis, about 10 mg each of a total of 153 freeze-dried sediment samples was weighed in silver capsules and acidified drop-wise with 3 M HCl in order to remove carbonates. The measurement was carried out using a Carlo Erba NC2500 elemental analyzer coupled to an isotope-ratio mass spectrometer (Finnigan DeltaV Advantage) in the Department of Geologi-cal Sciences, Stockholm University, with ±3 % precision for OC analysis and ±0.15 ‰ precision for δ 13 C-OC isotopic measurements. Furthermore, a subset of 95 samples was selected for gapfilling bulk-level 14 C-OC analysis at the Tandem Laboratory, Department of Physics, Uppsala University. A sample amount corresponding to 1 mg OC was weighed in tin capsules and acidified with 3 M HCl to remove carbonates. Samples with low OC concentrations (< 0.5 %) were placed in small beakers and exposed to acid fumes in a desiccator for 24 h to remove carbonates and combusted to CO 2 in evacuated quartz tubes prior to graphitization at the 14 C/AMS laboratory. An additional set of 30 gap-filling samples was analyzed for 14 C at the 14 C laboratory of ETH Zurich after acid fumigation. The measurements at Uppsala University had a precision of on average ±1.9 % while the precision at ETH Zurich was on average ±1.1 % (based on 14 C counting statistics).
In CASCADE, all new data points are labeled by citing the database (Martens et al., 2021, https://doi.org/10.17043/cascade) in the respective reference columns.

Analysis of lignin phenols
Gap-filling analysis was also performed for lignin phenols as molecular biomarkers for terrestrial organic matter using a set of 64 samples from data-lean regions. To extract lignin phenols from marine sediments, we applied an alkaline CuO oxidation protocol using a microwave-based method as originally presented by Goñi and Montgomery (2000) and followed the same laboratory routine as described in greater detail elsewhere (Tesi et al., 2014;Martens et al., 2019).

Data conversion and harmonization
Recalculations of literature data (e.g., for unit conversions) were in some cases necessary to harmonize the data to the standard units as defined in Table 2.
In CASCADE the concentration of OC is reported in percent (%) of the dry weight; values previously published as milligrams of OC per gram of dry weight were divided by a factor of 10.
CASCADE uses 14 C with age correction (Eq. 1) to report the activity of radiocarbon according to convention (Stuiver and Polach, 1977;Stenström et al., 2011). For radiocarbon values that were reported as conventional 14 C ages we used Eq. (2) to calculate the age-corrected 14 C.
Here F m is the fraction modern, λ C the decay constant of the Cambridge half-life of 14 C (T 1/2−C = 5730; λ C = 1/8267), Y C the year of sample collection, λ L the decay constant of the Libby half-life of 14 C (T 1/2−L = 5568; λ C = 1/8033) and T14 C−years the conventional 14 C age (Stuiver and Polach, 1977). All biomarker concentrations of HMW n-alkanes and nalkanoic acids are reported as micrograms per gram of OC while lignin phenols are reported as milligrams per gram of OC. Biomarker concentrations that in the original publication were reported as normalized to dry sediment weight were for CASCADE normalized to the OC concentration of the sample.

Data interpolation
CASCADE provides interpolated files (GEOtiff, ASCII; coordinate system WGS 1984 Arctic Polar Stereographic) for OC content, δ 13 C-OC and for 14 C-OC in surface sediments across the Arctic Ocean. OC data were mapped in ArcGIS 10.6 and interpolated to a resolution of 5 × 5 km per grid cell using the empirical Bayesian kriging function (EBK; Gribov and Krivoruchko, 2020) in the commercially available Ar-cGIS 10.8 software package (ESRI). Kriging builds on the assumption that two points located in proximity are more similar than two points further apart and creates a gridded surface of predicted values using an empirical semivariogram model. As an advancement to kriging, EBK repeatedly simulates semivariogram models in subsets of up to 200 data points and thus not only improves the prediction but also optimizes interpolation across areas with strongly varying data availability in the Arctic Ocean (e.g., shelf seas vs. central basins).

Dataset inventory
Surface sediments show by far the largest data availability. The dataset of OC concentrations in CASCADE includes 4244 different locations across the Arctic Ocean (Fig. 2), while the concentration of TN and the OC / TN ratio are known for 2317 locations (Table 1). For carbon isotopes, the number of individual δ 13 C-OC values is 1555, and for 14 C-OC it is 268. CASCADE also holds concentrations of terrigenous biomarkers at 131-213 locations per compound group. Most of the biomarker data are for HMW n-alkanes, with either concentrations of HMW n-alkanes ( C 21 -C 31 ; 213 stations) or chain lengths more specific for higher plants ( C 27 , C 29 , C 31 ; 164). Fewer data are available for concentrations of HMW n-alkanoic acids ( C 20 -C 30 ; 131) and the concentrations of lignin phenols (145).
In addition to surface sediments, a total number of 326 sediment cores (79 centennial, 229 millennial and 18 glacial cycle scale cores) are included in the first version of CAS-CADE. Combined, these hold another 10 552 observations of OC concentrations, 4769 concentrations of TN and 2122 δ 13 C-OC ratios in core samples from across the Arctic Ocean.

Spatial distribution of data
The data coverage for surface sediments is highly variable among the shelf seas, yet improved by the extensive gapfilling analysis ( Table 1). The largest number of OC concentrations is in the Barents Sea (1092; Table 1). Despite the large total number of available Arctic sediment OC concentrations, there are only 236 samples analyzed for δ 13 C-OC and 33 with 14 C-OC in the Barents Sea, and of these most are located in the Norwegian (western) sector of the Barents Sea. For the eastern Siberian Arctic and the North American sector of the Arctic Ocean, observations of OC concentrations are lower, but the availability of δ 13 C-OC data is higher (Table 1, Fig. 2b, c). Accordingly, the Kara, Laptev, East Siberian and Chukchi seas each support more than 200 δ 13 C-OC observations. The number of 14 C-OC observations is generally lower but reveals the highest coverage in near-coastal areas, with 28 values in the Kara Sea, 42 values in the Laptev Sea and 71 values in the East Siberian Sea. Data availability in the Chukchi Sea for 14 C-OC is lower (n = 12), stressing the need for future analysis. The lowest availability of data is in the Canadian Arctic Archipelago. Gap-filling analysis of OC here increased the number of OC concentrations from 21 to 54, with a similar number for carbon isotopes (51 of δ 13 C-OC; 22 of 14 C-OC) distributed over its vast area of 1 171 000 km 2 . The largest individual regime area is covered by the interior basins of the central Arctic Ocean, which holds 529 observations of OC concentrations, 130 of δ 13 C-OC and 27 of 14 C-OC values.

Assessment of data quality
Based on the quality assurance data available, CASCADE provides detailed information about the techniques involved in analyzing OC concentrations, isotopes and biomarkers. The development of CASCADE included the collection of meta-information about sampling, storage and analysis, as described in Sect. 2.6. This information is included and detailed in CASCADE. The quality assurance information shows that 86 % of the reported OC concentrations were analyzed using EA, and only a minority were analyzed by Rock-Eval pyrolysis. For δ 13 C-OC, in 66 % of the cases IRMS cou-pled to EA was reported as the method of analysis. Regarding sample storage, information was given in about 59 % of all data sources that the samples were kept frozen between sampling and analysis, while for < 1 % of the cases it was documented that the samples were stored refrigerated; this means that for 40 % of the samples, there was no information provided about sample storage. For 78 % of the 14 C-OC values, the laboratory 14 C/AMS label was documented and thus also added to the CASCADE sheet.

Circum-Arctic carbon features
Visualization of CASCADE data directly reveals several large-scale features of OC in Arctic Ocean sediments. These include clear differences in both OC concentration and source-diagnostic isotope composition among the shelf seas. For instance, interpolated OC concentrations (Fig. 2) indicate that high sedimentary OC content is found both in regions of high terrestrial input (e.g., Kara Sea, Laptev Sea, East Siberian Sea and Beaufort Sea) and in regions of high nutrient availability and marine primary productivity (Barents Sea and Chukchi Sea). The combination of δ 13 C and 14 C isotope values delineates large-scale differences in OC sources. Values of δ 13 C-OC close to marine OC (−21 ‰; Fry and Sherr, 1989) and 14 C reflecting contemporary carbon are consistent with high marine primary productivity in the Barents Sea and Chukchi Sea. The Kara Sea receives input from major West Siberian catchments (Ob and Yenisey rivers), with sediment OC that appears to reflect OC from contemporary terrestrial sources (∼ −27 ‰; Fry and Sherr, 1989). By contrast, the terrigenous OC fraction in the Laptev and East Siberian seas is much older with a presumably substantial contribution from remobilization of thawing permafrost or other old deposits via erosional or fluvial processes (Figs. 1, 2). These and other features can now be investigated through CASCADE in greater quantitative detail over large intra-and inter-system scales.

Data availability
CASCADE will be hosted and actively updated and extended by a database management at the Bolin Centre for Climate Research at Stockholm University. CAS-CADE is accessible at the Bolin Centre Database (https://doi.org/10.17043/cascade; Martens et al., 2021). When using the CASCADE, this paper and the database should be cited. The website also includes contact details, which can be used to submit new data for incorporation into future versions of CASCADE -a community effort and resource.

Vision and future development
CASCADE is the largest and most comprehensive openaccess database of OC parameters for Arctic Ocean sedi-ments. It is a resource that can facilitate a wide range of investigations on OC cycling in the high northern latitudes. For instance, CASCADE may help research on sources of organic matter, marine primary production, OC degradation, and OC transport both in the offshore direction and vertically from the sea surface to the sediment, and all this from both the contemporary and the historical perspectives. CAS-CADE provides opportunities to expand our still limited understanding of how sensitive terrestrial permafrost in different circum-Arctic regions is towards remobilization in both the current and earlier periods of rapid climate change. Future versions of CASCADE may also expand on parameters by adding more compound classes of terrestrial biomarkers, marine biomarkers, environmental contaminants (e.g., Hg and organic legacy and emerging substances) and others to investigate biogeochemical distribution and the fate of these in the Arctic Ocean.

Supplement.
The supplement related to this article is available online at: https://doi.org/10.5194/essd-13-2561-2021-supplement. yond the International Siberian Shelf Study (ISSS) network and all participants of the Arctic Partner Forum 2018 for their advice in constructing the CASCADE database and for pointing out data sources during the development of the database. We also thank the crew and the scientific party of the ISSS-08 expedition on board RV Yacob Smirnitskyi, the SWERUS-C3 expedition on board the IB Oden in 2014 and various other field campaigns organized by the ISSS in 2004, 2007, 2008and 2017 Wennström are thanked for their help with gap-filling analysis of lignin phenols. We also thank August Andersson and Henry Holmstrand for their long-term assistance and advice during various field campaigns, laboratory analyses and computer-based work that contributed to the realization of this database.
Financial support. Development of CASCADE was supported by the European Research Council (ERC Advanced Grant CC-TOP 695331 to Örjan Gustafsson), the EU H2020-funded project Nunataryuk (grant 773421), and the Swedish Research Council (grant 2017-01601). Field campaigns to obtain gap-filling samples were supported by the Knut and Alice Wallenberg Foundation (KAW contract 2011.0027 to Örjan Gustafsson) as part of the SWERUS-C3 program, as well as by the Russian Science Foundation (grant 21-77-30001 to Igor Semiletov) and the Russian Ministry of Science and Higher Education (grant 0211-2021-0010 to Pacific Oceanological Institute, Vladivostok). Furthermore, this study was supported by the assignment of the Russian Academy of Sciences (grant 0128-2021-0005) and the Russian Science Foundation (grant 18-05-60214) to the Shirshov Institute of Oceanology (Evgeny Romankevich, Alexander Vetrov). The collection of sample material in the Barents Sea was supported by the Research Council of Norway (grant 228107 to Michael L. Carroll; grant 223259) and VISTA (grant 6172 to Emmelie K. L. Åström). Gapfilling samples from the Canadian Arctic were supported by the Research Council of Canada (NSERC Discovery Grant RGPIN-2016-05457 to Anna J. Pieńkowski). Bart van Dongen was supported by an NERC research grant (NE/I024798/1) and Jorien Vonk was supported by the Dutch-NWO (Veni grant 863.12.004).
Review statement. This paper was edited by Jens Klump and reviewed by Gerrit Müller and one anonymous referee.