A database of marine and terrestrial radiogenic Nd and Sr isotopes for tracing earth-surface processes

The database presented here contains radiogenic neodymium and strontium isotope ratios measured on both terrestrial and marine sediments. The main purpose of this dataset is to help assess sediment provenance and transport processes for various time intervals. This can be achieved by either mapping sediment isotopic signature and/or fingerprinting source areas using statistical tools. The database has been built by incorporating data from the literature and the SedDB database and harmonizing the metadata, especially units and geographical coordinates. The original data were processed in three steps. Firstly, specific attention has been devoted to providing geographical coordinates to each sample in order to be able to map the data. When available, the original geographical coordinates from the reference (generally DMS coordinates) were transferred into the decimal degrees system. When coordinates were not provided, an approximate location was derived from available information in the original publication. Secondly, all samples were assigned a set of standardized criteria that help split the dataset into specific categories. For instance, samples were distinguished according to their location (“Region”, “Sub-region” and “Location” that relate to locations at continental to city or river scale) or the sample type (terrestrial samples – “aerosols”, “soil sediments”, “river sediments”, “rocks” – or marine samples – “marine sediment” or “trap sample”). Finally, samples were distinguished according to their deposition age, which allowed us to compute average values for specific time intervals. Graphical examples illustrating the functionality of the database are presented and the validity of the process was tested by comparing the results with published data. The dataset will be updated bi-annually in order to add more data points to increase the sampling density or provide new types of samples (e.g. seawater signature) and/or integrate additional information regarding the samples. It is publicly available (under CC4.0-BY Licence) from the GFZ data management service at https://doi.org/10.5880/GFZ.4.3.2019.001.


Introduction
A large amount of sediments is deposited by rivers and winds on continental margins and in the deeper parts of marine basins.These deposits constitute valuable climatic archives that are used in conjunction with terrestrial records and model outputs to better understand the climate-earth system.In that general context, the radiogenic isotopes of neodymium (Nd) and strontium (Sr) measured in marine sediments have proven a powerful tool to determine their ori-gin and their mode of transportation (i.e.fluvial or aeolian), related to climatic fluctuations (Frank, 2002).Neodymium isotope ratios are generally used to fingerprint provenance changes, as continental rocks have specific Nd isotopic signatures that are preserved during transportation and burial of sediments.Strontium isotope ratios are also sensitive provenance tracers, but their original signature can be modified by weathering processes in the source area as well as grain-size sorting during sediment transportation.In conjunction with C. L. Blanchet: Global Nd and Sr database Nd isotopes, Sr isotope ratios therefore provide additional information on earth-surface processes, such as changes in hydrological conditions, vegetation cover and modes of sediment transport.
The value of compiling Nd and Sr radioisotope datasets has already been demonstrated by pioneering studies that investigated sediment generation and transport processes (Goldstein et al., 1984;Goldstein and O'Nions, 1981;Grousset et al., 1988Grousset et al., , 1990Grousset et al., , 1992)).More recently, several data compilations were used to trace submarine sediment transport processes or boundary exchanges (Jeandel et al., 2007;Krom et al., 1999b;Tachikawa et al., 2017;Weldeab et al., 2002a) and fingerprint continental source areas (e.g.Padoan et al., 2011, for the Nile River basin and Scheuvens et al., 2013, for northern Africa).The sedimentary database for geochemical analyses, SedDB, which is hosted on the EarthChem platform, provides a large number of data for Nd and Sr isotopes (http://www.earthchem.org/seddb,last access: 20 February 2019).This useful instrument allows us to sort data per type of analysis, age and location (among other criteria), but has been put on hold since 2013 and is therefore not up to date.Consequently, there is at present no combined dataset that allows us to evaluate the contribution of specific sources to the sedimentary records, and authors use parts of these datasets arbitrarily, based on their geographical relevance, together with their own discrete measurements (Blanchet et al., 2013;Castañeda et al., 2016;Revel et al., 2010;Wu et al., 2016).The lack of a comprehensive dataset therefore hinders the possibility of obtaining statistically significant estimations of source contribution to the sediments and the use of harmonized identifiers for provenance.
This paper introduces a compilation of published and unpublished data, which includes an integrated filtering system using criteria to subset the dataset.In addition to presentday measurements provided by the previously cited and additional studies, specific time intervals were selected in order to plot and analyse paleo-data in view of present-day values.This dataset is envisaged as an evolutive tool that will be biannually updated and will remain in the public domain.Other relevant proxies and/or filtering criteria can be implemented in collaboration with peers.
The functionality of the database will be demonstrated by presenting some examples.Plotting has been realized with the R freeware (R Core Team, 2013), and R scripts are also published to allow other users to subset and plot the data (Blanchet, 2018a, b).

Input data
The database has been built by incorporating data from the literature and the SedDB database and harmonizing the metadata, especially units and geographical coordinates.An overview of the input data is shown in Table 1.
In a first iteration (published in September 2018, https://doi.org/10.5880/GFZ.5.2.2018.001), the pre-existing datasets from Padoan et al. (2011) and Scheuvens et al. (2013) were used (which included datasets from Krom et al., 1999b, a, andWeldeab et al., 2002a, b).The focus of these studies is different (river runoff and dust characterization respectively), but they are complementary and provide a large number of data (86 and 192 data points respectively).Second, 70 points were retrieved from the SedDB database, which could be identified as core tops and siliciclastic fraction (criteria set for Africa and Europe -40 Finally, a literature search has been conducted in order to add discrete samples that were not part of the previously cited compilations (276 data points).Data were collected from 48 different references with 631 data points in total (Table 1).
In a second iteration (published in April 2019, https://doi.org/10.5880/GFZ.4.3.2019.001),data compiled by Jeandel et al. (2007) have been added to the database, which provided an additional 222 data points and extended the geographical extent towards a global coverage (Table 1).Then, 116 points were retrieved from the SedDB database, which could be identified as core tops and siliciclastic fraction (global geographical extent).Based on external contribution (addition proposed by colleagues), author contribution (data published and provided by first or co-authors) or literature search, another 561 data points were added.The locations of these new samples are shown on maps in Fig. 1b.
The database contains samples on which either the Nd or Sr (or both) radiogenic isotope ratios were measured and expressed as ε Nd(0) and 87 Sr/ 86 Sr.The notation ε Nd(0) is widely used and is calculated as where CHUR stands for chondritic uniform reservoir and has a 143 Nd/ 144 Nd ratio of 0.512638 (Bouvier et al., 2008;Jacobsen and Wasserburg, 1980).When possible, additional variables were incorporated, such as the raw 143 Nd/ 144 Nd isotope ratio and the concentration in Sr and/or Nd in parts per million (ppm).As shown by Cole et al. (2009), sedimentary Nd and Sr concentrations can provide valuable clues for paleo-environmental interpretations.

Data processing
The original data were then processed in three steps, as shown in Table 1.Firstly, specific attention has been devoted to providing geographical coordinates to each sample in order to be able to map the data (cf.Fig. 1).When Earth Syst.Sci.Data, 11, 741-759, 2019 www.earth-syst-sci-data.net/11/741/2019/ d Contribution from colleagues.e Contribution from first or co-author.Indications of the type of source and the characteristics of the data retrieved are provided.The data were then submitted to three processes: sorting criteria were attributed to all the samples, a homogenous geographical location (coordinates in decimal degrees) was attributed to some samples, and the isotopic values of specific time intervals were determined when possible (see Sect. 2 and Table 2 for further information).
available, the original geographical coordinates from the reference (generally DMS coordinates, with a different precision standard) were transferred into the decimal degrees system.When coordinates were not provided, an approximate location for the samples was estimated from available information in the original publication.Maps or mention of actual locations (e.g.cities) were used to determine their geographical location using online coordinate finders (e.g.https://www.latlong.net/,last access: 20 February 2019).All samples were then assigned a set of standardized criteria that help split the dataset into specific categories (Table 1).Samples were attributed criteria related to their location.The "Region" category provides a general sorting at continental or oceanic scale (e.g."Mediterranean", "Atlantic", "Africa", "Europe") and the "Sub-region" category allows us to select only specific areas at oceanic sub-basins or country level (e.g.Mediterranean sub-basins or African countries).A third category, "Location", permits us to select specific areas (e.g.cities) or entities such as river basins or potential source areas (PSAs) for dust production as defined in Scheuvens et al. (2013) (Fig. 1a).Criteria were also defined in order to select specific types of samples: terrestrial samples ("aerosols", "soil sediments", "river sediments", "bivalves", "rocks") or marine samples ("marine sediment", "trap sample") (Fig. 1).When available, the grain-size fraction on which the measurements were carried out (bulk or fraction in µm) was re- ported, as such information can be useful for tracing specific transportation modes (Blanchet et al., 2013).
Third, samples were discriminated according to their age, and average values were computed for specific time intervals (Table 2).Most terrestrial samples are categorized as presentday samples, but marine samples were sorted according to their age.
-Labelled "P" (present-day): I report here only surface seafloor samples for present-day sedimentation (i.e.generally core top, the upper centimetre or past millennium).Sediment core samples that were collected below 1 cm or that were older than 1000 years were not reported here as their value might be significantly different than the present-day value, due to different climatic and oceanographic conditions (e.g.see values at 3-4 ka in Blanchet et al., 2014).
-Labelled "S1, S3, S4, S5, S6" (sapropels): this refers to samples from the well-defined sapropel layers in the Mediterranean.Climatic and oceanic conditions are known to be radically different during these time intervals (with large freshwater delivery and low oxygen content in deeper parts of the Mediterranean Basin), which led to the occurrence of specific depositional environments (Rossignol-Strick, 1985).These layers are generally visible (distinct black to grey-coloured sediments) in the sediment records, and their extent is defined by specific geochemical tracers, such as the total organic carbon content or the barium-to-aluminium ratio (De Lange et al., 2008).Using these markers and indications from the original publication, an average for these specific layers was calculated (used as a single value in the dataset).The depth or age interval used for the calculation, as well as the number of sample and ob-
-Labelled "LGM" (Last Glacial Maximum): samples of Last Glacial Maximum age (i.e.ca.20-25 ka BP) that were clearly identified in the original references were also added to the database.Related depth intervals and averaged values were determined in the cited publications (see Table 2).

Results
The dataset assembled includes the following fields for each sample: name of the sample or sediment core; criteria for location: region, sub-region, location; sample type: soil sediment, river sediment, marine sediment, aerosol, trap sample, rocks; grain-size fraction on which the measurements were made; criteria for time interval: present, sapropel layers, Last Glacial Maximum, or others as specified in the original publication; concentration and isotopic ratio in strontium and/or neodymium; geographical coordinates: original longitude and latitude (from the reference publication) and longitude and latitude (in decimal degrees), as well as notes on coordinates; notes on sample: specific information about the sample (from the reference publication);  2013), which were inspirational to this work.The sample type is given by the colour of the markers: blue for terrestrial samples, black for marine samples, green for aerosols (sediment traps) and red for deposited samples.Isotopic values are reported for each sample and the isolines from Weldeab et al. (2002b;cf. Fig. 3) are also reported.
reference: original reference publication of the sample; date of contribution: when the sample was added to the database; and Table 3 provides an overview of the number of samples in the various categories defined in Sect.2.2.Most samples are located in Africa, the Atlantic Ocean and the Indian Ocean and represent the present-day sedimentation patterns.Most of the samples in the database are marine sediments, while there is also a significant contribution from river and soil sediments.
The sorting criterion allows users to select only a subset of the data and to map the isotopic values (see Figs. 2 and 3).As an example, samples originating from various PSAs in Africa are reported in Table 3 and will be used to fingerprint PSAs based on their isotopic signature and standard statistical methods (see Fig. 4).

Technical validation
As the database is built by incrementing new measurements and homogenizing the metadata, one way to check its validity is to compare it with previously published compilations.
Earth Syst.Sci.Data, 11, 741-759, 2019 www.earth-syst-sci-data.net/11/741/2019/   3 and Fig. 1), the range and skewness of datasets in the PSA were analysed using box and whiskers plots (when sample number n was higher than 5) and compared to the isotopic ranges reported by Scheuvens et al. (2013) (green bars).Data points are shown as red diamonds and the arithmetical mean is provided (blue dots).The rectangles indicate the upper and lower quartiles and the median is shown as a thicker horizontal line (Krzywinski and Altman, 2014).
Some of the earlier works that inspired and motivated this exercise are the mapping of Sr and Nd isotopes in seafloor sediments in the Mediterranean Sea by Krom et al. (1999b) and Weldeab et al. (2002a).These studies were innovative and provided a clear illustration of the role of continental sediment sources and land-to-sea transportation as well as submarine currents in building sedimentary deposits.It also highlighted the importance of accurately reconstructing present-day sedimentary dynamics to interpret the geological record.Both studies being (almost) 20 years old, the initial intention was to update their data compilation to integrate new measurements and generate more detailed maps of seafloor sediment signatures.The comparisons between the original maps and new maps based on the database are presented in Fig. 2. If the general pattern already identified by both studies (i.e. the large influence of Nile-derived sedi-ment input on the eastern Levantine Basin) is reproduced by the new compilation, it allows the extension of the record to the western part of the basin and unravels some new features.For instance, the updated maps demonstrate the influence of runoff from the Aegean sub-basin and the large impact of dust delivery on the Ionian sub-basin (with perhaps some local runoff from the Syrian and Tunisian coasts).Not shown here, the compilation and addition of sapropel layers also allow us to map the effect of the increase in river runoff on the sedimentary signature of seafloor sediments.
Another motivation to build this database is the recent publication by Scheuvens et al. (2013), which provides a synoptic view of the geochemical signatures of African PSAs for dust generation.In particular, this study compiled a large number of data for Nd and Sr radioisotopes from soils, aerosols and marine sediments.The present database largely builds on the compilation by Scheuvens et al. (2013), which has been homogenized and completed with recent measurements, especially the Nile River sedimentary data from Padoan et al. (2011).One of the main modifications that was implemented is that approximate coordinates were attributed to samples with no given coordinate in the original reference.This was realized by using all available information, e.g.mention of cities or locations in the sample label or sample description, or approximate location from the published maps (see Sect. 2.2).This operation was realized with great care as this is the main source of error and is clearly indicated in the database.It is however an important step as it allows us to map contour lines that help in unravelling features associated with earth-surface processes (dust transportation, river runoff) that cannot be readily identified on maps in Scheuvens et al. (2013) (Fig. 3).
The validity of the approach was controlled by comparing the values provided in Scheuvens et al. (2013) for the African PSAs to those computed using the present dataset (Fig. 4).Overall, the values obtained for each PSA are in good agreement with previous estimations.The integration of additional samples allowed us to either confirm the observed values (e.g. for PSA1, PSA2 and PSA3) or to extend the value range and the number of data points (e.g. for PSA4, PSA5 and PSA6).One further advantage of using sorting criteria is that it allows us to determine and plot statistics values associated with the PSA (Fig. 4).When the number of samples was higher than 5, the data range was depicted as box plots, which allow us to determine the skewness of the data (i.e. by looking at the difference between the mean in blue and the median, which is represented as the bar in the rectangles) (Krzywinski and Altman, 2014).The presence of outliers like in the Sr signature of PSA6 can be identified and dismissed from the source fingerprinting.(Blanchet, 2019).The dataset and associated metadata are stored at the GFZ Data Service as comma-separated files, but they can also be provided as an Excel file upon request.All figures were realized using the R freeware (R Core Team, 2013) and packages "marmap" (Pante and Simon-Bouhet, 2013) and "ggplot2" (Wickham, 2016).The R codes to reproduce maps in Figs. 1, 2 and 3 as well as the box and whiskers plots in Fig. 4 are available on Figshare (Blanchet, 2018a, b).

Conclusion and outlook
The dataset assembled and presented here provides new insights into present and past earth-surface processes and the building of the marine sedimentary record.It allows us to compare various types of sediments from the terrestrial to marine realms: soils, deposited dust, river sediments, rocks, bivalves or marine sediments (Table 4).The attribution of standardized geolocations enables us to map the data and therefore to visualize sedimentary dynamics, while the use of sorting criteria related to the sample location or deposi-tional age permits us to determine source and sink isotopic signatures using statistical methods.
This database is thought of as an evolving tool and is intended to grow as new measurements are published or provided by peers.Users are encouraged to contact the author (who will act as a curator) to submit new data and/or to propose any modification or improvement of the database.With that aim, an indication of the entry date is provided, which will help users to follow the database updates and versions.
Competing interests.The author declares that they have no conflict of interest.
Acknowledgements.I would like to acknowledge the financial support of the Free University Berlin and the GFZ Potsdam.The valuable help of Kirsten Elger and Boris Radosavljevic from the GFZ Data Service was instrumental in getting the dataset ready for publication.Both dedicated a lot of time and patience, for which I am very grateful.Constructive comments by Michael D. Krom and an anonymous reviewer are also gratefully acknowledged.
Review statement.This paper was edited by David Carlson and reviewed by Michael Krom and one anonymous referee.

Figure 1 .
Figure 1.Overview of the location of samples assembled in the database for neodymium and strontium isotope ratios.The sample types are indicated by different markers: blue crosses for marine sediments, green dots for sediment traps, red crosses for soil samples, black dots for river sediments (river banks or particulate matter), blue triangles for fossil bivalves (Osborne et al., 2008) and yellow diamonds for deposited dust samples.(a) Dataset assembled for the first iteration published in September 2018.The African PSAs are indicated as grey underlines and were redrawn from Scheuvens et al. (2013).(b) Dataset assembled for the second iteration published in April 2019.A complete list of references is provided in the Supplement to the main dataset (http://doi.org/10.5880/GFZ.4.3.2019.001).

Figure 2 .
Figure 2. Contour maps of the isotopic signature of marine surface sediments in the Mediterranean.(a) Contour maps produced using the assembled dataset for strontium and neodymium isotopes.The contour lines and filled contour maps were realized using individual surface (or core-top) sediment samples.Each sample is represented by a dot whose colour indicates its isotopic value according to the scale at the right of the panels.Aerosol samples are overlaid and represented by a diamond with a similar colour code.These maps are compared to previous contour maps from Krom et al. (1999b) (where the Sr isotopes calculate a percentage of surface sediments derived from the Nile River runoff) (b) and Weldeab et al. (2002a) (c), which were inspirational to this work.Their respective extent is reported on the maps in (a).

Figure 3 .
Figure 3. Contour maps of the isotopic signature of marine and terrestrial sediment in the northern African sector realized using the first iteration of the database (September 2018).(a) Comparison of the isotopic signature of marine surface sediments (contour maps) to that of terrestrial samples.Sample type is indicated by specific markers, whose colour indicates its isotopic value according to the scale to the right of the panels: dots for marine surface sediments, squares for terrestrial (soil and river) sediment samples, and diamonds for aerosols and triangles for fossil bivalves.(b) Maps from Scheuvens et al. (2013), which were inspirational to this work.The sample type is given by the colour of the markers: blue for terrestrial samples, black for marine samples, green for aerosols (sediment traps) and red for deposited samples.Isotopic values are reported for each sample and the isolines fromWeldeab et al. (2002b; cf.Fig. 3) are also reported.

Figure 4 .
Figure 4. Box and whiskers plot for the isotopic signatures of African PSAs.After identifying samples that are located in each PSA (see Table3and Fig.1), the range and skewness of datasets in the PSA were analysed using box and whiskers plots (when sample number n was higher than 5) and compared to the isotopic ranges reported byScheuvens et al. (2013) (green bars).Data points are shown as red diamonds and the arithmetical mean is provided (blue dots).The rectangles indicate the upper and lower quartiles and the median is shown as a thicker horizontal line(Krzywinski and Altman, 2014).

Table 3 .
Grousset and Biscaye (2005)ted in the African PSA (potential source area for dust generation).Headers from left to right: Label: name of the sampling location or sample; Region: country where the samples were taken; Location: name of the PSA (see Fig.1); Sample type: soil or river sediment; Isotopic ratios of Sr and Nd; Longitude and Latitude (decimal degrees); Ref.: reference publications.These data were used to determine and plot the statistical values in Fig.4.References: 1:Grousset et al. (1992), 2:Grousset and Biscaye (2005), 3: Grousset

Table 4 .
Output table after second iteration(April 2019).Overview of the results obtained in number of data points per type of sample, time interval and location of the samples (Af: Africa, Am: America, Ant: Antarctica, Arct O: Artctic Ocean, Asia, Atl O: Atlantic Ocean, Aus: Australia, Eur: Europe, Green: Greenland, Ind O: Indian Ocean, South O: Southern Ocean, Medit S: Mediterranean Sea, Pac O: Pacific Ocean).The dataset of neodymium and strontium isotope ratios and the associated metadata table, as well as Table2(determination of isotopic signature and identification of specific time intervals) and the associated metadata table, are available at https://doi.org/10.5880/GFZ.4.3.2019.001