A database of databases for Common Era paleoclimate applications
Abstract. We present a database of curated databases (DoD2k version 1) developed for Common Era (1–2000 A.D.) paleoclimate research. The DoD2k leverages existing community efforts, many of which arise from the PAGES (Past Global Changes) 2k working group, and the codebase developed by the paleoclimate data informatics communities over the past decade. Using a common, compact set of terms for metadata and data management, we merge five existing curated databases. These individual curated databases represent a range of approaches, from single archive-single observation to multiarchive-multiobservation collections, and span a total of 14 archives, 49 data types, and 4613 records within the Common Era. We then use a multistage algorithm to remove duplicates, checking against a common set of metadata and comparison metrics. We illustrate the value of the DoD2k with two applications. In the first, we extract the moisture and temperature subset of records and perform an empirical orthogonal function (EOF) analysis on the resulting multi-archive, multi-observation dataset. In the second, we show that calcite speleothem oxygen isotopic composition is consistent with proxy system simulations. DoD2k may also be useful for paleoclimatic detection and attribution analysis using proxy system modeling, data assimilation, and deep learning for the development and testing of improved proxy system models.