Preprints
https://doi.org/10.5194/essd-2021-43
https://doi.org/10.5194/essd-2021-43

  17 Mar 2021

17 Mar 2021

Review status: this preprint is currently under review for the journal ESSD.

The Surface Water Chemistry (SWatCh) database: A standardized global database of water chemistry to facilitate large-sample hydrological research

Lobke Rotteveel and Shannon M. Sterling Lobke Rotteveel and Shannon M. Sterling
  • Sterling Hydrology Research Group, Dalhousie University, Halifax, B3H 4R2, Canada

Abstract. Openly accessible global scale surface water chemistry datasets are urgently needed to detect widespread trends and problems, to help identify their possible solutions, and identify critical spatial data gaps where more monitoring is required. Existing datasets are limited in availability, sample size/sampling frequency, and geographic scope. These limitations inhibit the answering of emerging transboundary water chemistry questions, for example, the detection and understanding of delayed recovery from freshwater acidification. Here, we begin to address these limitations by compiling the global surface water chemistry (SWatCh) database, available on Zenodo (DOI: https://doi.org/10.5281/zenodo.4559696) We collect, clean, standardize, and aggregate open access data provided by six national and international agencies to compile a database consisting of three relational datasets: sites, methods, and samples, and one GIS shapefile of site locations. We remove poor quality data (for example, values flagged as suspect), standardize variable naming conventions and units, and perform other data cleaning steps required for statistical analysis. The database contains water chemistry data across seven continents, 17 variables, 38,598 sites, and over 9 million samples collected between 1960 and 2019. We identify critical spatial data gaps in the equatorial and arid climate regions, highlighting the need for more data collection and sharing initiatives in these areas, especially considering freshwater ecosystems in these environs are predicted to be among the most heavily impacted by climate change. We identify the main challenges associated with compiling global databases – limited data availability, dissimilar sample collection and analysis methodology, and reporting ambiguity – and provide recommendations to address them. By addressing these challenges and consolidating data from various sources into one standardized, openly available, high quality, and trans-boundary database, SWatCh allows users to conduct powerful and robust statistical analyses of global surface water chemistry.

Lobke Rotteveel and Shannon M. Sterling

Status: open (until 10 Jul 2021)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse

Lobke Rotteveel and Shannon M. Sterling

Data sets

The Surface Water Chemistry (SWatCh) database Lobke Rotteveel https://doi.org/10.5281/zenodo.4559696

Model code and software

The Surface Water Chemistry Database (SWatCh) Lobke Rotteveel https://github.com/LobkeRotteveel/SWatCh

Lobke Rotteveel and Shannon M. Sterling

Viewed

Total article views: 263 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
203 52 8 263 15 5 11
  • HTML: 203
  • PDF: 52
  • XML: 8
  • Total: 263
  • Supplement: 15
  • BibTeX: 5
  • EndNote: 11
Views and downloads (calculated since 17 Mar 2021)
Cumulative views and downloads (calculated since 17 Mar 2021)

Viewed (geographical distribution)

Total article views: 213 (including HTML, PDF, and XML) Thereof 213 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 15 Jun 2021
Download
Short summary
Data is needed to detect environmental problems, find their solutions, and identify knowledge gaps. Existing datasets have limited availability, sample size and/or frequency, or geographic scope. Here, we begin to address these limitations by collecting, cleaning, standardizing, and compiling the Surface Water Chemistry (SWatCh) database. SWatCh contains global surface water chemistry data for seven continents, 17 variables, 38,598 sites, and over 9 million samples collected from 1960 to 2019.