Articles | Volume 16, issue 12
https://doi.org/10.5194/essd-16-5563-2024
https://doi.org/10.5194/essd-16-5563-2024
Data description paper
 | 
05 Dec 2024
Data description paper |  | 05 Dec 2024

Gas exchange velocities (k600), gas exchange rates (K600), and hydraulic geometries for streams and rivers derived from the NEON Reaeration field and lab collection data product (DP1.20190.001)

Kelly S. Aho, Kaelin M. Cawley, Robert T. Hensley, Robert O. Hall Jr., Walter K. Dodds, and Keli J. Goodman
Abstract

Air–water gas exchange is essential to understanding and quantifying many biogeochemical processes in streams and rivers, including greenhouse gas emissions and metabolism. Gas exchange depends on two factors, which are often quantified separately: (1) the air–water concentration gradient of the gas and (2) the gas exchange velocity.  There are fewer measurements of gas exchange velocity compared to concentrations in streams and rivers, which limits accurate characterization of air–water gas exchange (i.e., flux rates). The National Ecological Observatory Network (NEON) conducts SF6 gas-loss experiments in 22 of their 24 wadeable streams using standardized methods across all experiments and sites, and publishes raw concentration data from these experiments on the NEON data portal. NEON also conducts NaCl injections that can be used to characterize hydraulic geometry at all 24 wadeable streams. These NaCl injections are conducted both as part of the gas-loss experiments and separately. Here, we use these data to estimate gas exchange and water velocity using the reaRate R package. The dataset presented includes estimates of hydraulic parameters, cleaned raw concentration SF6 tracer-gas data (including removing outliers and failed experiments), estimated SF6 gas-loss rates, normalized gas exchange velocities (k600; m d−1) and normalized depth-dependent gas exchange rates (K600; d−1). This dataset provides one of the largest compilations of gas-loss experiments (n=339) in streams to date. This dataset is unique in that it contains gas exchange estimates from repeated experiments in geographically diverse streams across a range of discharges. In addition, this dataset contains information on the hydraulic geometry of all 24 NEON wadeable streams, which will support future research using NEON aquatic data. This dataset is a valuable resource that can be used to explore both within- and across-reach variability in the hydraulic geometry and gas exchange velocity in streams. The data are available at https://doi.org/10.6073/pasta/18dcc1871ee71cf0b69f2ee4082839d0 (Aho et al., 2024), and the reaRate R package code is available at https://doi.org/10.5281/zenodo.12786089 (Cawley et al., 2024).

1 Introduction

Air–water gas exchange contributes to many aquatic processes in streams and rivers, including greenhouse gas emissions (Liu et al., 2022; Rocher-Ros et al., 2023), aquatic metabolism (Aristegi et al., 2009; Hall et al., 2016; Hall and Hotchkiss, 2017), and reoxygenation rates after wastewater discharge (O'Connor and Dobbins, 1958). Despite this importance, gas exchange can be difficult to measure and model (Churchill et al., 1964; Hornberger and Kelly, 1975; Rathbun, 1977; Ulseth et al., 2019). According to Fick's law of diffusion, gas flux across the air–water boundary depends on the concentration gradient of the gas and the gas exchange velocity (k; m d−1):

(1) flux = k gas dissolved - gas equilibrium ,

where [gas]dissolved is the concentration of the gas of interest, and [gas]equilibrium is the concentration of the gas at equilibrium with the atmosphere.

In streams and rivers, measurements of gas concentrations are more readily available than estimates of k; resultantly, estimates of k are often extrapolated from a few measurements. Several methods exist for assessing k, including predictive models (Raymond et al., 2012), models of gas dynamics through time and space in rivers (Appling et al., 2018), and direct measurements with tracers (Hall and Hotchkiss, 2017). Here, k refers to exchange of dissolved gases (i.e., diffusive and bubble-mediated gas transfer) and does not capture ebullitive fluxes. Characterizing k from direct measurement of tracer-gas exchange velocities and modeling based on observed diurnal gas dynamics are likely more accurate for any particular stream or river than more general predictive models (Appling et al., 2018; Hall and Ulseth, 2020; Riley and Dodds, 2013).

Gas exchange velocity is spatiotemporally variable; it is controlled by energy dissipation rate and, therefore, turbulence at the air–water boundary (Zappa et al., 2007). Models that estimate k at broad spatial scales and in low-versus-high gradient streams have found that hydraulic variables (e.g., streambed slope (S; unitless), water velocity (v; m s−1), mean water depth (z; m), discharge (Q; L s−1)) are the dominant controls on variation in k (Churchill et al., 1964; O'Connor and Dobbins, 1958; Rathbun, 1977; Raymond et al., 2012). Although similar models for within-reach temporal variability are not widely available, hydrology is expected to control k locally. Existing reach-scale studies have reported different k responses to Q (Genzoli and Hall, 2016; Maurice et al., 2017; McDowell and Johnson, 2018) and point to the importance of quantifying the variable relationships between k and Q on a site-by-site basis. The dataset presented here leverages a unique set of tracer-gas experiments conducted by the National Ecological Observatory Network (NEON) that will allow for assessment of within- and across-reach variability in lotic gas exchange in a wide variety of streams.

Tracer-gas experiments are an approach to estimating k in streams and rivers and involve diffusing an inert tracer gas (e.g., sulfur hexafluoride (SF6)) at a constant rate at the upstream end of a stream reach and measuring how concentrations decline downstream of the injection point. Often a conservative salt (e.g., sodium chloride (NaCl) or sodium bromide (NaBr)) is also injected with the tracer gas to allow for correction of dilution from hydrologic inflows (referred to as “salt correction” hereafter).

Here, we present a continental-scale dataset of gas exchange rates for wadeable streams derived from NEON data. The substantial processing that was required to estimate gas exchange is described in detail below and archived alongside the dataset. The dataset presented here contains estimates of k600, or k normalized to a Schmidt number of 600, and K600, which is a depth-corrected rate (K600=k600/z) used in stream metabolism modeling, for 22 streams and v, z, and at-a-station hydraulic geometry for 24 streams (Fig. 1).

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f01

Figure 1Map of NEON stream sites. Blue symbols indicate sites where NEON conducts full tracer-gas experiments and, thus, where we were able to estimate k600, K600, v, z, and at-a-station hydraulic geometry. Red symbols indicate sites where NEON only conducts NaCl slug injections and, thus, where we were able to calculate v, z, and at-a-station hydraulic geometry.

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f02

Figure 2Diagram of a model study reach for NEON tracer-gas experiments. Each ∼500 m study reach comprises an injection site and four downstream sampling stations (Stations 1–4). At the upstream injection site, SF6 is diffused into the water column with an air stone, and NaCl or NaBr is dripped into the stream. After plateau concentrations are reached at the downstream end of the study reach, injection rates are maintained, and field quintuplicate samples for SF6 and NaCl or NaBr concentrations are collected at four downstream stations spaced along the study reach. At the upper and lower stations, conductivity sensors are deployed and used to monitor either (1) when NaCl plateau concentrations are reached (for NaCl continuous injections) or (2) when a NaCl “slug” peaks at each station (for NaBr continuous injections). Before each experiment, stream discharge is measured with a flow meter or ADCP, and 30 wetted widths are collected across the study reach. Also, before each injection, background NaCl or NaBr concentrations are collected at all four sampling stations. This diagram uses modified imagery from the University of Maryland Center for Environmental Science Integration and Application Network.

Download

2 Methods

2.1 NEON tracer-gas experiments

NEON conducts tracer-gas experiments at 22 stream sites, which are distributed across the United States, from Puerto Rico to Alaska (Fig. 1). For information about specific site characteristics, see the NEON website: https://www.neonscience.org/field-sites/explore-field-sites (last access: 2 October 2024). In general, the NEON streams are relatively small; the median watershed size is 11.5 km2, while the mean ± standard deviation watershed size is 27.5±58.5 km2. Walker Branch (WALK) drains the smallest watershed (1.1 km2) while Sycamore Creek (SYCA), an intermittent, desert stream, drains the largest (280 km2). Therefore, this dataset is only representative of streams and small rivers and not large rivers. These experiments entail continuous injections of SF6 and a conservative salt tracer (either NaCl or NaBr) at the upstream end of a ≤500 m stream reach (Fig. 2). NEON uses SF6 because it does not occur naturally, is not biologically or chemically reactive, and can be detected at low concentrations (Cole and Caraco, 1998; Ho et al., 2011; Maurice et al., 2017; Wanninkhof et al., 1985); although used in very small amounts in these experiments, SF6 is a potent greenhouse gas, and tracer-gas experiments are ceasing at sites with sufficient hydrologic coverage. When NaBr is used as the salt tracer, an additional NaCl “slug” injection is performed, which allows for the subsequent calculation of v and the derivation of z from paired Q measurements (via flowmeter or acoustic Doppler current profiler, ADCP) and wetted width measurements taken at 30 points along the study reach. Before the injection, NaCl or NaBr is collected at each of the four stations along the study reach; these data can later be used to correct NaCl or NaBr concentrations during the injection for background conditions. Once conductivity during the injection either reaches a plateau (for constant-rate NaCl injections) or returns to background levels (for NaCl slug injections) at the most downstream station, five replicate SF6 and NaCl or NaBr samples are collected at each of the four stations located along the study reach. Samples for SF6 are collected via headspace equilibration in 60 mL syringes, stored in gas-tight evacuated vials (12 mL, Exetainer), and run on a gas chromatograph with an electron capture detector (ECD). Samples for NaCl and NaBr are filtered to 0.7 µm, collected in 60 mL HDPE bottles, and run on an ion chromatograph. In addition, high-frequency (0.1 Hz) sensors are deployed to monitor NaCl conductivity at the upstream and downstream end of the study reach (Fig. 2). NEON publishes SF6 mixing ratios, NaCl and NaBr concentrations, wetted width data, and conductivity time series from these experiments as the Reaeration field and lab collection data product, DP1.20190.001 (NEON, 2024b), and measurements of Q in the Discharge field collection data product, DP1.20048.001 (NEON, 2024a). More detailed information on NEON's data collection procedures can be found on their website, http://www.neonscience.org (last access: 2 October 2024).

NEON has conducted tracer-gas experiments 6–10 times per year for 6–8 years at all 22 sites to capture a range of discharge conditions. Presently, tracer-gas experiments are ceasing at sites with sufficient hydrograph coverage (https://www.neonscience.org/impact/observatory-blog/protocol-change-reaeration-field-and-lab-collection-dp120190001, last access: 2 October 2024). However, NaCl slug injections will continue to be performed quarterly to collect high-frequency conductivity time-series data that allow for the calculation of v and the derivation of z from paired Q and wetted-width measurements. Similarly, NaCl slug injections are and will continue to be conducted for the two sites where tracer-gas experiments are not collected (Blue River (BLUE), where wide channel widths and high discharges make tracer-gas studies challenging, and Arikaree River (ARIK), where long travel times make tracer-gas studies infeasible).

The dataset presented here represents substantial processing of these published data (i.e., SF6 mixing ratios, NaCl and NaBr concentrations, conductivity time series, wetted widths, and measurements of Q) to estimate k600 or K600 using the reaRate R package (Cawley et al., 2024). In addition, this dataset contains estimates of v from NaCl injections, which, as mentioned above, are performed both during tracer-gas experiments and at quarterly intervals at sites where tracer-gas experiments are not conducted or have ceased. Along with the paired Q measurement and the average wetted width for the study reach (w; m), the estimate of v was used to derive z. The dataset presented here contains estimates of v and z and at-a-station hydraulic geometry for all 24 NEON wadeable streams. This dataset provides a large compilation of direct measurements of tracer-gas experiments and at-a-station hydraulic geometry in small streams across the United States.

2.2 reaRate R package

The data processing pipeline described below uses the reaRate R package (Cawley et al., 2024). The package estimates k600 and K600 from data available on the NEON data portal. The package works by fitting an exponential, first-order decay function to the observed longitudinal decline in published SF6 concentrations and solving for the longitudinal tracer-gas-loss rate (Kd; m−1):

(2) C x = C 0 e - K d x ,

where C0 and Cx are tracer-gas concentrations at the top of the study reach and at a downstream point x, respectively, and Kd is the average distance traveled by an SF6 molecule before it is emitted to the atmosphere. For sites where lateral inflows (e.g., groundwater inputs, overland flow, tributaries) appear to dilute SF6 concentrations, the ratio of SF6 to NaCl or NaBr is used to calculate a salt-corrected Kd value. The Kd values can then be converted to the gas exchange velocity for the tracer gas (e.g., kSF6; m d−1):

(3) k SF 6 = z ν K d .

This gas-specific k can be normalized to k600 using a Schmidt number of 600:

(4) k 600 = k SF 6 600 Sc SF 6 - n ,

where n is the Schmidt number exponent (0.5 for flowing waters), and ScSF6 is the temperature-dependent Schmidt number for SF6 at water temperature T in degrees Celsius (Jähne et al., 1987; Raymond et al., 2012; Wanninkhof, 1992):

(5) Sc SF 6 = 3255.3 - 217.13 T + 6.8370 T 2 - 0.086070 T 3 .

Reporting estimates of k600 is common; it allows for comparisons with existing work and can be scaled to other gases, including CO2 and O2, using the same approach as in Eq. (4) with gas-specific, temperature-dependent Schmidt numbers:

(6) k gas = k 600 Sc gas 600 - 1 2 .

Some applications (e.g., metabolism, reoxygenation rates) explicitly include water depth in their modeling frameworks and thus require a depth-dependent gas exchange rate (K; d−1). In these cases, a normalized gas exchange rate (K600) can be used and is related to k600 by dividing by z for the upstream reach corresponding to a length of at least 1Kd . Using the same scaling relationships shown in Eqs. (3) and (5), K600 can be converted to gas-specific Kgas estimates.

The reaRate package uses NaCl breakthrough curves (i.e., from either continuous injection or slug injections) to estimate the travel time between the upstream and downstream stations and then calculates v as the distance between stations divided by the travel time. Using the continuity equation, z is calculated by dividing Q by v and w. Finally, kSF6 and KSF6 are calculated from Kd, v, and z (Eq. 3) and then normalized to k600 and K600 (Eq. 4). The reaRate package includes two approaches to estimate k600 and K600: an un-pooled frequentist approach and a partially pooled Bayesian approach, both of which are described in more detail below. Implementation of the package and all processing described below were conducted in R 4.2.3 (R Core Team, 2023). More information about the package, including details about the individual functions and a processing pipeline, is provided in Sect. 2.3.

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f03

Figure 3Overview of data processing to estimate k600 and K600 from NEON tracer-gas experiments.

Download

2.3 Data processing

Substantial processing was required to estimate k600 and K600 from the NEON data (Fig. 3). All data used were downloaded from the NEON data portal. Downloads consisted of two NEON data products: Reaeration field and lab collection (DP1.20190.001) and Discharge field collection (DP1.20048.001). Data were from RELEASE-2023, in addition to nine experiments that were provisional but are now included in RELEASE-2024 (NEON, 2023a, b, 2024a, b). On a site-by-site basis, data were merged and formatted using the def.format.reaeration() function from the reaRate R package. This function compiles variables from across the downloaded data into a single data frame. These variables include Q, w, water temperature, station location as a distance downstream from the injection point, and SF6 and NaCl or NaBr concentrations for each station during the experiment. The function also applies a salt correction to the SF6 data (e.g., SF6 concentration divided by background-corrected NaCl or NaBr concentrations). The function removes outliers (points more extreme than 1.5×IQR below and above the first and third quartile, respectively) from the quintuplet SF6 and NaCl or NaBr concentrations for each station, calculates the mean and standard deviation SF6 and NaCl or NaBr concentration for each station, and flags stations as “unmixed” when the coefficient of variation (CV=SDmean) of the replicate SF6 and NaCl or NaBr concentrations is greater than 10 %.

Table 1Sites and number of experiments.

Download Print Version | Download XLSX

Next, SF6 and NaCl or NaBr declines were visualized and quantified for each experiment and initial quality control flags were assigned. The gas.loss.rate.plot() function from the reaRate package was used to visualize and calculate both salt-corrected and uncorrected longitudinal gas-loss rates over the length of the study reach (Kd). For the sites requiring a salt correction to account for lateral inflows, mean SF6 mixing ratios at each station were first divided by mean background-corrected NaCl or NaBr concentration for the corresponding station. Station-specific outliers (i.e., values more extreme than first quartile -1.5×IQR and third quartile +1.5×IQR) were removed. SF6 concentrations were then log-normalized, and Kd was calculated from the resulting linear decline. A quality control flag was assigned to an individual experiment if any of the following criteria applied:

  • Visually, the SF6 gas-loss rate over the entire study reach was unduly affected by anomalous SF6 concentrations (potentially indicating contamination, leaked vials, or analytical error).

  • SF6, NaCl, or NaBr concentrations increased in a downstream direction between any of the stations (likely indicating incomplete mixing in the water column).

  • The salt-corrected Kd was larger than the uncorrected Kd (a salt correction should correct for overestimation due to lateral inflows, with the reverse potentially indicating contamination or analytical error).

For each experiment, v was calculated from the conductivity time series using the def.calc.trvl.time() function. The function requires that the user manually selects points bracketing either the rising limb (for constant rate injections) or the peak concentration (for slug injections). From within the user-selected bracket, the def.calc.trvl.time() function smooths the data using a loess function and then identifies the peak of the breakthrough by finding either where the first derivative is 0 (for a slug injection) or where it is at its maximum (for a constant rate injection). This function then calculates the breakthrough travel time between the two stations and uses the distance between stations to calculate v. Site-specific relationships between v and Q were visualized in log–log space, and any anomalous values were reprocessed with the def.calc.trvl.time() to confirm that the user-selected bracketing allowed the function to pick the correct points on the time series. Finally, z was calculated using the def.calc.trvl.time() function, which divides Q by v and w.

Two separate approaches were used to estimate k600 or K600 values from the formatted data. The first approach used the def.calc.reaeration() function to multiply Kd for each individual experiment by the corresponding v and z values (Eq. 3) to estimate kSF6, which were then converted to k600 (Eq. 4). The resulting k600 estimates were converted to K600 by dividing by water depth. This approach is subsequently referred to as the un-pooled, frequentist approach and is included in this data descriptor because it represents the current, prevailing approach for processing this type of data, is straightforward to implement, and represents the output of the def.calc.reaeration() function included in the reaRate package.

The second approach used Bayesian multilevel models that pooled experiments from the same site across releases. The models, coded in the Stan probabilistic programming language, used for this approach are also included in the reaRate package. A Bayesian approach provides flexibility in specifying models that consider repeat experiments at a site and current theory surrounding gas exchange. Bayesian inference allowed partial pooling of k600 estimates across releases in any one stream. Partial pooling reduces the error in any one estimate of k600 and shrinks all k600 estimates to the site-level mean (as conditioned on discharge) if error in measuring SF6, NaCl, and/or NaBr is high.

The Bayesian approach included error at two levels. First, the models pooled k600 estimates across releases from the same site to estimate Kd from normalized SF6 concentrations (both salt-corrected and uncorrected). For this step, the relationship between the SF6 loss rate and the product of Kd and reach length was assigned a prior normal distribution with a normally distributed sigma (0, 0.2) and intercept (0, 0.1). We fully pooled the intercept with a strong prior near 0 because all SF6 concentrations (i.e., measurements from Stations 1–4) were normalized to the SF6 concentration at Station 1; this approach means that the intercept should be near 1, or 0 when logged. Thus, the model fit can be described as variable-slope, fixed-intercept linear regression. Second, the models pooled the estimates of k600 and K600 by Q, using linear relationships between Q and k600 or K600. Priors were assigned for both the slope and the intercept based on predictions from an existing scaling model (Eq. 4 in Raymond et al., 2012). These priors were given large variances when possible (i.e., 10 for the intercept and 1 for the slope) to allow for divergence from the model predictions. However, at sites with a limited number of experiments (e.g., Pringle Creek, PRIN, and Kings Creek, KING), we used smaller variances to allow the model to converge. The site-specific priors used are listed in Table S1.

The two levels described can be referred to as a within-release model and an among-release model. The within-release model was a log-transformed (base e) exponential model. We log-transformed the model because SF6 is always positive (ambient = 0) and because errors in the measurement of SF6 can scale with the magnitude of the concentration.

logSi,j=logS0-KD,jxi,j+εi,j

and

εi,jnormal0,σrelease,

where Si,j is the SF6 concentration normalized to the concentration at Station 1 for any one release (sample i in release j), S0 is the normalized SF6 concentration at Station 1, xi,j is the distance downstream to which the normalized concentration corresponds, and εi,j is a normally distributed random variable with μ=0 and SD=σrelease. We then converted KD,j to gas exchange velocity kSF6,j using Eq. (3) and normalized to k600,j using Eq. (4). The among-release model included a linear model predicting the parameter k600,j as a linear function of discharge:

logk600,j=a+b×logQj+εj

and

εjnormal0,σstream,

where a and b are the intercept and slope parameters of a model regressing log (k600,j) and log (Qj), where Qj is the discharge during any one release and σstream is the residual variation. We also fit linear second-level models with log (K600,j), where K600,j is the per-time gas exchange rate.

We fit models using Stan in the RStan package in R (Stan Development Team, 2023). The models were run for at least 5000 iterations over four chains. Models were assessed according to the number of divergent transitions, the effective sample size (ESS) for each estimated parameter (>1000), and posterior predictive checks with the ShinyStan R package (Gabry et al., 2023). In addition, the model fits for each experiment were visually assessed (Figs. S1–S2). Finally, the median estimates for k600 and K600 were visualized in log(k600)–log(Q) or log(K600)–log(Q) space, respectively, along with 1000 Markov chain–Monte Carlo (MCMC) estimates of the log(k600)–log(Q) or log(K600)–log(Q) relationship, respectively. If an estimate of k600 and K600 fell outside the overall Q relationship and if that experiment's model fits showed signs of being unduly influenced by unrealistic gas-loss patterns (e.g., very little decline indicating the study reach was too short, an abrupt decline indicating improper mixing), the experiment was assessed for the QC flags described above. If a QC flag had previously been assigned, then that experiment was removed (e.g., it was identified as an outlier and could be attributed to experimental error), and the model was rerun without that experiment.

2.4 Recommended estimates

The processing pipeline outlined above in Sect. 2.3 resulted in both un-pooled frequentist and Bayesian estimates of k600 and K600, both with and without salt corrections. We include outputs from all four approaches in the dataset for completeness and to allow future users to decide which estimates best fit their needs and to compare the two approaches. The complete dataset is available in the gasExchange_ds_v2.csv file (Aho et al., 2024).

During data processing, we found that the NaCl and NaBr concentration data also could introduce errors and uncertainties into our estimates of k600 and K600. For instance, background concentrations at a single station were occasionally so high that contamination was the likely explanation. Further, sometimes samples taken during the constant-rate injection could vary across the reach in unpredictable ways (e.g., increases across the reach, random peaks along the reach instead of the expected stable, declining concentrations), which was likely the result of incomplete mixing with the water column. In many cases, the quality of the salt-corrected SF6 data precluded Bayesian-model convergence. Through assessing the gas-loss plots and salt concentration declines for all experiments and the failures to produce model convergence for salt-corrected data, we determined that salt corrections had the strong potential to either introduce errors into or preclude estimates of k600 and K600. Therefore, we suggest only using salt-corrected data when clearly necessary (e.g., obvious lateral inflow) and possible in terms of data quality and model convergence. We determined that salt correction was important for five sites: Como Creek (COMO), Rio Cupeyes (CUPE), Rio Yahuecas (GUIL), Martha Creek (MART), and Walker Branch (WALK). Notably, several of these sites have lateral inflows within the study reach under certain hydrologic conditions, which explains the necessity for the salt correction. For completeness, our dataset includes estimates for both uncorrected and salt-corrected k600 and K600 when a salt correction is possible.

In addition to the complete dataset of all estimates (i.e., estimates from both frequentist and Bayesian approaches for both uncorrected and salt-corrected data), we also include a curated dataset of recommended estimates of k600 and K600. These recommended values are called “best_k600_mPerDay” and “best_K600_perDay” in the gasExchange_ds_v2.csv file. In all cases, the curated selection uses the Bayesian estimates because pooling across experiments and the use of informative priors resulted in more meaningful estimates than the non-Bayesian approach. The choice of whether we recommend an uncorrected or salt-corrected estimate stems from examining the relationships between uncorrected or salt-corrected estimates (Fig. S3).

3 Data description

3.1 Hydraulics

Our processing pipeline included calculating hydraulic parameters (v and z) for each of the NaCl injections and measurements of Q and w. These variables (v, z, Q, and w) for each NaCl release are available in the hydraulics_v2.csv file (Aho et al., 2024). Here, we present those data in terms of at-a-station hydraulic geometries, which are commonly used to quantify reach-specific relationships between Q and w, z, and v (Leopold and Maddock, 1953):

(7)w=aQb(8)z=cQf(9)v=kQm.

In log–log space, these exponential relationships become linear relationships (e.g., w=aQb becomes logw=loga+b×logQ), where the exponent is the slope log-linear relationship. Future users of NEON data can use these relationships along with discharge values from either the Continuous discharge data product (DP4.00130.001) or the Discharge field collection (DP1.20048.001) to approximate w, z, and v for NEON streams.

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f04

Figure 4At-a-station hydraulic geometries for the relationship between Q and w. The exponent b of the power law relationship w=aQb is the linear slope of the relationship in log–log space and is denoted on each subplot.

Download

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f05

Figure 5At-a-station hydraulic geometries for the relationship between Q and z. The exponent f of the power law relationship z=cQf is the linear slope of the relationship in log–log space and is denoted on each subplot.

Download

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f06

Figure 6At-a-station hydraulic geometries for the relationship between Q and v. The exponent of the power law relationship v=kQm is the linear slope of the relationship in log–log space and is denoted on each subplot.

Download

Table 2Coefficients and exponents from at-a-station hydraulic geometries. The 95 % confidence intervals for each coefficient and exponents are shown. In addition, the R2 values from the log-linear relationships are also presented.

Download Print Version | Download XLSX

The hydraulic relationships are illustrated (Figs. 4–6, Table 2). These geometries leverage field measurements of w (n=783) and Q (n=601), estimates of v (n=618) from NaCl injections, and estimates of z (n=581) calculated from Q/vw. The timing of these measurements is denoted on site-specific hydrographs for each site (Fig. S4), and their coverage of site-specific Q ranges is illustrated on flow duration curves (Fig. S5). In general, zQ and vQ relationships were the strongest, with all but three relationships having R2>0.5 and relatively narrow 95 % confidence intervals around the coefficients from these relationships (Table 2). The wQ relationships are the weakest; 9 of the 24 sites have R2<0.5 and large 95 % confidence intervals (Table 2). The wQ relationships may be weaker because our width estimates represent an average of 30 measurements across the ∼500 m study reach. It is possible that this across-reach averaging contributes to the weaker relationships with Q and that perhaps the relationships would be stronger if the measurement was only taken at the same location as the Q measurement. However, this single-point approach would be less compatible with v measured of the entire reach and would alter the resulting calculations of z.

We assess the quality of our hydraulic parameters by examining the product of the constants (a×c×k) and sum of the exponents (b+f+m) for unity on a site-by-site basis. These unity relationships stem from the fact that Q=wzv (Leopold and Maddock, 1953). The products of the constants ranged from 0.93 to 1.04 and averaged 1.00; the site-specific sum of the exponents ranged from 0.96 to 1.01 and averaged 1.00. There was one instance where the product of the constants deviated more than 5 % from unity (0.93, PRIN). Pringle Creek (PRIN) is a semi-arid, intermittent stream in Texas, and so the deviation from unity may stem from logistical difficulties in measuring low and non-perennial stream flows (Seybold et al., 2023; Shanafield et al., 2021). However, the remainder of the sites had both products of constants and sums of exponents within 5 % of unity.

We assess the representativeness of our hydraulic parameters through comparison to literature values for the exponents. Previous studies have shown large ranges for all three exponents, with ranges spanning 0–0.6 for b, 0–0.8 for f, and 0–0.8 for m (Park, 1977; Rhodes, 1977). In addition, exponents have not been shown to vary predictably with region or climate (Park, 1977) but rather with channel geometry (Ferguson, 1986). Our parameters fall within published ranges, and our average values for each exponent (b=0.11, f=0.33, m=0.56) are similar to averages in other studies that span many streams (b=0.14, f=0.30; Morel et al., 2020; b=0.12, f=0.37, m=0.51; Dingman and Afshari, 2018). In sum, the hydraulics dataset and associated hydraulic-geometry relationships presented here can be used to characterize w, z, and v for NEON streams.

3.2k600 and K600 estimates

As described above, k600 was estimated in two ways: (1) via an unpooled frequentist approach using the def.calc.reaeration() function to estimate k600 independently for each experiment and (2) via a partially pooled Bayesian approach that partially pooled experiments from the same site according to Q. Both approaches were attempted for raw SF6 concentrations and salt-corrected SF6 concentrations. Salt-corrected SF6 concentrations are only recommended for the five sites mentioned above (COMO, CUPE, GUIL, MART, WALK). All estimates are available in the gasExchange_ds_v2.csv file (Aho et al., 2024).

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f07

Figure 7Comparison of partially pooled Bayesian and unpooled frequentist estimates of k600. Black 1:1 lines overlay the points for reference.

Download

The relationship between the partially pooled and unpooled estimates (with salt correction when appropriate) is shown in Fig. 7. Any points falling above the 1:1 line are instances when partial pooling resulted in a lower estimate than no pooling, and vice versa. Overall, there are instances where partially pooled estimates are both higher and lower than un-pooled estimates, suggesting that partial pooling successfully regularized estimates. This shrinkage is more obvious when both estimates are plotted against Q (Fig. 8). We made recommendations (best_k600_mPerDay and best_K600_perDay in the gasExchange_ds_v2.csv) of which estimates to use; this curated dataset includes only Bayesian estimates, and a salt correction was only recommended for the five sites where it was possible and necessary.

Table 3Mean, standard deviation (SD), minimum, maximum, and count for k600 (m d−1) and K600 (d−1) estimates by site.

Download Print Version | Download XLSX

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f08

Figure 8Relationship between Q and partially pooled Bayesian and unpooled frequentist estimates of k600. The unpooled estimates are shown in green, with a green regression line with 95 % confidence intervals, while the partially pooled estimates are shown in purple, with a purple regression line with 95 % confidence intervals.

Download

https://essd.copernicus.org/articles/16/5563/2024/essd-16-5563-2024-f09

Figure 9Box plots of the test estimates of (a) k600 and (b) K600 by site. Boxes represent the median and interquartile range (IQR), whiskers mark the lesser of first (third) quartile (+) 1.5×IQR or minimum/maximum, and points denote outliers more extreme than first (third) quartile (+) 1.5×IQR.

Download

There are 339 estimates of k600 and K600 included in our recommended dataset (Fig. 9, Table 3) that span large Q ranges at each site (Figs. S4–S5). The number of estimates per site ranges from 4 (Kings Creek (KING) and Pringle Creek (PRIN)) to 29 (Posey Creek (POSE)). The issueLog.csv file documents the reason that 340 experiments did not successfully produce gas exchange estimates. These reasons are grouped into broad categories and summarized in Table S2. It is possible that some of the experiments that we removed could produce an estimate of k600 and K600 (e.g., if there was incomplete mixing at the first station, one could remove this station and only estimate k600 and K600 for Stations 2–4). However, this type of selective cleaning would have resulted in less comparable estimates (e.g., changing the length of the study reach), so we chose to include only the most comparable and high-quality experiments in this dataset.

The values for recommended estimates of k600 ranged from 0.1 to 142 m d−1. LeConte Creek (LECO) had the highest mean k600 (mean ± SD: 53±35 m d−1), while Pringle Creek (PRIN) had the lowest mean k600 (mean ± SD: 1.1±0.1 m d−1). Lower Hop Brook (HOPB) had the widest spread of k600 values, with estimates ranging almost 2 orders of magnitude (1.5–124 m d−1), while Pringle Creek (PRIN) had the smallest spread, with estimates only ranging from 0.9 to 1.2 m d−1. The values for recommended estimates of K600 range from 1.3 to 481 d−1. Like for k600, LeConte Creek (LECO) had the highest mean K600 (mean ± SD: 276±94 d−1), while Pringle Creek (PRIN) had the lowest mean K600 (mean ± SD: 7.0±1.9 d−1). Also, Lower Hop Brook (HOPB) had the widest spread of K600 values, with estimates ranging from 30 to 368 d−1, while Pringle Creek (PRIN) had the smallest spread, with estimates ranging from 5.3 to 9.3 d−1. These ranges, in part, reflect the various ranges of Q captured at each site (Table S3). HOPB was among the sites with the largest range of Q captured, while PRIN was among the sites with the smallest Q range captured (Table S3). Overall, this large compilation of k600 and K600 estimates indicates high variability both across and within sites.

Finally, to allow future users to scale k600 and K600 with Q, we include both the slope and intercept for the k600Q and K600Q relationships (Table 4) and the stanfit objects for the Bayesian models (“Other Entities” in the data release). The slope and intercept will allow future users a straightforward way to scale k600 or K600 as a function of Q at each site. The stanfit objects, on the other hand, will allow future users to sample from the posterior distribution of slope and intercept to assess uncertainty in the scaling relationships.

Table 4Coefficients and exponents for k600Q and K600Q relationships. Estimates and standard deviations (SD) are given.

Download Print Version | Download XLSX

4 Code and data availability

The reaRate R package is available at https://doi.org/10.5281/zenodo.12786089 (Cawley et al., 2024). The dataset of hydraulic parameters and gas exchange estimates is available from the Environmental Data Initiative: https://doi.org/10.6073/pasta/18dcc1871ee71cf0b69f2ee408 2839d0 (Aho et al., 2024).

5 Conclusions

Here, we present 339 estimates of gas exchange velocity (k600) and gas exchange rate (K600) from 22 NEON streams. To our knowledge, this dataset is the largest compilation of gas exchange estimates from standardized tracer-gas experiments (i.e., standardized methods across all experiments and sites) published to date. Given the wide geographic spread of NEON streams (e.g., spanning Puerto Rico to Alaska), this dataset will enable understanding of gas exchange across biomes. In addition, in our estimation process, we leverage new Bayesian multilevel models that allow estimates of gas exchange to be partially pooled by site according to Q; this modeling approach can be applied to future studies where repeat tracer-gas experiments are conducted to characterize gas exchange as a function of Q. Here, we also present relationships between discharge and gas exchange (i.e., k600Q and K600Q) from these models that will allow future users to scale k600 or K600 as a function of Q in NEON streams.

Regarding hydraulics, we present hydraulic geometries for 24 NEON streams. These geometries leverage field measurements of w and Q and estimate v and z. The coefficients and exponents from at-a-station hydraulic geometries are presented and can be used in the future, along with Continuous discharge (DP4.00130.001), to estimate w, v, and z at NEON streams. In sum, this large dataset could allow for quantification of both within- and across-reach variability in hydraulics and gas exchange in streams, which could be useful to modeling stream metabolism, greenhouse gas emissions, and other biogeochemical fluxes in NEON streams. In addition, this dataset may facilitate the development of new predictive models of gas exchange in small streams.

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/essd-16-5563-2024-supplement.

Author contributions

Conceptualization: KSA, KMC, RTH, ROH Jr., WKD, KJG. Methodology: KSA, KMC, RTH, ROH Jr. with support from all authors. Software: KSA, KMC, ROH Jr. Writing (original draft): KSA with support from all authors. Writing (review and editing): KSA, KMC, RTH, ROH Jr., WKD, KJG.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The National Ecological Observatory Network (NEON) is a programme sponsored by the National Science Foundation and operated under cooperative agreement by Battelle. This material is based in part upon work supported by the National Science Foundation through the NEON programme. The idea for this dataset initially came from meetings of the NEON Reaeration Technical Working Group.

Review statement

This paper was edited by Yuanzhi Yao and reviewed by Chunlin Song and Liwei Zhang.

References

Aho, K. S., Cawley, K., Hensley, R., Hall, R. O., Dodds, W., and Goodman, K.: Gas exchange velocities (k600), gas exchange rates (K600), and hydraulic geometries for streams and rivers derived from the NEON Reaeration field and lab collection data product (DP1.20190.001) ver 2, Environmental Data Initiative [data set], https://doi.org/10.6073/pasta/18dcc1871ee71cf0b69f2ee4082839d0, 2024. 

Appling, A. P., Hall, R. O., Yackulic, C. B., and Arroita, M.: Overcoming Equifinality: Leveraging Long Time Series for Stream Metabolism Estimation, J. Geophys. Res.-Biogeo., 123, 624–645, https://doi.org/10.1002/2017JG004140, 2018. 

Aristegi, L., Izagirre, O., and Elosegi, A.: Comparison of several methods to calculate reaeration in streams, and their effects on estimation of metabolism, Hydrobiologia, 635, 113–124, https://doi.org/10.1007/s10750-009-9904-8, 2009. 

Cawley, K., Aho, K. S., and Hall, R. O.: reaRate R package, NEONScience/NEON-reaeration: v0.0.2, Zenodo [code], https://doi.org/10.5281/zenodo.12786089, 2024. 

Churchill, M. A., Elmore, H. L., and Buckingham, R. A.: The Prediction of Stream Reaeration Rates, in: Advances in Water Pollution Research, Elsevier, 89–136, https://doi.org/10.1016/B978-1-4832-8391-3.50015-4, 1964. 

Cole, J. J. and Caraco, N. F.: Atmospheric exchange of carbon dioxide in a low-wind oligotrophic lake measured by the addition of SF6, Limnol. Oceanogr., 43, 647–656, https://doi.org/10.4319/lo.1998.43.4.0647, 1998. 

Dingman, S. L. and Afshari, S.: Field verification of analytical at-a-station hydraulic-geometry relations, J. Hydrol. (Amst), 564, 859–872, https://doi.org/10.1016/j.jhydrol.2018.07.020, 2018. 

Ferguson, R. I.: Hydraulics and hydraulic geometry, Prog. Phys. Geogr.-Earth and Environment, 10, 1–31, https://doi.org/10.1177/030913338601000101, 1986. 

Gabry, J., Veen, D., Team, S. D., Andreae, M., Betancourt, M., Carpenter, B., Gao, Y., Gelman, A., Goodrich, B., Lee, D., Song, D., and Trangucci, R.: shinystan: Interactive visual and numerical diagnostics and posterior analysis for Bayesian models, R package version 2.6.0, CRAN [code], https://CRAN.R-project.org/package=shinystan (last access: 8 May 2024), 2023. 

Genzoli, L. and Hall, R. O.: Shifts in Klamath River metabolism following a reservoir cyanobacterial bloom, Freshwater Sci., 35, 795–809, https://doi.org/10.1086/687752, 2016. 

Hall, R. O. and Hotchkiss, E. R.: Stream Metabolism, in: Methods in Stream Ecology, Academic Press, 219–233, 2017. 

Hall, R. O. and Ulseth, A. J.: Gas exchange in streams and rivers, WIREs Water, 7, 1–18, https://doi.org/10.1002/wat2.1391, 2020. 

Hall, R. O., Tank, J. L., Baker, M. A., Rosi-Marshall, E. J., and Hotchkiss, E. R.: Metabolism, Gas Exchange, and Carbon Spiraling in Rivers, Ecosystems, 19, 73–86, https://doi.org/10.1007/s10021-015-9918-1, 2016. 

Ho, D. T., Schlosser, P., and Orton, P. M.: On Factors Controlling Air-Water Gas Exchange in a Large Tidal River, Estuar. Coast., 34, 1103–1116, https://doi.org/10.1007/s12237-011-9396-4, 2011. 

Hornberger, G. M. and Kelly, M. G.: Atmospheric Reaeration in a River Using Productivity Analysis, J. Environ. Eng. Div., 101, 729–739, https://doi.org/10.1061/JEEGAV.0000398, 1975. 

Jähne, B., Münnich, K. O., Bösinger, R., Dutzi, A., Huber, W., and Libner, P.: On the parameters influencing air-water gas exchange, J. Geophys. Res.-Oceans, 92, 1937–1949, https://doi.org/10.1029/JC092iC02p01937, 1987. 

Leopold, L. B. and Maddock Jr., T.: The Hydraulic Geometry of Stream Channels and Some Physiographic Implications, Geological Survey Professional Paper 252, United States Government Printing Office, Washington, 1953. 

Liu, S., Kuhn, C., Amatulli, G., Aho, K., Butman, D. E., Allen, G. H., Lin, P., Pan, M., Yamazaki, D., Brinkerhoff, C., Gleason, C., Xia, X., and Raymond, P. A.: The importance of hydrology in routing terrestrial carbon to the atmosphere via global streams and rivers, P. Natl. Acad. Sci. USA, 119, 1–9, https://doi.org/10.1073/pnas.2106322119, 2022. 

Maurice, L., Rawlins, B. G., Farr, G., Bell, R., and Gooddy, D. C.: The Influence of Flow and Bed Slope on Gas Transfer in Steep Streams and Their Implications for Evasion of CO2, J. Geophys. Res.-Biogeo., 122, 2862–2875, https://doi.org/10.1002/2017JG004045, 2017. 

McDowell, M. J. and Johnson, M. S.: Gas Transfer Velocities Evaluated Using Carbon Dioxide as a Tracer Show High Streamflow to Be a Major Driver of Total CO2 Evasion Flux for a Headwater Stream, J. Geophys. Res.-Biogeo., 123, 2183–2197, https://doi.org/10.1029/2018JG004388, 2018. 

Morel, M., Booker, D. J., Gob, F., and Lamouroux, N.: Consistent Theoretical and Empirical Predictions of at-a-Station Hydraulic Geometry Exponents in Stream Reaches, Water Resour. Res., 56, 1–16, https://doi.org/10.1029/2020WR027242, 2020. 

NEON: Discharge field collection (DP1.20048.001) RELEASE-2023, NEON [data set], https://doi.org/10.48443/tys0-ze83, 2023a. 

NEON: Reaeration field and lab collection (DP.20190.001) RELEASE-2023, NEON [data set], https://doi.org/10.48443/bk29-6c91, 2023b. 

NEON: Discharge field collection (DP1.20048.001) RELEASE-2024, NEON [data set], https://doi.org/10.48443/3746-1981, 2024a. 

NEON: Reaeration field and lab collection (DP1.20190.001) RELEASE-2024, NEON [data set], https://doi.org/10.48443/4z25-4b94, 2024b. 

O'Connor, D. J. and Dobbins, W. E.: Mechanism of Reaeration in Natural Streams, T. Am. Soc. Civil Eng., 123, 641–666, https://doi.org/10.1061/TACEAT.0007609, 1958. 

Park, C. C.: World-wide variations in hydraulic geometry exponents of stream channels: An analysis and some observations, J. Hydrol. (Amst), 33, 133–146, https://doi.org/10.1016/0022-1694(77)90103-2, 1977. 

Rathbun, R. E.: Reaeration Coefficients of Streams – State-of-the-Art, J. Hydraul. Div., 103, 409–424, https://doi.org/10.1061/JYCEAJ.0004734, 1977. 

Raymond, P. A., Zappa, C. J., Butman, D., Bott, T. L., Potter, J., Mulholland, P., Laursen, A. E., McDowell, W. H., and Newbold, D.: Scaling the gas transfer velocity and hydraulic geometry in streams and small rivers, Limnol. Oceanogr.-Fluids and Environments, 2, 41–53, https://doi.org/10.1215/21573689-1597669, 2012. 

R Core Team: R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/, 2023. 

Rhodes, D. D.: The b-f-m diagram; graphical representation and interpretation of at-a-station hydraulic geometry, Am. J. Sci., 277, 73–96, https://doi.org/10.2475/ajs.277.1.73, 1977.  

Riley, A. J. and Dodds, W. K.: Whole-stream metabolism: Strategies for measuring and modeling diel trends of dissolved oxygen, Freshwater Sci., 32, 56–69, https://doi.org/10.1899/12-058.1, 2013. 

Rocher-Ros, G., Stanley, E. H., Loken, L. C., Casson, N. J., Raymond, P. A., Liu, S., Amatulli, G., and Sponseller, R. A.: Global methane emissions from rivers and streams, Nature, 621, 530–535, https://doi.org/10.1038/s41586-023-06344-6, 2023. 

Seybold, E. C., Bergstrom, A., Jones, C. N., Burgin, A. J., Zipper, S., Godsey, S. E., Dodds, W. K., Zimmer, M. A., Shanafield, M., Datry, T., Mazor, R. D., Messager, M. L., Olden, J. D., Ward, A., Yu, S., Kaiser, K. E., Shogren, A., and Walker, R. H.: How low can you go? Widespread challenges in measuring low stream discharge and a path forward, Limnol. Oceanogr. Lett., 8, 804–811, https://doi.org/10.1002/lol2.10356, 2023. 

Shanafield, M., Bourke, S. A., Zimmer, M. A., and Costigan, K. H.: An overview of the hydrology of non-perennial rivers and streams, Wiley Interdisciplinary Reviews: Water, 8, 1–25, https://doi.org/10.1002/wat2.1504, 2021. 

Stan Development Team: RStan: the R interface to Stan, R package version 2.26.22, https://mc-stan.org/ (last access: 8 May 2024), 2023. 

Ulseth, A. J., Hall, R. O., Boix Canadell, M., Madinger, H. L., Niayifar, A., and Battin, T. J.: Distinct air–water gas exchange regimes in low- and high-energy streams, Nat. Geosci., 12, 259–263, https://doi.org/10.1038/s41561-019-0324-8, 2019. 

Wanninkhof, R.: Relationship between wind speed and gas exchange over the ocean, J. Geophys. Res., 97, 7373–7382, https://doi.org/10.1029/92JC00188, 1992. 

Wanninkhof, R., Ledwell, J. R., and Broecker, W. S.: Gas exchange-wind speed relation measured with sulfur hexafluoride on a lake, Science, 227, 1224–1226, https://doi.org/10.1126/science.227.4691.1224, 1985. 

Zappa, C. J., McGillis, W. R., Raymond, P. A., Edson, J. B., Hintsa, E. J., Zemmelink, H. J., Dacey, J. W. H., and Ho, D. T.: Environmental turbulent mixing controls on air-water gas exchange in marine and aquatic systems, Geophys. Res. Lett., 34, L10601, https://doi.org/10.1029/2006GL028790, 2007. 

Download
Short summary
Gas exchange is fundamental to many biogeochemical processes in streams and depends on the degree of gas saturation and the gas transfer velocity (k). Currently, k is harder to measure than concentration. Here, we present a processing pipeline to estimate k from tracer-gas experiments conducted in 22 streams by the National Ecological Observatory Network. The processed dataset (n = 339) represents the largest compilation of standardized k estimates available.
Altmetrics
Final-revised paper
Preprint