Riverine phosphorus gain and loss across the conterminous United States

Wang, Yiming; Zhang, Xuesong; Zhao, Kaiguang; Sabo, Robert D.; Miao, Yuxin; Clark, Christopher M.

doi:10.5194/essd-18-3355-2026

Articles | Volume 18, issue 5

https://doi.org/10.5194/essd-18-3355-2026

Articles | Volume 18, issue 5

Data description article

19 May 2026

Data description article |

| 19 May 2026

Riverine phosphorus gain and loss across the conterminous United States

Yiming Wang, Xuesong Zhang, Kaiguang Zhao, Robert D. Sabo, Yuxin Miao, and Christopher M. Clark

Abstract

Excess riverine phosphorus represents a preeminent catalyst for water quality degradation. Spatial mapping and characterization of the net gain and loss of riverine phosphorus help discern the critical source areas. Here, we developed a dataset encompassing phosphate (PO $_{4}^{3 -}$ ) and total phosphorus (TP) gain and loss across catchments in the conterminous United States (CONUS). We compiled 51 394 PO $_{4}^{3 -}$ and 285 675 TP concentration measurements and estimated PO $_{4}^{3 -}$ and TP loads at 963 and 2317 stations, respectively. Next, we leveraged the upstream-downstream topology information from the National Hydrography Dataset Plus (NHDPlus) catchment map at the Hydrologic Unit Catalogue-12 (HUC12) level to derive the net gain and loss of riverine phosphorus across catchments in the CONUS. Such maps can be used to estimate potential contributions of point and non-point sources to riverine phosphorus pollution at refined spatial scales, identify different major factors controlling local riverine P gain and loss compared to P loads, and evaluate watershed model's fidelity for representing riverine P cycling. The resultant dataset is provided in Excel (.xlsx) format, accessible at Figshare (https://doi.org/10.6084/m9.figshare.28509317, Wang et al., 2025b). Leveraging the HUC12 information for spatialization, the new datasets aim to address the existing gap in regional characterization of riverine phosphorus and support effective management practices across the CONUS.

Download & links

Article (PDF, 15777 KB)

Supplement (3790 KB)

Download & links

Article (15777 KB)
Full-text XML
Supplement (3790 KB)
BibTeX
EndNote

How to cite.

Received: 05 Dec 2025 – Discussion started: 12 Feb 2026 – Revised: 28 Apr 2026 – Accepted: 01 May 2026 – Published: 19 May 2026

1 Introduction

Eutrophication in inland waters and estuaries is a widespread water quality challenge across the globe, with significant economic cost (e.g., USD 1 billion in Europe and USD 2.2 billion annually in the United States (U.S.)) (Wurtsbaugh et al., 2019). Excess phosphorus (P) is a primary contributor to eutrophication in streams and rivers, especially in intensive agricultural regions (Brownlie et al., 2022; Royer et al., 2006). Riverine export of P is also a major contributor to oxygen-depleted dead zones in coastal waters, causing damage to underwater life (Diaz and Rosenberg, 2008). There is an urgent need for global actions to reduce P pollution for the environment and human health (UNEP, 2025).

Nonpoint or diffuse sources, particularly nutrients applied to agroecosystems, are often recognized as the primary source of water pollution (Carpenter et al., 1998). P surplus in agricultural soils due to unused fertilization and manure application can be transported to water bodies through surface runoff and groundwater pathways, and cause persistent water pollution (Stackpoole et al., 2019). The diffuse nature of nonpoint source pollution poses challenges for directly identifying and regulating critical source areas. Existing riverine P pollution databases mainly focus on certain agricultural areas (Ringeval et al., 2024). Given the considerable spatial variation in P inputs to rivers (Arheimer and Lidén, 2000; Stackpoole et al., 2019; Zhang et al., 2017), there is a lack of large-scale riverine P datasets at sufficient spatial scales across the conterminous United States (CONUS) to quantify and analyze riverine P gain and loss. Such datasets, in conjunction with other observed and modelled P data (e.g., point source discharge) help identify regions with high non-point source P inputs, thereby supporting more effective targeting of measures for P pollution control. In addition, the datasets can also be used to assess fidelity of distributed watershed models and understand key factors influencing local riverine P cycling.

In this study, we aim to develop new datasets for spatial characterization of riverine P gain and loss across the CONUS, to help identify critical source areas and improve prioritization and implementation of nutrient management activities. The subsequent sections of this paper include the method used to generate spatial riverine P gain and loss (Sect. 2), results detailing the P dataset (Sect. 3), discussion of influencing factors and potential uncertainties (Sect. 4), codes and data availability (Sect. 5), and conclusions (Sect. 6).

2 Materials and Methods

2.1 Overview

To estimate riverine P gain and loss data across the CONUS, we compiled streamflow and P concentration data (i.e., unfiltered phosphate (PO $_{4}^{3 -}$ )) from 963 monitoring stations and total phosphorus (TP) from 2317 stations, and calculated P loads (kgP yr⁻¹) at these stations using the Load Estimator (LOADEST) program (Runkel et al., 2004) (Fig. 1). Next, we estimated P gain and loss across the catchments measured by one downstream station and its immediate upstream stations using the upstream-downstream connectivity information contained in the National Hydrography Dataset Plus (NHDPlus) catchments (https://nhdplus.com/NHDPlus/NHDPlusV2_data.php, last access: 7 May 2026), resulting in 547 and 1225 unique Hydrologic Unit Catalogue (HUC) groups for PO $_{4}^{3 -}$ and TP, respectively. Each HUC group has a unique pair of downstream and upstream stations that allows us to calculate the gain and loss of riverine P (as illustrated in Fig. S1 in the Supplement). Note that headwater HUC groups only have one downstream station without upstream stations draining to them. Due to differences in data availability, the riverine PO $_{4}^{3 -}$ and TP gain and loss data cover 52 020 and 65 735 Hydrologic Unit Catalogue-12 (HUC12) catchments, respectively. Then we estimated potential contribution to P pollution from nonpoint sources by subtracting upstream P inputs and point source P inputs from the riverine P gain and loss in each catchment. Finally, we used land cover and climate data to evaluate major controls of riverine P gain and loss.

https://essd.copernicus.org/articles/18/3355/2026/essd-18-3355-2026-f01

Figure 1Overview of the generation of TP and PO $_{4}^{3 -}$ gain and loss data across the CONUS. A more detailed description and higher resolution figure about HUC group generation can be found in Sect. S2 and Fig. S1.

2.2 Study area

The CONUS (i.e., the lower 48 states of the U.S.) is located in North America from 46°20^′ to 98°34^′ W longitude and from 39°49^′ to 41°43^′ N latitude, covering an area of 8 080 464.3 km² with a north-to-south distance of approximately 2660 km. The terrain features higher elevations in the west and flatter areas in the east. Based on the Watershed Boundary Dataset (WBD; https://water.usgs.gov/GIS/huc.html, last access: 7 May 2026), the CONUS includes 18 major watersheds, encompassing several large rivers such as the Mississippi and Colorado Rivers.

2.3 Data compilation

We compiled 51 394 PO $_{4}^{3 -}$ (USGS parameter code 00650) concentration data from 963 stations (spanning from 1952 to 2022) and 285 675 TP (USGS parameter code 00665) observations from 2317 stations (spanning from 1958 to 2023) across the CONUS from the Water Quality Portal (Read et al., 2017). To ensure consistency with streamflow records, only records from the U.S. Geological Survey (USGS) National Water Information System (NWIS) data source were used in this study. For each P observation, we identified co-located monitoring stations (Wang et al., 2024b) and downloaded and processed daily streamflow data from the USGS NWIS. Only stations with both phosphorus concentration observations and corresponding daily streamflow records were retained. We calculated an average P concentration where there were multiple P concentration observations on the same day. Before calculating the P load at a station with the LOADEST model, we excluded stations with less than 12 measurements. Thus, we finally selected 547 stations for PO $_{4}^{3 -}$ and 1225 stations for TP. For TP, the “Point-Source Nutrient Loads to Streams of the Conterminous United States” dataset provides estimated annual total point-source inputs during 2012 at the HUC12 level (Skinner and Maupin, 2019). This allowed us to aggregate the total TP input to rivers for the catchments used to calculate riverine TP gain and loss. Note that the point source dataset does not contain PO $_{4}^{3 -}$ .

Land cover and climatic controls of riverine P gain and loss were also assessed. Land cover data were derived from the National Land Cover Database (NLCD; https://doi.org/10.5066/P94UXNTS, USGS, 2024), which provided long-term average information on various land cover types: barren land, crops, forest, hay, herbs, impervious surfaces, scrub, water, herbaceous wetlands, and woody wetlands (Homer et al., 2012). Climate data were sourced from the PRISM dataset (https://www.prism.oregonstate.edu/, last access: 7 May 2026), including annual average temperature and total precipitation (Page et al., 2021). Using upstream-downstream topology information, we calculated the total area of each land cover type from the headwater to the current catchment at the HUC12 scale, representing their cumulative impact. For climate data, the local climate within each HUC12 was used. The P surplus was accessed from the National Inventory of Phosphorus (NIP), which provides major inputs and outputs of reactive P at the HUC8 scale across the CONUS (Sabo et al., 2021).

To ensure consistency across datasets with different spatial resolutions, all analyses were anchored at the HUC12 group scale. Point-source inputs, originally reported at the HUC12 level, were aggregated to HUC12 groups to match the gain-loss estimates. For datasets available at coarser scales (e.g., HUC8 for NIP inputs and HUC4 for agricultural inputs), gain and loss were upscaled using area-weighted averaging. No downscaling was applied.

2.4 Riverine P gain and loss across catchments in the CONUS

Riverine P gain and loss was estimated by calculating the difference between P loads at a downstream monitoring station and the sum of P loads from its neighbouring upstream stations. For multiple USGS stations located in the same HUC12 catchment, we kept only one station on the mainstem of the river that is closest to the outlet of the HUC12 catchment, by comparing the drainage area of the gaging station and the HUC12 catchment in which it is located. For headwaters, since there are no upstream gages, the P load was used as the net riverine gain. The upstream-downstream topology relationship between the monitoring stations was derived from the HUC12 catchments from the watershed boundary dataset (WBD; https://water.usgs.gov/GIS/huc.html, last access: 7 May 2026). Such a method has been outlined and tested by Qiu et al. (2023) (Sect. S2). As explained above, we identified 547 and 1225 unique HUC groups for PO $_{4}^{3 -}$ and TP, respectively. Note that each HUC group includes multiple HUC12 polygons and these HUC12 catchments share the same gain and loss data.

For each HUC group, the balance of riverine P can be expressed as follows:

\begin{matrix} (1) & \begin{aligned} P load at downstream outlet = \\ (P loads from upstream inputs) + (P from point sources) \\ - (Riverine removal of  P from point sources) \\ + (P from non-point sources) \\ - (Riverine removal of P from non-point sources) \end{aligned} \end{matrix}

Because in-stream removal terms cannot be directly constrained and may vary across systems (e.g., about 12 % globally) (Maavara et al., 2015), we assume the values of these two terms to be negligible for the purpose of deriving a simplified, first-order estimate for nonpoint source inputs. Thus, riverine removal is not explicitly estimated in this study, and the calculated P gain and loss represent the net difference between downstream and upstream loads. Conceptually, this net difference reflects the combined effects of watershed P inputs (from both point and nonpoint sources) and in-stream processes (e.g., retention, transformation, and remobilization) occurring along the flow path, rather than a direct measure of any single process such as in-stream removal. Under this assumption, Eq. (1) reduces to:

\begin{matrix} (2) & \begin{aligned} (P from non-point sources) = P gain and loss \\ - (P from point sources) \end{aligned} \end{matrix}

where P gain and loss = (P load at downstream outlet) − (P loads from upstream inputs).

Accordingly, negative values of P gain and loss indicate net decreases in load along the flow path, which may reflect a combination of retention, transformation, or other processes, rather than being interpreted solely as riverine removal. This formulation neglects in-stream P removal, and therefore (P from nonpoint sources) = P gain and loss − (P from point sources) is a lower-end estimate of the nonpoint source contribution to riverine P for each HUC group. Since only TP from point sources is available, we derived nonpoint-source TP loads but not for PO $_{4}^{3 -}$ .

2.5 Evaluation of estimated riverine load

We evaluated the consistency of the PO $_{4}^{3 -}$ and TP loads against another independent dataset derived with the Weighted Regressions on Time, Discharge, and Season (WRTDS) model (Hirsch et al., 2010; Zhang and Hirsch, 2019). First, we compared multi-year average TP loads from 151 monitoring stations. We also evaluated the estimated unfiltered PO $_{4}^{3 -}$ loads, which assess the mass of reactive P susceptible to being released in the water column under various redox conditions. Furthermore, the reliability of the upstream-downstream connectivity information is important for deriving the drainage area of HUC groups that are controlled by pairs of upstream and downstream stations. Here we used a quality-checked and corrected NHDPlus HUC12 catchment map (Wang et al., 2024a) that has been verified for reliably deriving the drainage area of each USGS station as compared to the USGS GAGES-II reported values (Falcone and Survey, 2011). These efforts helped ensure the quality of the riverine P gain and loss data developed in this study. In addition, we evaluated long-term trends in riverine P loads using the Sen's slope estimator in combination with the Mann–Kendall test (Hamed and Rao, 1998). To ensure robust trend detection, only monitoring stations with more than 30 years of load estimates were included. This resulted in 405 TP stations and 53 PO $_{4}^{3 -}$ stations used for trend analysis. The Sen's slope and corresponding p-values are provided in the dataset.

2.6 Analysis of environmental controls

Recent studies reveal that shifts in land use, agricultural practices, and climatic conditions have introduced a pervasive increase in soluble P concentrations across many different watersheds (Houser and Richardson, 2010; Singh et al., 2023). To assess the spatial factors influencing riverine P gain and loss, we employed random forest modelling to evaluate the relative importance of multiple environmental variables (Breiman, 2001). These factors were categorized into three groups: climatic factors, land cover types, and additional influences such as cumulative agricultural inputs and upstream loads. Given that the NIP dataset is only available at the HUC8 scale and some HUC groups are larger than the HUC8 catchment areas, we calculated the cumulative agricultural inputs at the HUC4 scale. In more detail, we used the ranger package, optimizing the model structure with the caret package in R. Key tuning parameters included the number of variables to use in each split (mtry), the number of trees (n_trees) and the minimum size of data points before splitting a tree (min_n). The tuning process was performed by doing a grid search for mtry (2–6) and min_n (10–20), then a second search was performed to find the optimal n_trees parameter (500–3000). To minimize random effects, the model was run 10 times, and we calculated the average importance value and harmonic mean p-value (Wilson, 2019).

3 Results

3.1 Riverine phosphorus data

We created two datasets, “Riverine PO $_{4}^{3 -}$ ” and “Riverine TP”, that encapsulate estimated multi-year average riverine gain and loss and loads, as well as point source and nonpoint source contributions for each HUC group for PO $_{4}^{3 -}$ and TP, respectively (Table 1). Complementing this information, the datasets encompass the location (i.e., longitude and latitude) of the outlet of the HUC group, the area of the HUC group, the count of observations used to calculate P loads, and commencement and termination years of observed data, to facilitate user-defined subsetting of the datasets. Additional information regarding the regression model is also included, such as the form of the regression model selected by LOADEST and the associated coefficient of determination (r²) values.

Across the board, the average r² values for the best-fit model (Table S1) across all sites are 0.76 for PO $_{4}^{3 -}$ loads and 0.83 for TP loads. Generally, in the load regression, the r² values for TP estimation outperformed those for PO $_{4}^{3 -}$ estimation, particularly in the Mid-Atlantic region (Fig. S2). It is noteworthy, however, that certain monitoring stations exhibited low r² values due to the limited availability of paired P concentration with streamflow data for regression.

Table 1Data records in the “Riverine PO $_{4}^{3 -}$ ” and “Riverine TP” datasets.

Note: ^* only for TP as only point source TP inputs are available.

Download Print Version | Download XLSX

Our TP load estimates are highly consistent with the WRTDS model with high r² and low root mean square error (RMSE) (Fig. 2). The minor disparities observed between these two datasets are likely attributable to variations in temporal coverage and different regression equations. For PO $_{4}^{3 -}$ , we found that only 11 stations with WRTDS estimates matched the stations used here, and all 11 stations are located in small watersheds. Therefore, we leveraged filtered PO $_{4}^{3 -}$ loads estimated by WRTDS to assess if the LOADEST estimated unfiltered PO $_{4}^{3 -}$ loads. Unfiltered PO $_{4}^{3 -}$ measures both the dissolved PO $_{4}^{3 -}$ as well as PO $_{4}^{3 -}$ compounds bound to suspended sediments and organic materials, and thus will have higher load compared to filtered PO $_{4}^{3 -}$ measurements (Fig. S3). Nonetheless, the high correlation indicates our estimates of unfiltered PO $_{4}^{3 -}$ are reasonable.

https://essd.copernicus.org/articles/18/3355/2026/essd-18-3355-2026-f02

Figure 2Comparison of riverine TP loads between our estimates and previous WRTDS estimated values at 151 monitoring stations. Note that the riverine TP loads vary greatly at different locations and over time, with most sites being below 10 000 MgP yr⁻¹.

Download

Spatial patterns of PO $_{4}^{3 -}$ and TP loads from each HUC group across the CONUS are shown in Fig. 3. Additionally, the location of the station at the outlet of each HUC group and the loads of streams in which it is located are shown in Fig. S4. The datasets encompass 547 stations/HUC groups for PO $_{4}^{3 -}$ and 1225 stations/HUC groups for TP, covering 4 894 464 and 6 118 360 km² PO $_{4}^{3 -}$ and TP, respectively. At the HUC2 scale, the derived gain and loss estimates cover 21 % to 99.9 % of basin area for TP and 7 % to 96.5 % for PO $_{4}^{3 -}$ , with 72 % and 44 % of HUC2 basins exceeding 50 % coverage, respectively (Fig. S5 and Table S2). The difference in spatial coverage is mainly due to the abundance of TP compared to PO $_{4}^{3 -}$ . Stations with high P loads are predominantly situated in the Midwest or proximate to megacities, with a general pattern of higher P loads observed in the eastern U.S. PO $_{4}^{3 -}$ loads range from 111 to 31 671 885 kgP yr⁻¹ and TP loads range from 235 to 336 223 136 kgP yr⁻¹. Median loads are 76 202 and 108 305 kgP yr⁻¹ and average loads are 436 311 and 1 012 363 kgP yr⁻¹, for PO $_{4}^{3 -}$ and TP, respectively.

We further examined temporal trends in riverine P loads at stations with long-term records (> 30 years). A substantial fraction of stations exhibited statistically significant trends, with 46 % of TP stations and 64 % of PO $_{4}^{3 -}$ stations showing significant changes. Decreasing trends were more prevalent, accounting for 72 % and 58 % of TP and PO $_{4}^{3 -}$ stations, respectively. Stations with increasing TP trends were primarily located in the Mississippi River Basin (Fig. S6), suggesting regional differences in nutrient dynamics.

https://essd.copernicus.org/articles/18/3355/2026/essd-18-3355-2026-f03

Figure 3Riverine (a) PO $_{4}^{3 -}$ and (b) TP loads from HUC groups across the CONUS. The boundary lines show the Hydrologic Unit Catalogue 2-digit (HUC2) watersheds. Grey areas indicate regions with no data. For visualization purposes, the logarithm was used here.

The spatial distribution of riverine P gain and loss is shown in Fig. 4. Both PO $_{4}^{3 -}$ and TP gain and loss exhibit similar spatial patterns over the CONUS, with most areas exhibiting riverine P gains. The area-weighted average PO $_{4}^{3 -}$ gain stands at 25.39 kgP km⁻² yr⁻¹, and the TP gain is 33.68 kgP km⁻² yr⁻¹. Median PO $_{4}^{3 -}$ and TP gains are lower than averages, standing at 16.75 and 33.57 kgP km⁻² yr⁻¹, respectively. At the HUC group scale, the highest area-weighted PO $_{4}^{3 -}$ gain was identified in the Upper Mississippi Region (UMR), amounting to about 113.96 kgP km⁻² yr⁻¹. The highest TP gain reached 186.55 kgP km⁻² yr⁻¹ in the Tennessee Region (TN). Notably, widespread regions in the Midwest exhibit heightened P gains (Fig. S7), particularly in terms of PO $_{4}^{3 -}$ , suggesting a discernible impact of human activities (e.g., agricultural fertilization). At the HUC2 level, the lowest area-weighted PO $_{4}^{3 -}$ gain (1.72 kgP km⁻² yr⁻¹) was found in the Rio Grande Region (RG), and the lowest TP gain (0.62 kgP km⁻² yr⁻¹) was found in the Upper Colorado Region (UCR). Refined examination at the HUC group level showed that, over the CONUS, 392 778 and 1 468 973 km² areas exhibited riverine PO $_{4}^{3 -}$ and TP losses, respectively.

https://essd.copernicus.org/articles/18/3355/2026/essd-18-3355-2026-f04

Figure 4The spatial distribution of the gain and loss in riverine (a) PO $_{4}^{3 -}$ and (b) TP over the CONUS. The boundary lines show the Hydrologic Unit Catalogue 2-digit (HUC2) watersheds. Grey areas indicate regions with no data.

3.2 Point and nonpoint source contributions

We also mapped the spatial point source and nonpoint source inputs of TP, as shown in Fig. 5. The nonpoint source contributions are estimated based on Eq. (2), which provides a lower-end estimate given that the riverine removal of point and nonpoint source P as shown in Eq. (2) was not considered. Point and nonpoint source contributions to riverine TP pollution exhibited large differences in both the magnitude and spatial distribution. Over the CONUS, the area-averaged point source input of TP is 5.44 kgP km⁻² yr⁻¹. By subtracting point source inputs from the calculated TP gain and loss, we obtained an area-averaged nonpoint source TP contribution of 28.24 kgP km⁻² yr⁻¹. Regions characterized by high TP gain with minimal point source pollution were observed in the Midwest. Notably, in most of the agriculturally intensive Missouri and Tennessee-Ohio river basins, total nonpoint source discharge significantly surpassed point source contributions (Fig. S8). Upon the exclusion of point source contributions (Fig. 5b), there is a substantial change, with the areas with riverine TP losses expanding to 1 603 258 km², most of them in the Missouri and Arkansas-White-Red River Basins. In general, most watersheds with negative nonpoint sources are concentrated in the western U.S. This does not mean that the nonpoint source P inputs are negative, but indicates that riverine processes likely removed a large fraction of point and nonpoint source P.

https://essd.copernicus.org/articles/18/3355/2026/essd-18-3355-2026-f05

Figure 5The spatial distribution of (a) nonpoint source and (b) point source contributions to riverine P pollution over the CONUS. The boundary lines show the Hydrologic Unit Catalogue 2-digit (HUC2) watersheds. Grey areas indicate regions with no data.

3.3 Factors influencing TP

We employed a random forest model to assess the influence of climate, land use, human activities, and catchment characteristics on riverine TP gain and loss and TP loads (Fig. 6). Note that the calculated P loads represent the outcome of the entire upstream catchment processes, while the riverine P gain and loss data represent both upstream catchment processes (e.g., P inputs from upstream) and local catchment properties (e.g., climate and land use in a HUC group). Such differences lead to the use of different sets of influencing factors (Fig. 6). The land use, climate and point source factors were calculated for the entire upstream area draining to a monitoring station for TP load analysis. In contrast, those factors were averaged over a HUC group for riverine gain and loss analysis. Additionally, for the analysis of riverine P gain and loss data, we included upstream P inputs. Analysis results indicate that upstream input is the sole statistically significant factor affecting TP gain and loss, with climate and land cover showing no notable impact. Conversely, TP loads are predominantly influenced by climatic factors, alongside significant contributions from point source discharges and urban land use.

https://essd.copernicus.org/articles/18/3355/2026/essd-18-3355-2026-f06

Figure 6The importance of factors influencing (a) TP gain and loss and (b) TP loads. Asterisks indicate significance at a level of 0.05.

Download

4 Discussion

4.1 Important contributions from nonpoint sources to riverine P pollution

The estimated area-averaged nonpoint source TP contribution (28.24 kgP km⁻² yr⁻¹) represents a reduction of 16.2 % from the calculated TP gain and loss that includes contributions from both point and nonpoint sources (33.68 kgP km⁻² yr⁻¹). Given that the TP inputs from point and nonpoint sources are often subject to riverine removal (Withers and Jarvie, 2008), the estimated nonpoint source TP based on Eq. (1) should be augmented by the amount of total TP inputs (including both nonpoint and point source) removed through riverine processes. Therefore, the calculated nonpoint source inputs of TP represent an underestimate of the contributions from nonpoint sources. If we assume a 12 % removal rate for TP inputs (Maavara et al., 2015), then the nonpoint source inputs of TP would increase from 28.24 to 32.28 kgP km⁻² yr⁻¹. Collectively, the results show that the nonpoint sources likely contribute more than 84 % of riverine TP pollution.

4.2 Implications for analysing environmental controls of riverine P

Climatic factors were key drivers of TP loads at the outlet of a watershed, which in general aligns with findings from previous studies (Sabo et al., 2023) and underscores the role of climate in nutrient transport dynamics. Our environmental control analysis using the gain and loss data showed that upstream inputs are leading control of local riverine gain and loss (Fig. 6a), in addition to local inputs of P from point and nonpoint sources and local riverine processes, such as in-stream retention through mechanisms such as P absorption by periphyton via photosynthesis and hydrological processes like reduced streamflow and sedimentation (Dodds, 2003; Withers and Jarvie, 2008). Notably, although accumulated agricultural P inputs (i.e., livestock waste and agricultural fertilizer) positively influenced TP gain and loss (Fig. S9), they were not included in this analysis due to mismatch of spatial scales. In general, using TP loads and riverine P gain and loss can lead to pronounced differences in the analysis of importance of environmental controls. Although climate conditions (i.e., precipitation and temperature) are the major controls of TP loads, which represent the integration of the entire watershed conditions, while riverine P gain and loss indicate that the amount of upstream P inputs entering a local catchment is an important factor influencing the riverine processing of P.

Both the differences and the analysis using TP loads and riverine gain and loss data revealed the importance of urban land and agricultural management. For example, both analyses show that irrigation can influence riverine TP, indicating that improving irrigation efficiency and technology holds potential to reduce TP inputs from cropland fertilization (Xia et al., 2020). Though we didn't assess factors influencing PO $_{4}^{3 -}$ due to the lack of point source PO $_{4}^{3 -}$ data, TP hotspots are expected to occur further downstream than PO $_{4}^{3 -}$ hotspots (Fig. 3). This is probably because rivers typically can retain (e.g., periphyton assimilation, adsorption onto suspended or bed sediment) a considerable proportion of incoming soluble-reactive P (e.g., PO $_{4}^{3 -}$ ) within the upper network, whereas particulate P continue to transport to downstream (Jarvie et al., 2012; Robertson and Saad, 2019; Royer et al., 2006). Overall, the intricate interplay between climate and land use factors underscores the complex nature of P dynamics in riverine systems. These newly developed riverine gain and loss datasets help improve understanding of local controls of riverine P dynamics and identify hotspots of changes in riverine P.

4.3 Limitations and contribution

While the newly developed datasets leverage upstream-downstream topology information at the HUC12 level to help increase the spatial resolution of riverine P gain and loss data, it is essential to acknowledge limitations relevant to understanding and quantifying P cycles and identifying sources of P inputs. First, the accuracy of load estimation via LOADEST is contingent upon the availability of paired P concentration data from monitoring stations. Stations with limited observations may introduce higher uncertainties in load estimations. The robustness of our datasets is partially reflected in the number of observations available. The average and median numbers of PO $_{4}^{3 -}$ observations per site is 54 and 29, respectively; for TP, the average and median numbers of observations per site is 134 and 90, respectively. In addition, the average center year of TP observations is 1992, with a median of 1991, while for PO $_{4}^{3 -}$ , the overall observation time is relatively early, with an average of 1981 and a median of 1980. The use of AIC to choose the most parsimonious regression model (average r² of 0.76 and 0.83 for PO $_{4}^{3 -}$ and TP, respectively) helped reduce uncertainties in load estimates.

Additionally, we calculated the P loads by averaging over different time periods with available data for each monitoring station. The mismatch between observational periods of upstream and downstream stations could introduce uncertainties, given that the available data cover various time periods for different stations. For example, streamflow discharge, which is important for calculating nutrient loads, can vary from year to year. Here, we assumed that multi-year average estimates of P loads are representative of the long-term pattern at a monitoring station. This may not hold for some upstream and downstream stations covering time periods that do not overlap with each other. Therefore, we provided the number and period of observations and model performance information in the datasets, which can help users to refine the calculation of riverine P gain and loss by further screening the P loads data at the monitoring stations. Note that available stations with observed P concentration and streamflow data are relatively sparse in the western vs. eastern U.S., particularly for PO $_{4}^{3 -}$ . This led to large gaps in the spatial coverage of the datasets (Fig. 4). Increasing the number of stations with P observations holds the potential to enhance the accuracy of riverine P estimates in the future.

It is also worth noting that the calculated contribution of TP from nonpoint sources represents a conservative estimate, given the unknown rate of TP removal from point and nonpoint sources. Although previous studies showed that TP removal rates were generally small, they could vary substantially across regions, as evidenced by the areas with riverine P loss (Fig. 4). It is reasonable to assume that the estimated TP contribution from nonpoint sources is greater than 84 % over the CONUS; however, the local TP contribution from nonpoint sources could be much lower, particularly in regions with high point source inputs. Also, the removal rates of P from point and nonpoint sources are likely different due to differences in the quality of P inputs (e.g., biodegradability and adsorption and desorption to sediments) and flow pathways (Wang et al., 2025a). Therefore, caution should be taken when interpreting the local contribution to P pollution from nonpoint sources.

Despite these challenges, our datasets make a unique contribution to the quantification and analysis of riverine P load, gain and loss, and sources across the CONUS. They can support the evaluation and diagnosis of large-scale dynamic watershed models, the examination of environmental controls on riverine P loads, and the estimation of contributions to P gain and loss. For instance, models such as SPARROW could incorporate the spatial patterns of riverine P gain and loss to better constrain nutrient sources and in-stream processing across river networks. The insights derived from our datasets contribute to a more comprehensive understanding of P dynamics under long-term and multi-decadal conditions, providing a foundation for improved water quality management on local, regional, and national scales.

5 Code and data availability

All codes for validating and visualizing PO $_{4}^{3 -}$ and TP gain and loss from the load estimations were run in R version 4.3.1 and are archived with the datasets presented in the paper at https://doi.org/10.6084/m9.figshare.28509317 (Wang et al., 2025b).

6 Conclusions

In this study, we estimated riverine loads of PO $_{4}^{3 -}$ and TP and derived their gain and loss across the CONUS, leveraging the upstream-downstream hydrological connectivity information contained in the NHDPlus catchment map. On average, rivers across the CONUS gain TP at a rate of 33.68 kgP km⁻² yr⁻¹, with notable hotspots in the Midwest. Due to the limitations of data availability, the precision of estimated P gain and loss data could be influenced by the number and periods of observations available at upstream and downstream stations. We provided additional information regarding the number of observations available, temporal coverage of data, the regression model used, and the model's statistical performance, so that users can further subset the datasets to meet certain specific criteria.

The riverine P gain and loss datasets allow the estimation of riverine P removal or accrual at a refined spatial resolution to better reflect the impacts of local controls. In contrast, riverine P loads at monitoring stations embody the integrated processes from the entire area upstream of a specific station. Also, by combining point source inputs with the riverine P gain and loss datasets, we derived conservative estimates of the long-term average contribution of nonpoint sources to riverine TP (28.24 kgP km⁻² yr⁻¹). The control factor analysis with a random forest model demonstrated that upstream inputs had the greatest influence on the local riverine P gain and loss, while climatic factors dominated riverine P loads at monitoring stations. This suggests that nutrient management practices that prioritize enhancing irrigation efficiency and integrating strategies such as targeted fertilizer application and wetland restoration may more effectively capture and reduce phosphorus mobilization from agricultural lands. The newly developed riverine P datasets in this study extend utility to diverse applications, encompassing, but not limited to, the evaluation of watershed models, identification of critical source areas, and optimization of agricultural management strategies. Future studies may concentrate on filling gaps in the spatial and temporal coverage of the datasets (particularly for PO $_{4}^{3 -}$ ).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/essd-18-3355-2026-supplement.

Author contributions

Y.W.: Conceptualization, methodology, investigation, formal analysis, data curation, visualization, and writing – original draft. X.Z.: Conceptualization, methodology, investigation, writing – original draft, writing – review and editing, supervision, project administration, and funding acquisition. K.Z.: methodology, investigation, writing – review and editing. R.D.S.: investigation, writing – review and editing. Y.M.: visualization, writing – review and editing. C.M.C.: writing – review and editing.

Competing interests

At least one of the (co-)authors is a member of the editorial board of Earth System Science Data. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

The views expressed are those of the authors, and do not necessarily represent the views or policies of the U.S. Environmental Protection Agency or any other Federal agency. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government. The mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the funding agencies.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We thank Sarah Stackpoole, Qian Zhang, and Kate Schofield for constructive comments and editorial suggestions on earlier versions of the manuscript.

Financial support

This research is in part supported by the U.S. Department of Agriculture – Agricultural Research Service and the National Aeronautics and Space Administration (NASA 22-CMS22-0027).

Review statement

This paper was edited by Giulio G. R. Iovine and reviewed by two anonymous referees.

References

Arheimer, B. and Lidén, R.: Nitrogen and phosphorus concentrations from agricultural catchments – influence of spatial and temporal variables, J. Hydrol., 227, 140–159, https://doi.org/10.1016/S0022-1694(99)00177-8, 2000.

Breiman, L.: Random Forests, Mach. Learn., 45, 5–32, https://doi.org/10.1023/A:1010933404324, 2001.

Brownlie, W. J., Sutton, M. A., Heal, K. V, Reay, D. S., and Spears, B.: Our phosphorus future: towards global phosphorus sustainability, UK Centre for Ecology & Hydrology, 371 pp., 2022.

Carpenter, S. R., Caraco, N. F., Correll, D. L., Howarth, R. W., Sharpley, A. N., and Smith, V. H.: Nonpoint pollution of surface waters with phosphorus and nitrogen, Ecol. Appl., 8, 559–568, https://doi.org/10.2307/2641247, 1998.

Diaz, R. J. and Rosenberg, R.: Spreading Dead Zones and Consequences for Marine Ecosystems, Science, 321, 926–929, https://doi.org/10.1126/science.1156401, 2008.

Dodds, W. K.: The role of periphyton in phosphorus retention in shallow freshwater aquatic systems, J. Phycol., 39, 840–849, https://doi.org/10.1046/j.1529-8817.2003.02081.x, 2003.

Falcone, J. A. and USG Survey: GAGES-II: Geospatial Attributes of Gages for Evaluating Streamflow, Reston, VA, https://doi.org/10.3133/70046617, 2011.

Hamed, K. H. and Rao, A. R.: A modified Mann-Kendall trend test for autocorrelated data, J. Hydrol., 204, 182–196, 1998.

Hirsch, R. M., Moyer, D. L., and Archfield, S. A.: Weighted Regressions on Time, Discharge, and Season (WRTDS), with an Application to Chesapeake Bay River Inputs ¹, JAWRA J. Am. Water Resour. Assoc., 46, 857–880, https://doi.org/10.1111/j.1752-1688.2010.00482.x, 2010.

Homer, C. G., Fry, J. A., and Barnes, C. A.: The national land cover database, U.S. Geological Survey, https://doi.org/10.3133/fs20123020, 2012.

Houser, J. N. and Richardson, W. B.: Nitrogen and phosphorus in the Upper Mississippi River: Transport, processing, and effects on the river ecosystem, Hydrobiologia, 640, 71–88, https://doi.org/10.1007/s10750-009-0067-4, 2010.

Jarvie, H. P., Sharpley, A. N., Scott, J. T., Haggard, B. E., Bowes, M. J., and Massey, L. B.: Within-River Phosphorus Retention: Accounting for a Missing Piece in the Watershed Phosphorus Puzzle, Environ. Sci. Technol., 46, 13284–13292, https://doi.org/10.1021/es303562y, 2012.

Maavara, T., Parsons, C. T., Ridenour, C., Stojanovic, S., Dürr, H. H., Powley, H. R., and Van Cappellen, P.: Global phosphorus retention by river damming, P. Natl. Acad. Sci. USA, 112, 15603–15608, https://doi.org/10.1073/pnas.1511797112, 2015.

Page, M. J., Moher, D., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., Mcdonald, S., Mcguinness, L. A., Stewart, L. A., Thomas, J., Tricco, A. C., Welch, V. A., Whiting, P., and Mckenzie, J. E.: PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews, BMJ, 372, https://doi.org/10.1136/BMJ.N160, 2021.

Qiu, H., Zhang, X., Yang, A., Wickland, K. P., Stets, E. G., and Chen, M.: Watershed carbon yield derived from gauge observations and river network connectivity in the United States, Sci. Data, 10, 1–13, https://doi.org/10.1038/s41597-023-02162-7, 2023.

Read, E. K., Carr, L., De Cicco, L., Dugan, H. A., Hanson, P. C., Hart, J. A., Kreft, J., Read, J. S., and Winslow, L. A.: Water quality data for national-scale aquatic research: The Water Quality Portal, Water Resour. Res., 53, 1735–1745, https://doi.org/10.1002/2016WR019993, 2017.

Ringeval, B., Demay, J., Goll, D. S., He, X., Wang, Y. P., Hou, E., Matej, S., Erb, K. H., Wang, R., Augusto, L., Lun, F., Nesme, T., Borrelli, P., Helfenstein, J., McDowell, R. W., Pletnyakov, P., and Pellerin, S.: A global dataset on phosphorus in agricultural soils, Sci. Data, 11, 1–34, https://doi.org/10.1038/s41597-023-02751-6, 2024.

Robertson, D. M. and Saad, D. A.: Spatially referenced models of streamflow and nitrogen, phosphorus, and suspended-sediment loads in streams of the midwestern United States, Scientific Investigations Report 2019-5114, U.S. Geological Survey, https://doi.org/10.3133/sir20195114, 2019.

Royer, T. V., David, M. B., and Gentry, L. E.: Timing of Riverine Export of Nitrate and Phosphorus from Agricultural Watersheds in Illinois: Implications for Reducing Nutrient Loading to the Mississippi River, Environ. Sci. Technol., 40, 4126–4131, https://doi.org/10.1021/es052573n, 2006.

Runkel, R. L., Crawford, C. G., and Cohn, T. A.: Load Estimator (LOADEST): A FORTRAN program for estimating constituent loads in streams and rivers, USGS Numbered Series, https://doi.org/10.3133/tm4A5, 2004.

Sabo, R. D., Clark, C. M., Gibbs, D. A., Metson, G. S., Todd, M. J., LeDuc, S. D., Greiner, D., Fry, M. M., Polinsky, R., Yang, Q., Tian, H., and Compton, J. E.: Phosphorus Inventory for the Conterminous United States (2002–2012), J. Geophys. Res.-Biogeo., 126, https://doi.org/10.1029/2020JG005684, 2021.

Sabo, R. D., Pickard, B., Lin, J., Washington, B., Clark, C. M., Compton, J. E., Pennino, M., Bierwagen, B., LeDuc, S. D., Carleton, J. N., Weber, M., Fry, M., Hill, R., Paulsen, S., Herlihy, A., and Stoddard, J. L.: Comparing drivers of spatial variability in US lake and stream phosphorus concentrations, J. Geophys. Res.-Biogeo., 128, e2022JG007227, https://doi.org/10.1029/2022JG007227, 2023.

Singh, N. K., Van Meter, K. J., and Basu, N. B.: Widespread increases in soluble phosphorus concentrations in streams across the transboundary Great Lakes Basin, Nat. Geosci., 16, 893–900, https://doi.org/10.1038/s41561-023-01257-5, 2023.

Skinner, K. D. and Maupin, M. A.: Point-source nutrient loads to streams of the conterminous United States, 2012, Data Series, https://doi.org/10.3133/DS1101, 2019.

Stackpoole, S. M., Stets, E. G., and Sprague, L. A.: Variable impacts of contemporary versus legacy agricultural phosphorus on US river water quality, P. Natl. Acad. Sci. USA, 116, 20562–20567, https://doi.org/10.1073/PNAS.1903226116, 2019.

UNEP: Understanding phosphorus: global challenges and solutions, https://www.unep.org/news-and-stories/story/what-phosphorus-and-why-are-concerns-mounting-about-its-environmental-impact, last access: 26 February 2025.

U.S. Geological Survey (USGS): Annual NLCD Collection 1 Science Products (ver. 1.1, June 2025), U.S. Geological Survey data release, https://doi.org/10.5066/P94UXNTS, 2024.

Wang, F., Li, S., Yan, W., Yu, Q., Tian, S., Yan, J., Zhou, D., and Shao, Y.: Dependence of riverine total phosphorus retention and fluxes on hydrology and river size at river network scale, J. Hydrol., 652, 132676, https://doi.org/10.1016/j.jhydrol.2025.132676, 2025a.

Wang, Y., Zhang, X., and Zhao, K.: A dataset of riverine nitrogen yield across watersheds in the Conterminous United States, Sci. Data, 11, 1–8, https://doi.org/10.1038/s41597-024-03552-1, 2024a.

Wang, Y., Zhang, X., Zhao, K., and Singh, D.: Streamflow in the United States: Characteristics, trends, regime shifts, and extremes, Sci. Data, 11, 788, https://doi.org/10.1038/s41597-024-03618-0, 2024b.

Wang, Y., Zhang, X., Zhao, K., Sabo, R. D., Miao, Y., and Clark, C. M.: Riverine Phosphorus Gain and Loss across the conterminous United States, figshare [data set], https://doi.org/10.6084/m9.figshare.28509317, 2025b.

Wilson, D. J.: The harmonic mean p-value for combining dependent tests, P. Natl. Acad. Sci. USA, 116, 1195–1200, https://doi.org/10.1073/pnas.1814092116, 2019.

Withers, P. J. A. and Jarvie, H. P.: Delivery and cycling of phosphorus in rivers: A review, Sci. Total Environ., 400, 379–395, https://doi.org/10.1016/j.scitotenv.2008.08.002, 2008.

Wurtsbaugh, W. A., Paerl, H. W., and Dodds, W. K.: Nutrients, eutrophication and harmful algal blooms along the freshwater to marine continuum, WIREs Water, 6, https://doi.org/10.1002/wat2.1373, 2019.

Xia, Y., Zhang, M., Tsang, D. C. W., Geng, N., Lu, D., Zhu, L., Igalavithana, A. D., Dissanayake, P. D., Rinklebe, J., Yang, X., and Ok, Y. S.: Recent advances in control technologies for non-point source pollution with nitrogen and phosphorous from agricultural runoff: current practices and future prospects, Appl. Biol. Chem., 63, 8, https://doi.org/10.1186/s13765-020-0493-6, 2020.

Zhang, Q. and Hirsch, R. M.: River Water-Quality Concentration and Flux Estimation Can be Improved by Accounting for Serial Correlation Through an Autoregressive Model, Water Resour. Res., 55, 9705–9723, https://doi.org/10.1029/2019WR025338, 2019.

Zhang, W., Jin, X., Liu, D., Lang, C., and Shan, B.: Temporal and spatial variation of nitrogen and phosphorus and eutrophication assessment for a typical arid river – Fuyang River in northern China, J. Environ. Sci., 55, 41–48, https://doi.org/10.1016/J.JES.2016.07.004, 2017.

Articles

Download

Article (15777 KB)
Full-text XML

Short summary

Excess riverine phosphorus is a worldwide threat to water quality. Leveraging 51 394 phosphate (PO₄³⁻) and 285 675 total phosphorus (TP) data points and hydrological topology information, we developed a new dataset mapping PO₄³⁻ and TP gain/loss across catchments in the conterminous United States. The new net phosphorus gain/loss maps allow us to assess sources, drivers, and model fidelity, thereby supporting future research and management of phosphorus contamination.