Articles | Volume 17, issue 11
https://doi.org/10.5194/essd-17-6113-2025
https://doi.org/10.5194/essd-17-6113-2025
Data description paper
 | 
13 Nov 2025
Data description paper |  | 13 Nov 2025

A detailed streamflow and groundwater salinity dataset for Muttama Creek Catchment, NSW, Australia

R. Willem Vervoort, Floris F. van Ogtrop, Mina Tambrchi, Farzina Akter, Alexander J. V. Buzacott, Jason S. Lessels, James P. Moloney, Dipangkar Kundu, Feike A. Dijkstra, and Thomas F. A. Bishop
Abstract

Dryland salinity remains a major global natural resource management concern, and which is amplified in Australia. However, limited detailed space-time data sets with observations of stream and groundwater salinity has constrained a deep understanding of the range of processes that can lead to dryland salinity problems in landscapes. The aim of this study is to report on the open dataset (https://doi.org/10.25910/m0wp-8890, Vervoort et al.2025) resulting from a 14 year data collection effort in a subcatchment of the Murrumbidgee catchment in New South Wales, Australia. Over a 14 year period a series of different sampling campaigns has resulted in a large dataset with hydrogeochemical data which includes both in-situ (field) data and post laboratory analysis of major anions and cations. This data is augmented with observed groundwater levels and publicly available streamflow and climate data (https://doi.org/10.17605/OSF.IO/BEUWK, Vervoort and Akter2024). The data set covers 23 groundwater sample sites and 39 surface water sites. Because the data was collected by four distinct groups and over many years, we investigate whether this has caused a bias in the dataset. In addition, we show the major spatial and temporal trends to provide an overview of the dataset. The dataset is made open access to encourage further research and the current paper shows the richness of the collected data and opportunities for further research.

Share
1 Introduction

Dryland and irrigation salinity has long been a major natural resource management concern in Australia (Jolly et al.2001; White et al.2009; Scanlon et al.2007; Walker et al.2002; Finlayson et al.2010). Globally, the success of management of salinity, while extensively documented, has remained patchy (Leblanc et al.2012). As a result, in Australia, the volume of research and number of publications in this area has decreased significantly in recent years (Fig. 1). This is partly due to the effect of the millenium drought on groundwater levels and the consequent reduction in the appearance of salinity effects in the landscape (McFarlane et al.2016). However, the reduction in research is also because of the increased understanding that salinity processes are more complex than previously recognised. For example, salinity processes can vary substantially across the landscape (Conyers et al.2008) and the processes of salt delivery to the stream also varies in the landscape (Summerell et al.2006; Hughes et al.2007) and is dependent on landscape characteristics (van Dijk et al.2008; Dahlhaus et al.2010). As a result, the investment required to improve the understanding and increase the effectiveness of management is considerable and this has resulted in a reduction of the number of studies after the large investment in the late 1990s and 2000s in Australia.

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f01

Figure 1Number of papers on the Web of Science related to the search terms (Dryland Salinity) AND Australia, 1980–2022.

Download

Dryland salinity also remains a global problem (Thorslund and van Vliet2020; Stavi et al.2021; McFarlane et al.2016). In particular, the impact of salinity on freshwater systems such as wetlands is recognised as a serious threat (Canedo-Arguelles et al.2016). More importantly, in this case it is recognised that not only total salt concentration, using the often reported electrical conductivity (EC, such as in the global database from Thorslund and van Vliet2020), is of importance, but the actual different chemical species, such as types of cations, as they have different impacts on ecology (Canedo-Arguelles et al.2016).

The poor spatial and timescale distribution of water quality datasets has long been an obstacle to measuring trends in salinity. Studies such as Jolly et al. (2001) and White et al. (2009) used large historical datasets to detect broadscale trends over large periods throughout the Murray Darling Basin (MDB). This work identified that southern and eastern dryland regions in the Murray Darling Basin have rising salinity trends that were worse in areas of low rainfall (White et al.2009; Jolly et al.2001). However, the ion composition varied greatly throughout the Murray Darling Basin (MDB) (White et al.2009). More specifically, Conyers et al. (2008) tried to isolate which areas in the middle portion of the Murrumbidgee catchment acted as sources of salinity, as well as whether this was predominantly marine cyclic salts (NaCl) as previously assumed, or whether salts from mineral weathering were also involved (e.g. Ca, Mg, HCO3). While both are captured in the bulk measurement of electrical conductivity (EC), a rise in marine cyclic salts can be a major source of osmotic stress, whereas mineral weathering salts are far less harmful and are more likely to precipitate at reasonably low concentrations (Conyers et al.2008). The ratio of Cl:HCO3 ions was identified as the best indicator of the source of salinity, with Cl acting as a measure of marine cyclic salts and HCO3- acting as a measure of mineral weathering salts. The Muttama catchment was specifically identified as a candidate for future research as ion concentrations appeared to result in a change from east to west, correlating with the underlying geology and their mineral composition.

It is clear from these examples that detailed spatial and temporal datasets are key to understanding different hydrogeochemical processes in the landscape (e.g. Cartwright et al.2010; Dahlhaus et al.2010), but overall publicly available datasets on dryland salinity in Australia remain limited to detailed data from small experimental catchments (< 100 ha) (Summerell et al.2006; Crosbie et al.2007; Hughes et al.2007) or sparse government datasets from official monitoring (i.e. https://waterinsights.waternsw.com.au/, last access: 11 November 2025) which tend to be limited in hydrogeochemical data. Part of this is related to the sensitivity of the data given the relationship with possible land values. However, as the understanding of salinity occurrence grows, this argument is less valid. Making data more widely available would increase the opportunities for research and increase our understanding of dryland salinity processes.

Without regular and expensive automated sampling, field campaigns to collect water quality data tend to be “snapshot” activities (Grayson et al.1997; Breuer et al.2015; Lyon et al.2008; Cartwright et al.2010; Lintern et al.2018) which can be biased due to the over representation of low flow conditions (Lessels and Bishop2020). Even the analyses of substantial government databases (Lintern et al.2018) are likely to be biased in this way. This means that overall there are limited streamflow and groundwater salinity data sets that combine multiple locations across a significant time period and that combine a range of flow characteristics.

The aim of this paper is to present and describe the space time dimensions and basic relationships of a complex groundwater and surface water hydrogeochemistry dataset that was collected over a 14 year period in a 1000 km2 agricultural catchment in New South Wales, Australia. The Muttama catchment, which is the focus of this paper, provides a microcosm of groundwater and surface water salinity variability in Australia. Focusing on a medium size catchment in greater detail creates opportunities to test whether sources of salinity can be traced back to specific areas of land. The Muttama catchment is representative of flat semi-arid catchments globally, but especially of Australian catchments, where a significant amount of research has taken place at the catchment scale (e.g. Crosbie et al.2007; Hughes et al.2007, 2008; Summerell et al.2006). Unfortunately, a lot of the older data is not easily accessible and extractable. This paper attempts to correct this by providing an open dataset, which hopefully will also encourage other research teams to summarise and report open data. We believe that the data would be relevant for semi-arid areas in the US, Canada, Asia and South America (Thorslund and van Vliet2020; Stavi et al.2021).

This paper gives a description of the dataset to facilitate open access of the dataset, but does not analyse the physiochemical relationships in the data in detail. This will be analysed in follow-up papers and was partly analysed in an earlier thesis (Akter2018). The main aim of this paper is to make the data set accessible to other researchers to encourage further research in this catchment and in salinity in general.

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f02

Figure 2Muttama Catchment Sampling Locations with Elevation. Symbol colour indicates whether location was a groundwater or surface water source, with blue being surface water and coral red being groundwater. The numbers on the map represent the sample location number.

2 Methods

2.1 Muttama catchment

The 1000 km2 Muttama creek catchment (Fig. 2) is located in the Mid-Murrumbidgee catchment area of NSW in south eastern Australia. The landscape is undulating with elevation variations ranging from 227–719 m. Muttama creek flows north-south through the length of the catchment towards the Murrumbidgee River near Gundagai. The main township, Cootamundra is located in the upper half of the catchment. The dominant land use type of this catchment is about 93 % agriculture, dominated by winter-spring cropping and pasture. Mean annual rainfall (1891–2024) in the catchment is 654 mm for the longest running Bureau of Meteorology Landgrove station (station 073022), while potential evapotranspiration far exceeds this total.

Streamflow is measured continuously by WaterNSW, the state agency responsible for water data collection, at three locations in the catchment: Coolac, station no. 410044, the main downstream point, and Berthong, station no. 41000207 and Jindalee, station no. 410112 on two branches above the Cootamundra township. These data are available as open access via the https://waterinsights.waternsw.com.au/ (last access: 11 November 2025) or through the http://bom.gov.au/waterdata/ (last access: 11 November 2025). Here, we only use the data from the Coolac station as a comparison.

The depth to the nearest groundwater table varies across the Muttama catchment and it ranges from < 2 to 20 m below ground level (Department of Environment and Climate Change NSW2009). Deep groundwater in the catchment occurs mostly in fractured rock aquifers common on the eastern, and western fringes of the catchment. In contrast, shallow groundwater is associated with unconfined alluvial, colluvial, and eluvial aquifers. Some aquifers in the northern part of this catchment show artesian behavior (Webb1999; Akter2018).

Saline areas of the catchment tend to be associated with geological heterogeneity, primarily the sedimentary materials in the west and rhyolite on the northwest side (Conyers et al.2008). Overall, Muttama creek is a significant salt contributor to the downstream Murrumbidgee river with suggested contributions of around 58 % from cyclic sources and 42 % salts originating from mineral weathering (Conyers et al.2008).

2.2 Data collection

2.2.1 Data sources

The water quality dataset contains data from 4 main sampling sources related to four distinct groups of “people” doing the sample collection. The term “people” is used loosely, as it mainly related to four different types of sampling campaigns, which potentially had differences in the rigour of the sampling campaign (quality control, types of samples taken, training of the people taking the samples). These groups are designated as:

  • Source 1: Data from the PhD study by Akter (2018).

  • Source 2: Data from the sampling campaign of two former students, the PhD from Lessels (2014) and unpublished data from another student, Edward Milne.

  • Source 3: A dataset collected by undergraduate and postgraduate students as part of field trips in different units of study at the University of Sydney is identified as “Student data”. This data was sampled “ad-hoc” during the field trip period using standard sampling protocols as described for the data from Akter (2018).

  • Source 4: Data from several autosamplers installed in the catchment during the PhD from Lessels (2014). Because these samples were not taken by a “person” and were taken on a flow weighted basis, we separated the data from the “grab” samples in the other methods. These samples are also missing field measurements, as these were only analysed in the laboratory.

Overall, 1160 water samples were collected from 62 sample locations over the 2010–2024 period. However, not all sites were sampled at all times and not all samples were fully analysed for all hydrogeochemical variables. Both surface water and groundwater samples were collected at 23 groundwater sample sites and 39 surface water sites. These are distributed across the catchment, depending on standing water availability and access.

In addition to the water quality dataset, data from 23 groundwater data loggers is provided from the same groundwater sample sites as in the hydrogeochemical dataset.

2.2.2 Hydrogeochemical variables

The overall structure of the hydrogeochemical dataset consists of repeated measurements over time at multiple locations in Muttama catchment. For each location the name of the location and the spatial coordinates were recorded in decimal degrees (Longitude =x and Latitude =y) as well as whether the location was a groundwater or a surface water location. The names of the locations are fairly random and basic locality indicators, which cannot be interpreted exactly.

Table 1Variables measured in the field and laboratory.

Download Print Version | Download XLSX

The data for each location consist of up to 6 variables which were measured in the field (Table 1). These were complemented by laboratory analysis, which repeated some of the field measurements, and for additional major anion and cation variables (Ca, Mg, Na, K, Cl, SO4, HCO3), and total Nitrogen (N) and total Phosphorus (P) for part of the sample set. Some other variables were infrequently measured and are not included in the data set.

Table 2Different instruments used in the sampling.

Download Print Version | Download XLSX

The variables pH, EC, SPC (specific conductance: field temperature corrected EC), Temperature, and in some cases dissolved oxygen (DO) and Turbidity were measured using a range of field probes (Table 2). All field probes measured pH, EC, Temperature and calculated SPC. Early measurements (samples up to November 2014) used a YSI probe that included a turbidity and DO probe (YSI 6600 and YSI 600 for surface and groundwater, respectively). Later groundwater samples (After November 2014) used a different YSI probe (YSI ProDSS multi-parameter) that only included a DO probe, while the suface water sampling used a YSI EXO2 multiparameter sonde that included pH, EC, DO and Turbidity. Finally, (after mid 2019) surface water and groundwater sampling used a Xylem Exo probe with DO, pH, EC, Temperature and SPC. Some of the variability in the field measurements might be due to this variation in the field instrumentation.

Anions in most of the samples were analysed using the high-performance liquid chromatography method (Dionex P680 HPLC) and cations were measured on acidified samples using an Inductively Coupled Plasma Optical Emission Spectrometer (ICP-OES, Varian 720-ES) at the University of Sydney (Akter2018). Duplicate samples in the analysis had a reported relative percentage difference (RPD) lower than 5 % in part of the sample dataset (Akter2018). Some of the later samples and the student sample set (Source 3) were analysed by a commercial laboratory (https://www.alsglobal.com/en/locations/asia-pacific/pacific/australia/nsw/sydney-woodpark-environmental, last access: 11 November 2025). Alkalinity concentrations were generally measured in the field within 24 h of collection using a HACH digital titrator (model 16900) (Akter2018) up to 2017 and by the commercial laboratory after this.

The hydrogeochemical data is stored on the University of Sydney escholarship repository: https://doi.org/10.25910/m0wp-8890 (Vervoort et al.2025).

2.3 Continuous variables

The logger data, which early on collected groundwater pressure levels at 15 min and from mid-2016 at 2 h intervals, were adjusted for the length of the cable and the height of the standpipe above the ground level. They were subsequently summarised to raw daily values using an R script (SummariseDailyData.R), which is stored with the raw data in the Open Science Foundation (OSF) repository associated with this paper (https://doi.org/10.17605/OSF.IO/BEUWK, Vervoort and Akter2024).

The loggers in the field were uncalibrated. Due to logger failures, gaps occur in the daily data, followed by replacement of the faulty loggers. In some cases the cable length was adjusted and this was recorded in the field notes. Overall, this resulted in data with gaps and sometimes shifts in the recorded logger data.

Manual water level measurements were taken at each manual sampling date to allow calibration of the logger data. To correct the groundwater logger data, the daily data was matched to the observed data using linear regression, if more than 3 manual observed data were available and slope and intercept of the regression had a p-value < 0.1. If there were less than 3 manual observed data points for the specific logger an adjustment to the data was based on the difference between the average observed data and the average recorded water levels. Otherwise no adjustment was made. After the manual observations are made, the well is purged and the logger is temporarily removed from the well. This data (during the temporarily removal of the logger during the purging and subsequent recovery of the groundwater level) is removed from the logger data series. As a result, there is no direct time match between the logger data series and the manual observations. However, the data showed that, in most cases, the groundwater level recovered within 24 h. The pseudo code in the supplementary material describes the process in more detail.

The code used to match the manually observed data with the logger data is in the script Match_obs_logger_data.R, which is stored with the raw data in the Open Science Foundation (OSF) repository associated with this paper (https://doi.org/10.17605/OSF.IO/BEUWK, Vervoort and Akter2024).

After the automated process, two of the groundwater level data series still had substantial discrepancies in some sections of the data. This was most likely due to a lack of observed data for the specific logger. A final manual correction was applied. As this process is based on judgement of the data by the authors of this paper, we documented this in detail in the supplementary material.

The final corrected data that is published with this paper on https://doi.org/10.17605/OSF.IO/BEUWK (Vervoort and Akter2024) includes a column which describes whether the data is based on the automatic correction or a further manual correction.

2.4 Boxplots and maps

Using the most complete data, boxplots and spatial maps were generated to highlight the spatial and temporal variation in the data set. The boxplots visually highlight empirical differences in the data.

The mean concentration and interquartile range (25th–75th percentile) of the concentration data distributions were calculated to give an indication of variation of the data in the spatial maps.

All graphs and maps were produced using R version 4.3.1 (R Core Team2023). All code can be found in the associated https://github.com/WillemVervoort/MuttamaDataPaper (last access: 11 November 2025), as part of the markdown document for this paper.

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f03

Figure 3Top: Distribution of missing values for the different data sources and measurement types. Most of these missing values were because not all the variables were analysed for all the samples, see the explanation in the article text. Bottom: Percent missing values for the groundwater data across all piezometers, summarised by month.

Download

3 Results

3.1 Distribution of missing values

In the hydrogeochemistry data, data source 3 was the most complete in terms of variables analysed, because in this set more variables were analysed in the commercial lab (Fig. 3 top). Some of the variables analysed in the commercial lab were not analysed with the equipment at the University of Sydney. However, source 3 had the smallest number of overall samples. Source 2 has the most incomplete data points. Source 4 has a very consistent number of missing values, possibly because not all samples were analysed in the set. For source 2, the missing data suggests that for many of the samples only a few of the variables were measured and analysed as highlighted above. In the data from source 2, almost 50 % of samples are incomplete in terms of the measurement of all the variables in Table 2. Similarly, source 1 had more incomplete data, because some of the minor elements were not analysed. The field recorded variables pH, EC, SPC and Temperature were the most complete as they were generally measured directly in the field. Thus the distribution of the NA values in the overall data set is mostly a reflection of the time period of sampling and the change in methodology over the 14 years of sampling.

In the groundwater level time series, the missing data relate mostly to logger failures and the different times that wells were instrumented. Rather than giving a full breakdown by well location, the overall level of completeness of the series is displayed (Fig. 3 bottom).

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f04

Figure 4Top: Distribution of samples throughout the year (total number of samples collected during each month) with the blue line representing the average monthly rainfall during the study period. Bottom: Distribution of samples over the study period (total number of samples collected during each year).

Download

3.2 Temporal distribution of data

Overall the water quality sampling appears to have a reasonable distribution across all months (Fig. 4), therefore seasonal trends should be identifiable in the data. Average rainfall data (1995–2022) does not indicate any major seasonal trends, although there is a slight dominance of rainfall in the early Austral Spring (months 9 and 10, September and October). This also explains the higher number of samples because, more sampling trips were organised in this period, and spatially more channels could be sampled for surface water. The timing of source 3 (the student data) is the result of the yearly field trips, which tended to occur at approximately the same time of the year in early March and late September or early October, coinciding with the University semester breaks.

The number of samples collected in each year relative to the different sources changes throughout the years reflecting the duration of the different studies and funding cycles (Fig. 4); however the data is still well-distributed enough that overall trends should be clear. In addition, some of the sample volumes can be related to the occurrence of rainfall, as in drier years several of the channels would be dry and no sampling of surface water could occur.

Consistent groundwater sampling commenced later in the project, which means that there are very few groundwater samples before 2013. In contrast, the autosamplers were installed early in the project and there are no samples from this source after 2013.

Surface water samples were reasonably well distributed across the flow distribution, measured at the Coolac station (410044), with only a possible bias towards periods of medium flow. This is most likely since many of the upstream surface water sampling points are often completely dry during periods of low flow at Coolac, and therefore cannot be sampled. Conversely, there are no manual samples during high or very high flow as during flood situations sampling was dangerous and restricted by work health and safety considerations. The samples at high flow are all from our automated sampling.

Table 3Summary statistics for elements measured in the field. NA: not available.

Download Print Version | Download XLSX

3.2.1 Comparisons of the geochemistry of groundwater samples with surface water samples

The summary of the samples (Table 3) highlights the range of the data for the different variables. Obviously, surface water will record higher DO values, while groundwater recorded higher EC and SPC values. The rest of the variables have fairly similar ranges for both groundwater and surface water. Both total P and total N are low across the catchment samples, with only a few outliers related to specific locations and dates.

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f05

Figure 5Sample distribution on flow duration curve derived from flow data at the Coolac NSW government station.

Download

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f06

Figure 6Difference in pH and EC for groundwater and surface water samples. The source of the data (the sampling group) is indicated with colour.

Download

Groundwater samples have quite a distinctive hydrogeochemical signature compared to the surface water samples (Fig. 6). Since “Source 1” collected most of the groundwater samples, this results in differences between data collection sources. Field SPC measurements were used to represent EC since these samples had the fewest missing data. There also appears to be a slight bias towards lower EC values for sampling group 2, but this is likely because these samples were collected during two very wet years in 2010 and 2011 (Fig. 4) and are mostly associated with high flow values (Fig. 5).

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f07

Figure 7Overview of the corrected groundwater time series for all the wells. Different panels relate to the different locations highlighted in Fig. 2.

Download

3.2.2 Groundwater level data

The overall corrected groundwater timeseries shows the shorter time that loggers were installed in the wells (Fig. 7), associated with the PhD thesis from Akter (2018). It also indicates that the manual data can not always be fully matched with the logger data, but further corrections are likely to be speculation.

In general, shallow groundwater occurs between 1 and 5 m below the surface and is responsive to dry and wet periods. Some of the wells have positive pressures, resulting in occasional groundwater levels above the ground surface, such as at GW10, GW12, GW13 and GW21 (Figs. 2 and 7).

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f08

Figure 8Top row: Spatial variation of Cl-:HCO3- ratio throughout the catchment, highlighting mean and interquartile range (IQR) for each sampling location. Bottom row: Spatial Variation of EC throughout the catchment, using Mean EC in µS cm−1 and interquartile range (IQR) for each sampling location. Only locations with more than 10 observations over the 14 years are included. Surface water sample sites are in the left column, while groundwater sample sites are in the right column.

3.3 Spatial variation

There is clear spatial variation in water parameters throughout the catchment, including between groundwater and surface water sampling sites (Fig. 8). As examples, the spatial distributions for EC and Cl:HCO3 are shown for sample sites with more than 10 observations over the sampling period. Similar maps can be easily generated for other parameters using the code in the markdown document. In the map, the concentration is indicated by the size of the symbol, while the colour shading indicates the variability. This suggests surface water samples had lower variability and lower salt concentrations. In addition, samples on the North western side of the catchment had higher salt concentrations and higher Cl:HCO3 ratios, which is also associated with higher variance in the samples.

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f09

Figure 9Variation across time of the Cl:HCO3 ratio (left) and EC (right) by sampling location represented as a boxplot. Only locations with more than 10 observations over the 14 years are included. The blue dashed line (left) indicates a Cl:HCO3 ratio of 1, while the blue dashed line (right) is the drinking water limit of 800 µS cm−1.

Download

Below the maps, boxplots (Fig. 9) highlight the difference in the distributions between the surface water sample sites and the groundwater sample sites. For sites that have more than 10 observations, this highlights the difference between sample sites, reflected spatially on the maps. For example, it highlights that high EC sites also tended to have high Cl:HCO3 values, and conversely that low salinity groundwater tended to be associated with low Cl:HCO3 values. It also points out the single well (GW23) that has a very low EC. Previous studies have suggested there may be a difference in Cl:HCO3 ratio in surface water between the eastern and western parts of the Muttama catchment (Conyers et al.2008), and Fig. 8 suggest a similar pattern, with samples in the North and West being higher in EC and high in Cl:HCO3 values, while the Eastern and Southern areas have lower variability in the surface water samples and lower Cl:HCO3 values.

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f10

Figure 10Piper-plot of all groundwater and surface water data collected over the period with complete major anion data.

Download

3.4 Piper plot

The piper plot contains the data for all the samples with complete major anion and cation data, which are 500 samples. Of these, 229 are surface water samples and the rest groundwater samples. The piper plot of the data (Fig. 10) does not provide much clarity as the samples cover a large area across the ternary space. There is a slight shift towards the HCO3 and Ca/Mg type waters for the groundwater samples compared to the surface water samples, which are more Na dominated. There is also a small cluster of surface water samples that are more SO4 dominated, potentially indicating different geological origins as mentioned earlier in the paper. However, some of the groundwater samples are very high in Cl, explaining the high EC values observed for these samples.

4 Discussion

The dataset in this paper is unique in Australia. There are surface water and groundwater geochemistry datasets from experimental small catchments (i.e. Hughes et al.2007; Crosbie et al.2007; Summerell et al.2006), but not many of these are publicly or easily accessible. In contrast, there are data from very large state and national datasets (i.e. Jolly et al.2001; Thorslund and van Vliet2020), but there are limited publicly available data sets that cover similar substantial space and time scales, and the range of hydrogeochemistry covered here. This is particularly true in the case of shallow groundwater (< 20 m) which is the focus in this dataset.

https://essd.copernicus.org/articles/17/6113/2025/essd-17-6113-2025-f11

Figure 11Boxplots comparing the electrical conductivity for samples of the Australian database extracted from Thorslund and van Vliet (2020) and the database of samples from Muttama Catchment.

As a baseline comparison, we compared the EC data from the catchment samples with data from the global database from Thorslund and van Vliet (2020). We subset the global database by Australia, and restricted the groundwater data to shallow groundwater < 20 m from the surface (Fig. 11). The data from this figure are not included in the github due to the size of the global data set and because the original data is readily available. The figure clearly shows that the data collected in the Muttama catchment fall well within the overall distribution of comparable observed salinity values in Australia for both surface water and groundwater.

In addition, comparison with Table 3 and Fig. 5 in Hughes et al. (2007) clearly highlights the value of the Cl/HCO3 ratio in comparing values of EC, Cl, and HCO3. For example, in a catchment about 100 km north of Muttama catchment, Hughes et al. (2007) found a much lower mean of 77 mg L−1 for CaCO3 in runoff. This study found a higher mean of 334 mg L−1 (Table 3). However, Hughes et al. (2007) found a mean of 1056 mg L−1 for Cl, while this study found a much lower mean of 294 mg L−1, suggesting quite different ratios. Finally, Hughes et al. (2007) reported a mean EC of 3717 µS cm−1 in runoff, while our data has a mean of 1246 µS cm−1 (using the temperature corrected value). In other words, the EC values in the Muttama Creek catchment are more dominated by the alkalinity, resulting in lower EC values. However, similar to Muttama Creek catchment, Hughes et al. (2007) also indicates much higher alkalinity in the groundwater.

The Muttama catchment dataset presented here covers multiple sites, multiple time periods and multiple sampling campaigns. Despite the strengths of this dataset (i.e., the large spatial and temporal range and large range in hydrogeochemical parameters), there are also limitations. As the groundwater data clearly shows, the actual number of possible sample sites is limited by the existing and accessible groundwater wells. Over the 14 years of research, several groundwater wells were accidentally destroyed during farm operations, further reducing the sample opportunities. The number of surface water sampling sites in the catchment is limited by the ephemeral nature of the stream network, with some creeks not flowing for long periods. The overall sampling is therefore limited in scope, as is clearly shown in the spatial maps (Fig. 8).

The development of the dataset over many years regrettably does not allow a full uncertainty analysis. There is reasonable quality assessment of the laboratory analysis of the samples taken by Akter (2018) (see the appendices in Akter2018), there is less reporting of this for the other samples. An exception to this is the samples analysed by the commercial laboratory, where a strict quality protocol was followed. Some uncertainty can be gleaned from the mass balance closure of the major cations for the laboratory analysis. None of this provides an understanding of the uncertainty associated with the field measurements and the potential manual handling errors. Despite these potential sources of uncertainty, the analysis in this paper highlights the spatial and temporal consistency of the data, thus providing evidence of manageable uncertainty in overall dataset. Therefore, the dataset is valuable in the general space time information that it provides.

The dataset suggests implications for management of salinity in the Muttama catchment. The samples clearly indicate that the main source of high Cl salinity originates from the sediments and rocks on the northwestern side of the catchment. This is further complicated by the artesian nature of some of the wells in this area. However, as the water level data and Akter et al. (2021) has highlighted, the shallow groundwater levels and concentrations only partly respond to rainfall recharge. Investment and incentives to reduce recharge should focus on these areas, particularly to limit the movement of saline discharge into the creek (Akter2018).

The presented dataset provides significant opportunities for further research, particularly because of the length of the time series. For example, there is the opportunity to examine trends in salinity due to changes in climate. There are few datasets that cover shallow groundwater and concurrent surface water across a similar wide range of hydrogeochemistry. This opens up the opportunity to look at temporal variability in groundwater surface water connections, particularly for flat semi-arid systems similar to Muttama Catchment, as was done for a more limited set of data in Akter (2018).

The comprehensive nature of the data also creates opportunities for testing more complex hydrological and hydrogeological models. An example could be to extend the work by Deb et al. (2019) to look at variations in rainfall-runoff response during wet and dry periods, which for Muttama catchment was linked to groundwater surface water connections. Finally, given our intention to continue collecting data in the catchment, there is an opportunity to look at shifts in the hydrogeochemistry as a result of wet and dry periods.

5 Conclusions

Detailed datasets for medium to large scale catchments (greater or equal than 1000 km2) are underrepresented in salinity research in Australia and worldwide, with very few public datasets identified for Australia. This paper reports on a long term (14 year) hydrogeochemistry dataset from a single medium scale catchment (> 1000 km2) in NSW, Australia. This dataset includes a total of 1160 water samples from 62 locations within the catchment and includes groundwater and surface water sites. While the dataset was collected by different groups of people at different times and locations, it still provides a valuable long term and spatially diverse data set. The complete sample set covers a wide range of flow and wetness conditions. Clear differences were observed in pH, electroconductivity, and in ion ratios between groundwater and surface water samples. Spatial differences in water chemistry are also apparent, with our data reinforcing prior chemical gradients observed in the catchment. Though in this paper we only provide a limited analysis of the data, we anticipate the dataset can be used for research to gain better insight into the spatio-temporal evolution of hydrogeochemical processes from larger catchments and improve the understanding of catchment salinity.

6 Code and data availability
Supplement

The supplement related to this article is available online at https://doi.org/10.5194/essd-17-6113-2025-supplement.

Author contributions

TFAB initiated the sampling campaign. TFAB, FFvO, FAD and RWV conceptualised the overall study, and managed the project. RWV and MT wrote the draft paper and analysed results. FA, DK and JSL collected and analysed the majority of the samples. MT, AVJB, JPM, FA and DK analysed and managed the data. All authors participated in sampling, laboratory analysis and review of the paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

This dataset would not have been possible without the generous assistance and access to properties from the following land owners in the Muttama Creek area: R. Last, the Tozer family, M. Sullivan, P. McClintock, P. McGuire, S. Sharman, A. Hollihan, the managers at Brawlin Springs as part of the Romani Pastoral Company, and the managers and owners of Wavehill, in particular J. Litchfield. We would like to thank several generations of students in the units LWSC2002, ENVX3003 and ENSY5708, as well a multiple interns from French Institutions and Wageningen University for assisting with the sample collection.

Review statement

This paper was edited by Hao Shi and reviewed by Justin Hughes and three anonymous referees.

References

Akter, F.: Groundwater salinity and interaction with surface water near Cootamundra, NSW, Australia, PhD thesis, http://hdl.handle.net/2123/18578 (last access: 11 November 2025), 2018. a, b, c, d, e, f, g, h, i, j, k, l

Akter, F., Bishop, T. F. A., and Vervoort, R. W.: Space-time modelling of groundwater level and salinity, Science of the Total Environment, 776, 145865, https://doi.org/10.1016/j.scitotenv.2021.145865, 2021. a

Breuer, L., Hiery, N., Kraft, P., Bach, M., Aubert, A. H., and Frede, H.-G.: HydroCrowd: a citizen science snapshot to assess the spatial control of nitrogen solutes in surface waters, Scientific Reports, 5, 16503, https://doi.org/10.1038/srep16503, 2015. a

Canedo-Arguelles, M., Hawkins, C. P., Kefford, B. J., Schäfer, R. B., Dyack, B. J., Brucet, S., Buchwalter, D., Dunlop, J., Frör, O., Lazorchak, J., Coring, E., Fernandez, H. R., Goodfellow, W., Achem, A. L. G., Hatfield-Dodds, S., Karimov, B. K., Mensah, P., Olson, J. R., Piscart, C., Prat, N., Ponsá, S., Schulz, C.-J., and Timpano, A. J.: Saving freshwater from salts, Science, 351, 914–916, https://doi.org/10.1126/science.aad3488, 2016. a, b

Cartwright, I., Weaver, T. R., Simmons, C. T., Fifield, L. K., Lawrence, C. R., Chisari, R., and Varley, S.: Physical hydrogeology and environmental isotopes to constrain the age, origins, and stability of a low-salinity groundwater lens formed by periodic river recharge: Murray Basin, Australia, Journal of Hydrology, 380, 203–221, https://doi.org/10.1016/j.jhydrol.2009.11.001, 2010. a, b

Conyers, M. K., Hume, I., Summerell, G., Slinger, D., Mitchell, M., and Cawley, R.: The ionic composition of the streams of the mid-Murrumbidgee River: Implications for the management of downstream salinity, Agricultural Water Management, 95, 598–606, https://doi.org/10.1016/j.agwat.2008.01.007, 2008. a, b, c, d, e, f

Crosbie, R. S., Hughes, J. D., Friend, J., and Baldwin, B. J.: Monitoring the hydrological impact of land use change in a small agricultural catchment affected by dryland salinity in central NSW, Australia, Agricultural Water Management, 88, 43–53, https://doi.org/10.1016/j.agwat.2006.08.009, 2007. a, b, c

Dahlhaus, P., Evans, T., Nathan, E., Cox, J., and Simmons, C.: Groundwater-level response to land-use change and the implications for salinity management in the West Moorabool River catchment, Victoria, Australia, Hydrogeology Journal, https://doi.org/10.1007/s10040-010-0616-y, 2010. a, b

Deb, P., Kiem, A. S., and Willgoose, G.: Mechanisms influencing non-stationarity in rainfall-runoff relationships in southeast Australia, Journal of Hydrology, 571, 749–764, https://doi.org/10.1016/j.jhydrol.2019.02.025, 2019. a

Department of Environment and Climate Change NSW: Salinity Audit Upland catchments of the New South Wales Murray–Darling Basin, Report, Department of Environment and Climate Change NSW, 2009. a

Finlayson, J., Bathgate, A., Nordblom, T., Theiveyanathan, T., Farquharson, B., Crosbie, R., Mitchell, D., and Hoque, Z.: Balancing land use to manage river volume and salinity: Economic and hydrological consequences for the Little River catchment in Central West, New South Wales, Australia, Agricultural Systems, 103, 161–170, https://doi.org/10.1016/j.agsy.2009.12.003, 2010. a

Grayson, R. B., Gippel, C. J., Finlayson, B. L., and Hart, B. T.: Catchment-wide impacts on water quality: the use of 'snapshot' sampling during stable flow, Journal of Hydrology, 199, 121–134, https://doi.org/10.1016/S0022-1694(96)03275-1, 1997. a

Hughes, J. D., Khan, S., Crosbie, R., Helliwell, S., and Michalk, D. L.: Runoff and solute mobilization processes in a semiarid headwater catchment, Water Resources Research, 43, W09402, https://doi.org/10.1029/2006WR005465, 2007. a, b, c, d, e, f, g, h, i

Hughes, J. D., Crosbie, R. S., and van de Ven, R. J.: Salt mobilisation processes from a salinised catchment featuring a perennial stream, Journal of Hydrology, 362, 308–319, https://doi.org/10.1016/j.jhydrol.2008.09.001, 2008. a

Jolly, I., Williamson, D. R., Gilfedder, M., Walker, G., Morton, R., Robinson, G. H., Jones, H., Zhang, L., Dowling, T., Dyce, P., Nathan, R., Nandakumar, N., Clarke, R., and McNeill, V.: Historical stream salinity trends and catchment salt balances in the Murray-Darling Basin, Australia, Marine and Freshwater Research, 52, 53–63, 2001. a, b, c, d

Leblanc, M., Tweed, S., Van Dijk, A., and Timbal, B.: A review of historic and future hydrological changes in the Murray-Darling Basin, Global and Planetary Change, 80–81, 226–246, https://doi.org/10.1016/j.gloplacha.2011.10.012, 2012. a

Lessels, J. S.: The optimisation of water quality monitoring schemes, PhD thesis, http://hdl.handle.net/2123/11703 (last access: 11 November 2025), 2014. a, b

Lessels, J. S. and Bishop, T. F. A.: A post-event stratified random sampling scheme for monitoring event-based water quality using an automatic sampler, Journal of Hydrology, 580, 123393, https://doi.org/10.1016/j.jhydrol.2018.12.063, 2020. a

Lintern, A., Webb, J. A., Ryu, D., Liu, S., Waters, D., Leahy, P., Bende-Michl, U., and Western, A. W.: What Are the Key Catchment Characteristics Affecting Spatial Differences in Riverine Water Quality?, Water Resources Research, 54, 7252–7272, https://doi.org/10.1029/2017wr022172, 2018. a, b

Lyon, S. W., Seibert, J., Lembo, A. J., Steenhuis, T. S., and Walter, M. T.: Incorporating landscape characteristics in a distance metric for interpolating between observations of stream water chemistry, Hydrol. Earth Syst. Sci., 12, 1229–1239, https://doi.org/10.5194/hess-12-1229-2008, 2008. a

McFarlane, D. J., George, R. J., Barrett-Lennard, E. G., and Gilfedder, M.: Salinity in Dryland Agricultural Systems: Challenges and Opportunities, Springer International Publishing, Cham, https://doi.org/10.1007/978-3-319-47928-6_19, pp. 521–547, 2016. a, b

R Core Team: R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/ (last access: 11 November 2025), 2023. a

Scanlon, B. R., Jolly, I., Sophocleous, M., and Zhang, L.: Global impacts of conversions from natural to agricultural ecosystems on water resources: Quantity versus quality, Water Resources Research, 43, W03437, https://doi.org/10.1029/2006wr005486, 2007. a

Stavi, I., Thevs, N., and Priori, S.: Soil Salinity and Sodicity in Drylands: A Review of Causes, Effects, Monitoring, and Restoration Measures, Frontiers in Environmental Science, 9, https://doi.org/10.3389/fenvs.2021.712831, 2021. a, b

Summerell, G. K., Tuteja, N. K., Grayson, R. B., Hairsine, P. B., and Leaney, F.: Contrasting mechanisms of salt delivery to the stream from three different landforms in South Eastern Australia, Journal of Hydrology, 330, 681–697, 2006.  a, b, c, d

Thorslund, J. and van Vliet, M. T. H.: A global dataset of surface water and groundwater salinity measurements from 1980–2019, Scientific Data, 7, 231, https://doi.org/10.1038/s41597-020-0562-z, 2020. a, b, c, d, e

van Dijk, A. I. J. M., Gilfedder, M., and Austin, J.: Influences of climate, terrain and land cover on stream salinity in southeastern Australia, and implications for management through reforestation, Hydrological Processes, 22, 3275–3284, https://doi.org/10.1002/hyp.6968, 2008. a

Vervoort, R. and Akter, F.: Muttama Creek Catchment Groundwater data, Open Science Foundation [data set], https://doi.org/10.17605/OSF.IO/BEUWK, 2024. a, b, c, d, e

Vervoort, R., van Ogtrop, F., Tambrchi, M., Akter, F., Buzacott, A., Lessels, J., Moloney, J., Kundu, D., Dijkstra, F., and Bishop, T.: A detailed streamflow and groundwater salinity dataset for Muttama Creek Catchment, NSW, Australia, Sydney eScholarship Repository [data set], https://doi.org/10.25910/m0wp-8890, 2025. a, b, c

Walker, G., Zhang, L., Ellis, T., Hatton, T., and Peterham, C.: Estimating impacts of changed landuse on recharge: review of modelling and other approaches appropriate for management of dryland salinity, Hydrogeology Journal, 10, 68–90, 2002. a

Webb, L.: Hydrogeology Report for the Muttama Catchment, Report, NSW Department of Land and Water Conservation, 1999. a

White, I., Macdonald, B. C. T., Somerville, P. D., and Wasson, R.: Evaluation of salt sources and loads in the upland areas of the Murray–Darling Basin, Australia, Hydrological Processes, 23, 2485–2495, https://doi.org/10.1002/hyp.7355, 2009. a, b, c, d

Download
Short summary
A detailed water quality dataset for a 1000 km2 catchment in New South Wales, Australia is presented. The water quality data for surface and groundwater was collected over 14 years and contains more than 1000 records. Despite missing values and different sampling methods and groups, the data provides a consistent overview of the hydrogeochemistry of the catchment. This dataset is valuable for global modelling studies as well as better understanding of local dryland salinity and water quality processes.
Share
Altmetrics
Final-revised paper
Preprint