Articles | Volume 13, issue 3
Data description paper
19 Mar 2021
Data description paper |  | 19 Mar 2021

Landsat-derived bathymetry of lakes on the Arctic Coastal Plain of northern Alaska

Claire E. Simpson, Christopher D. Arp, Yongwei Sheng, Mark L. Carroll, Benjamin M. Jones, and Laurence C. Smith

The Pleistocene sand sea on the Arctic Coastal Plain (ACP) of northern Alaska is underlain by an ancient sand dune field, a geological feature that affects regional lake characteristics. Many of these lakes, which cover approximately 20 % of the Pleistocene sand sea, are relatively deep (up to 25 m). In addition to the natural importance of ACP sand sea lakes for water storage, energy balance, and ecological habitat, the need for winter water for industrial development and exploration activities makes lakes in this region a valuable resource. However, ACP sand sea lakes have received little prior study. Here, we collect in situ bathymetric data to test 12 model variants for predicting sand sea lake depth based on analysis of Landsat-8 Operational Land Imager (OLI) images. Lake depth gradients were measured at 17 lakes in midsummer 2017 using a Humminbird 798ci HD SI Combo automatic sonar system. The field-measured data points were compared to red–green–blue (RGB) bands of a Landsat-8 OLI image acquired on 8 August 2016 to select and calibrate the most accurate spectral-depth model for each study lake and map bathymetry. Exponential functions using a simple band ratio (with bands selected based on lake turbidity and bed substrate) yielded the most successful model variants. For each lake, the most accurate model explained 81.8 % of the variation in depth, on average. Modeled lake bathymetries were integrated with remotely sensed lake surface area to quantify lake water storage volumes, which ranged from 1.056×10-3 to 57.416×10-3 km3. Due to variations in depth maxima, substrate, and turbidity between lakes, a regional model is currently infeasible, rendering necessary the acquisition of additional in situ data with which to develop a regional model solution. Estimating lake water volumes using remote sensing will facilitate better management of expanding development activities and serve as a baseline by which to evaluate future responses to ongoing and rapid climate change in the Arctic. All sonar depth data and modeled lake bathymetry rasters can be freely accessed at (Simpson and Arp, 2018) and (Simpson, 2019), respectively.

1 Introduction

The Arctic Coastal Plain (ACP) of Alaska is distinguished by the presence of thousands of lakes, many of which are the product of thermokarst processes (Hopkins, 1949). Thermokarst is the melting of ice in permafrost, resulting in thaw settlement and land surface subsidence (van Everdingen, 1998); such activity may lead to the development of thermokarst lakes (Hopkins, 1949; Jorgenson and Shur, 2007). While thermokarst lakes on the ACP typically reach maximum depths between 1 and 3 m (Hinkel et al., 2012), an anomalous group of lakes on the ACP approach depths up to approximately 25 m.

We collected depth measurements and mapped bathymetry at a group of deep lakes located on the Pleistocene sand sea (Fig. 1), a distinctive region of the ACP named for its foundational Pleistocene-aged sand sheet and sand dunes (Carter, 1981; Williams, 1983; Williams et al., 1978). Located west of the Colville River, this region spans approximately 15 000 km2 and contains over 16 000 lakes (Jorgenson et al., 2014). The underlying dune field impacts the regional lithology and lake morphology. Lakes here are nestled between the crests of sand dunes and display a form distinct from that of lakes across the rest of Alaska's North Slope (Hinkel et al., 2005; Jorgenson and Shur, 2007). Deep central basins and wide, shallow littoral shelves surrounded by bluffs distinguish sand sea lakes from lakes that have formed in ice-rich permafrost terrain. Studies by Livingstone (1954), Rex (2019), Carson and Hussey (1962), and Carson (1968) assert that the bluffs around lakes erode by winds which carry sand from the bluff faces into the lakes, forming characteristic sandy littoral shelves. These shelves only reach depths of up to 3 m, whereas the central basins of such lakes can reach depths over 7 times that. Due to this striking depth contrast, the distinction between littoral shelves and central basins is apparent in satellite imagery of most lakes in the area (given low-wind and ice-free conditions). Understanding the geological context and morphology of sand sea lakes is important when interpreting their spectral signatures in remotely sensed imagery.

Figure 1The lake-rich area of interest on Alaska's Arctic Coastal Plain (ACP) southeast of Utqiaġvik (Barrow). The imagery used in our models is a Landsat-8 tile (Path 077, Row 011) acquired on 5 August 2016. The Pleistocene sand sea, a geologically unique region of the ACP, is delineated based on a classification of eolian sand by Jorgenson et al. (2014). Landsat-8 image is courtesy of the US Geological Survey.

We present a dataset to help fill the gap concerning lake depth – particularly deep lake depth – measurements in Arctic regions. By leveraging the in situ dataset to tune linear spectral-depth models at individual lakes, we produce lake-wide bathymetry maps and integrate these modeled depths across each lake to quantify water volumes. Finally, we assess spectral-depth similarity in lakes across the sand sea to evaluate the prospects of regional water volume modeling. Bathymetry measurements and associated estimates of water volume such as those provided in our datasets are important when evaluating aquatic habitats, conducting industrial activities that require local freshwater supplies (e.g., ice road construction), and understanding regional water and energy balance. Compared with lakes in surrounding regions of the ACP, sand sea lakes tend to be deeper and thus less likely to freeze to the bottom during the winter. Their notable depth means that sand sea lakes tend to have lower evaporative losses and are more likely to have basins characterized by floating (rather than bedfast) ice in the winter (Arp et al., 2015; Engram et al., 2018). These unfrozen lake basins provide crucial overwintering habitat for fish and other aquatic life (Jones et al., 2009; Sibley et al., 2008). Furthermore, liquid water is essential for industry during winter, primarily for ice road construction but also for ice airstrip and ice pad construction, exploratory oil-well drilling, and withdrawal of water for drillers' and researchers' in-camp use (Jones et al., 2009). Unfrozen winter lakes can also store more heat, affecting the regional energy balance (Jeffries et al., 1999). Therefore, depth and volume quantifications of deep sand sea lakes can help monitor fish habitat and direct locations of water extraction for wintertime infrastructure and consumption for other purposes.

Previous studies have evaluated water depth and bathymetry of lakes in nearby regions using various methods but are limited either to shallow lakes or by coarse depth resolution (e.g., Hinkel et al., 2012; Jeffries et al., 1996; Jones et al., 2017; Kozlenko and Jeffries, 2000; Sellman et al., 1975). Such limitations make deep lake depth and volumetric estimation unfeasible. For example, Jeffries et al. (1996) used satellite imagery and radar data to determine which lakes in regions near Utqiaġvik (Barrow) and Atqasuk, Alaska (including lakes in this paper's study area), froze to the bottom during the winter, extrapolating from their results a classification of lakes as being less than or greater than 2.2 m deep. When used in concert with an ice-growth model, this provided a proxy for coarse lake volume estimation but was limited to shallow lakes. Hinkel et al. (2012) measured in situ bathymetry for 28 lakes. However, the maximum lake depth of this study on the inner ACP was 2.3–5.2 m. Thus, our dataset is unique in its consideration of deep lakes. Furthermore, while optical remote-sensing-based retrieval of bathymetry (applied to create our bathymetry maps) is a well-documented approach (e.g., Clark et al., 1987; Hodúl et al., 2018; Pacheco et al., 2015; Pope et al., 2016; Yunus et al., 2019), in part due to limited data acquisition, such methods have historically been challenging to apply in our study area. One of the model variants we employ was successfully used to extrapolate bathymetry in tropical and subtropical coastal marine environments (Jagalingam et al., 2015; Stumpf et al., 2003); however, to our best knowledge, the model has never been applied to high-Arctic lakes. Volumetric estimates with the resolution provided here (30 m horizontal, 0.03 m vertical) have never been attempted for Pleistocene sand sea lakes, and the method of depth derivation used in this paper has not been employed in the Arctic.

2 Data and methods

2.1 Depth data acquisition

Depth points were sampled across 19 lakes during a field expedition between 22 and 27 July 2017. The method of data collection required landing on each target lake in a float plane. A Humminbird 798ci HD SI Combo automatic sonar unit was attached to the back of a float and sampled depth as the plane taxied or drifted across the lake. Depth points were each measured discretely as part of a depth-gradient transect and were sampled at a frequency of one point per second with an accuracy of 0.03 m (due to intrinsic machine error). The number of points collected per lake is specified in Table 1.

Table 1Sampling specifications for each study lake. The number of sample points and measured depth range were calculated after the points were processed for quality assurance (e.g., anomalous depth pixels removed) but before resampling to the single point per pixel dataset.

Download Print Version | Download XLSX

Lakes were targeted that were large enough for a float plane to land on in windy conditions (i.e., >∼1 km2 lake surface area) and that showed the presence of a distinct littoral shelf and a deep basin in 2.5 m color-infrared aerial photography (US Geological Survey digital orthophoto quadrangles, DOQs). A single straight transect line was mapped across each target lake prior to field visits to encompass a wide depth range; however, due to windy conditions, such lines were not always followed (Fig. 2). Nevertheless, in all but two lakes, a depth range from the littoral shelf to the deep central basin was captured (Table 1). It should be noted that, as transects were comprised of individual points whose relationship to one another was unimportant to the modeling, the direction, angle, and other qualities of the transect are significantly less important than the range of depths captured.

Figure 2Transects were measured across 17 study lakes. Although the transects follow irregular paths (due in part to wind conditions and sonar error), all but two of the transects capture a range of depths from the deep central basins to the shallow outer shelves. These are the full transects before resampling to a single point per pixel. Where the form of a transect is unclear, inset maps are provided. Landsat-8 image is courtesy of the US Geological Survey.

2.2 Depth data processing

Depth data points from 17 of the 19 sampled lakes were compiled into a single file to facilitate initial processing, with the lake IDs maintained in the database for lake-specific analysis. Two lakes where sampling occurred contained an insufficient number of measurements to justify modeling their bathymetry (models produced for these two lakes would have been strongly overfitted). The dataset was then filtered to 13 735 depth points; for each transect collected with the Humminbird sonar, discrete points were evaluated relative to the depths of their neighbors, and anomalous and zero-depth points were manually removed from the dataset. This step mitigated sonar errors and improved the smoothness of the bathymetric profiles that were generated from each transect. Subsequently, depths collected at the margins of two lakes at the Pik Dunes (70.234 N, 153.183 W) were removed from the dataset after manual inspection due to their anomalous spectral signatures. The unique, white color of the sandy substrate at this group of lakes and the extreme shallow nature of the littoral shelves ( 0.5 m deep) produce a spectral signature near the margins of lakes in the Pik Dunes area that is easily confused with that of the surrounding land and thus should not be used to analyze lake depth. These Pik Dunes depth points represent outliers and had they been included, our models would have had to reconcile associating strikingly different spectral values with similar depth values. This likely would have decreased overall model performance with the only potential benefit of modeling a limited number of marginal pixels more accurately.

2.3 Landsat image selection

Landsat-8 Operational Land Imager (OLI) imagery was chosen for comparison with measured depth data due to its large swath, 30 m spatial resolution, and quality (as assured by US Geological Survey preprocessing). A cloud-free Landsat image (LC08_L1TP_077011_ 20160805_ 20170222_01_TI) was selected that both covered the study area and was acquired on 5 August 2016, which is to say, at a similar time of year to that of field data collection from the following year (suitable imagery was not available for 2017). The late summer was chosen to provide data for a time when lakes are at an intermediate level, which is to say, lakes are free of ice but have not yet reached their lake level minimums (determined when evaporation exceeds precipitation; Jones et al., 2009). It should be noted that water volume varies seasonally and interannually in accordance with precipitation of the preceding 12 months, and therefore the estimated depth data may not be representative of the lake levels year round or from year to year. Nevertheless, these variations in lake level are relatively small with surface area changes often around 0.6 % of total surface area (Jones et al., 2009). Furthermore, of these area changes, the majority of change occurs at the shallow littoral shelves and therefore results in little volume change (Jones et al., 2009).

As no ice-free, cloud-free Landsat images exist that cover all study lakes for late summer 2017, we selected a Landsat image from 2016 in order to maximize the number of lakes included for which field data exist, i.e., the number of lakes for which we could model volume. One potentially promising Landsat image exists that covers our study area; however, (a) it was acquired at the end of June just after ice-out when the lake levels are at a seasonal high, and (b) slight cloudiness over some study lakes produced models that predicted depths up to 48 % less accurately. The use of 2016 imagery is further justified as the interannual depth and volume changes are smaller than our error metrics. When considering one representative lake (located at 70.147 N, 151.765 W), a continuous depth logger recorded a depth difference of 0.03 m (or 1 % of the annual average depth at that point) between the imagery acquisition date (5 August 2016) and the time of data collection (26 July 2017). This represents a smaller depth difference than the 0.05 m difference measured between 30 June 2017 and 26 July 2017. The maximum observed depth change at this location between 1 January 2016 and 26 July 2017 was on the order of 1 m. The observation of an imagery time series of a different group of lakes that are typically highly responsive to water level changes (located at 70.539 N, 152.733 W) similarly revealed lake level conditions to be more comparable between 5 August 2016 and 21–27 July 2017 than between these latter dates and 1 July 2017. Overlaying lake surface area changes on an airborne-lidar-derived digital surface model showed a change in water level of  0.10 m between 5 August 2016 and 1 July 2017, indicating a depth change well within our error margins (Alaska North Slope LiDAR Data – Project Code ALCC2012-05, 2018).

2.4 Landsat image processing and analysis

Study lakes were visually assessed in ArcGIS to provide a Boolean turbidity rating for the purpose of analyzing the success of different models. Lake clarity was determined by comparing the selected Landsat image (as an RGB true color composite) with a Landsat image acquired on 13 July 2016 (23 d prior to the acquisition date of the selected Landsat image), as well as color-infrared aerial photographs (DOQs) with a 2.5 m horizontal resolution (Fig. 3). Lakes that showed the presence of sediment plumes or water cloudiness near the site of in situ data collection on the selected Landsat image were designated as turbid. Lakes which displayed minimal suspended sediment distant from the area where depths were recorded were designated as turbid as well; however, they were analyzed as if they were clear as the impacts of sediment would not be seen in the depth-point-derived spectral signatures. Lakes that did not have sediment plumes were designated as clear.

Figure 3Sediment is detected in RGB Landsat imagery (acquired 5 August 2016) of a representative study lake (a, b). This is confirmed as a temporary sediment plume by comparing the image of the lake used in modeling to 2.5 m color-infrared photography acquired on 18 July 2002 (c) and a Landsat image acquired on 13 July 2016 (d) in which no sediment plumes are visible. Landsat-8 images and digital orthophoto quadrangles (DOQs) are courtesy of the US Geological Survey.

We validated our qualitative visual turbidity assessment using the ACOLITE (software developed at the Royal Belgian Institute of Natural Sciences for aquatic applications of Landsat and Sentinel-2) implementation of the total suspended matter (TSM) algorithm (Nechad et al., 2010). This algorithm provided quantitative support, agreeing with the visual assessment in 14 out of 17 lakes. However, this algorithm proved highly sensitive to depth (Spearman rank order correlation =0.774; p-value < 0.001) and did not detect sediment in deeper waters to the same extent as shallow waters, effectively ignoring the sediment plumes identified visually. Furthermore, the majority of shallow waters were assigned high TSM values by the algorithm, making the differentiation by turbidity at the lake-wide level irrelevant. Considering the points in our transects, 91 % of high-sediment (i.e., TSM values in or above the 75th percentile) points had measured depths <2 m, and only five outlier high sediment values were detected in points with depths >4.6 m. To directly address the sediment content in deeper waters, the mean TSM value was calculated at each lake from sample points with depths >2 m. Seven out of eight lakes with the highest average TSM values had been designated as turbid by our qualitative assessment (note that one of these lakes was designated as turbid away from the sampling site – this is counted as an error). In addition, all but one of the nine lakes with the lowest mean TSM values were designated as clear at the sampling site.

The chosen Landsat image was clipped to the study area, and a normalized difference water index (NDWI) water mask was created using ArcGIS tools to subset our study lakes from the surrounding land pixels (McFeeters, 1996). Each of our study lakes were then extracted to individual geoTIFF files for use in bathymetry map production.

2.5 Spectral-depth point extraction

Top-of-atmosphere (TOA) reflectance values from the blue band (band 2; 452–512 nm), green band (band 3; 533–590 nm), and red band (band 4; 636–673 nm) of the Landsat image were extracted to each point. Although surface reflectance (SR) imagery was available, we elected to use TOA reflectance initially because SR algorithms are often suboptimal when looking at water bodies due to the low-level water-leaving radiance, and furthermore, we are working at high altitudes where SR corrections are unreliable. Upon comparison, the SR and TOA reflectance values in our selected RGB imagery (discussed below) were very similar (R2>0.99) at our sample locations. The coastal band (band 1; 435–451 nm) was not included here as there was no basis for its examination in prior similar studies (e.g., Jagalingam et al., 2015) at the time this analysis was conducted, and unexpectedly, preliminary results were not greatly improved by the inclusion of the coastal band.

To minimize error caused by associating a single pixel's spectral signature with multiple depth points (i.e., to reduce compatibility issues between the spatial resolution of the sonar transects and the Landsat imagery with which the depth points were compared), the dataset was resampled to include only one depth per pixel. This depth was calculated by averaging the sonar depths of all measurements within the pixel, removing depths greater than 1 standard deviation from this average, and recalculating the depth mean of the pixel. Aggregating per-pixel measurements allowed us to identify the dominant depth represented by the pixel's lake color and improve the precision of training data (i.e., reduce the range of input depths associated with a given band ratio). This pixel-representative depth point provides the final depth value used in analysis. All data visualization and manual data editing were undertaken using ArcMap; automated data editing was done with the aid of ArcGIS and python.

2.6 Model application for lake bathymetry mapping

A total of 12 variations of a spectral-depth algorithm were examined to model bathymetry, each characterized by a specific band ratio, adjustment factor, and growth factor (Table 2). More specifically, the blue to green, blue to red, and green to red band ratios were considered. Such ratios were either simple (e.g., blue band/green band) or transformed according to Stumpf et al. (2003):

(1) ln ( n R i ) ln ( n R j ) ,

where Ri and Rj represent the TOA reflectances for bands i and j, respectively. A constant n is included to effect a positive output (Stumpf et al., 2003). We set n to 500 as it ensured that the logarithm would be positive given any feasible band value input, R, from our image.

Table 2Equations for modeling depth. Modeled depth (Z) is calculated with each of four equations that are tuned with each of three input band pairs. Ri and Rj represent the top-of-atmosphere reflectances of bands i and j, respectively. Band pairs (band i and band j) include the blue and red bands, the blue and green bands, and the green and red bands. Tunable parameters m1 and m0 are derived by comparing spectral signatures with depth (as in Fig. 4a–c).

Download Print Version | Download XLSX

The band ratio and the depth measurement of the point at which the spectral signature was extracted were correlated using either a linear regression or an exponential function (Fig. 4a–c). The constants obtained from each of these models became the parameters with which to tune the linear or exponential equations for the validation data. The root mean squared error (RMSE) of each regression between input depths and input band ratios provided error statistics for modeled depths. In summary, the 12 model variations were each characterized by (1) one of three band ratios, (2) one of two transformation methods, and (3) one of two growth relationships (Table 2).

Figure 4Coefficients of the trend lines between band ratios and measured depths (a–c) are used to tune the depth models for each lake. Different models (specified for each lake in Table 3) predicted lake depth best at each of these three lakes. The correlation between measured and modeled lake depths at three representative lakes (d–f) reveals an underestimation of deeper depths and overestimation of shallow depths. Error bars represent root mean squared error (RMSE).


For each lake, half of the depth points were semi-randomly selected as input data, while the remaining data were used for validation purposes. To ensure that the model was trained and validated with data spanning the full range of input depths, however, the maximum and minimum depths were assigned to the group of data to be input into the model, while the second deepest and second shallowest depth points were retained in the list of validation data. To obtain the best regional model, this same process was undertaken (i.e., selection of half of the data to train the models; application of each of the 12 models); however, a sufficient number of depth points exist in the full dataset such that the explicit assignment of extreme depth values as input and validation data was unnecessary (i.e., the selection was fully random).

Each of the 12 models was tested at each of the 17 lakes and on a regional scale. To account for the slight variations in each model's capacity for depth prediction given different random sets of training data, 1000 trials were performed. This allowed us to assert that the model designated as the most accurate model for a given lake (as determined from one trial) was the same model that most frequently produced the best results for that lake. The best model for each lake, as evaluated by the coefficient of determination between target and predicted data, was used to calculate depths at each pixel in that lake and produce bathymetry raster maps. Depths were multiplied by 900 m2 (the area of one Landsat pixel) and integrated to quantify the lake's water volume. A summary of the data acquisition, processing, and analysis steps is provided in Fig. 5.

Figure 5Sequence of processing, analysis, and production steps used to map bathymetry and derive lake water volumes with depth points and Landsat imagery.


The most accurate models (i.e., the models that were best able to determine lake depth for the greatest number of lakes) were models with an exponential growth factor with input band ratios of blue/green or blue/red (Table 3). In all but three of the study lakes, an exponential relationship was found between spectral signature and depth. At only two lakes did the green to red band ratios provide the best results. The transform ratio provided the best results in 4 out of the 17 lakes, while the simple ratio was used to best model depths in the remainder of the lakes. The difference between the modeled results of the pure versus transform ratios was marginal, however, with an average difference between R2 values generated by the respective models of 0.016.

Table 3The best spectral-depth model for each lake (based on R2). A simple ratio exponential function provided the best model for the greatest number of lakes, while the blue/green and blue/red band ratios both provided good inputs for models at different lakes, accounting for the best spectral-depth models at eight and seven lakes, respectively. The average R2 of the best model at each lake is 0.818 with an average root mean squared error (RMSE) of 1.439 m.

* Some suspended sediment is visible; however, it does not overlap the area where depths were measured.

Download Print Version | Download XLSX

Unsurprisingly, the blue band proved to be the most useful in determining depth overall, while the red band was useful in the presence of turbidity. The blue band was used to tune depths at all but two lakes. Blue light has a shorter wavelength and consequential higher energy which allows it to be absorbed less in water than either green or red light. Thus, the reflectance of the blue band decreases less than either the green or red bands in proportion to increasing depth. In contrast, red light is able to penetrate only several meters into most types of water before it becomes absorbed. The red band proved useful in distinguishing depths at both the sandy littoral shelves, where water is typically 0.5–3 m in depth, and where suspended sediment was present in the water. As sand reflects red light more than blue or green light and suspended sediment can reduce penetration of blue or green wavelengths in deeper water, this is expected. All of the eight lakes where the blue to green band ratios provided the best result were free of sediment where measurements were taken. Furthermore, all of the seven lakes designated as turbid at the data collection site required the incorporation of the red band to achieve the best depth prediction. One anomalous lake where no sediment was detected required the incorporation of the red band to predict depth most accurately.

The two lakes where the green/red band ratio best tuned the model were unique in terms of physical factors or sampling locations. One of these lakes showed the presence of an unusual purple-red patch on a shelf between the littoral shelf and deep zone. Underwater vegetation likely accounts for this unusual spectral signature, and thus it is unsurprising that this lake required a unique band ratio to accurately tune the model. Measurements at the second lake accounted for the shallowest range of depths of any lake (0.2–2.1 m), which may have led to stronger reflectance in the red band as the sand was more prominent.

In addition, an exponential relationship was able to better model depth ranges that include shallow depths of around 0.8 m, an outcome that is likely the result of incomplete transect sampling rather than physical significance. Of the three lakes where a linear function provided the best model, two were the lakes where depths on the littoral shelf were not measured; the third lake contained only a single measurement of the littoral shelf. Therefore, the lakes best modeled with a linear growth relationship are associated with measured bathymetry profiles that do not contain sufficiently shallow littoral shelf depths. This is evidenced by the prediction of negative depths at littoral shelves when applying linear models, the product of the strongly negative y intercepts that render low spectral signature ratios negative. This leads us to conclude that the linear relationship between band ratios and depths at these lakes is more likely the product of the locations where data were gathered rather than a result of physical significance. It is thus important to tune models to all regions (and all depths) of the lake.

3 Results

We produced bathymetry maps for 17 lakes on the ACP (Fig. 6); however, the accuracy of these maps varies by lake and by depth. The best model variants for individual lakes where depth data were collected were able to account for 58.5 %–97.6 % of depth variability (median R2=0.86, mean R2=0.82; Table 3). Regional-scale models, however, were able to accurately explain less than half of the regional depth variability. Median uncertainty of single lake depth models (based on RMSE) was 1.23 m, while the average RMSE of the models was 1.44 m. However, error was not distributed equally across depths, and bathymetry rasters tend to represent a more limited range in depth than the measured depth points (Tables 1, 3). In general, models tended to overestimate shallower depths and underestimate deep depths (Fig. 4d–f). When considering model-predicted depths at all study lakes, depth points less than 2.95 m were overestimated by an average of 0.21 m (or 17.2 % of their true depth), with 61.3 % of depths in this shallow-water group experiencing some model overprediction. Meanwhile, 66.9 % of depths greater than 2.95 m were underestimated with an average difference between measured and modeled depths of 0.97 m. On average, points deeper than 2.95 m were underestimated by 5 %. The threshold of 2.95 m represents the intersection between the 1:1 line and the correlation between measured and predicted depths.To address the underestimation of deep depths and overestimation of shallow depths in our models, additional transformations must be made, a goal that is outside the scope of this work.

Figure 6Bathymetry was modeled individually for each study lake and all bathymetry rasters were ultimately mosaicked together. The color bar indicates the depths predicted by the model variants at each lake; gray represents the pixels at which negative depths were modeled (these negative depths have been reclassified to −1 in the published bathymetry raster dataset; Simpson, 2019).


Bathymetry accuracy variability by depth is at least partially explained by the fact that lake depth points are skewed heavily towards shallow depths, with approximately half of the data points representing depths less than 2.95 m. Only about 15 % of the data points represented depths above 10 m. This is a function of the generally shallow nature of lakes on the Arctic Coastal Plain and the large area covered by littoral shelves within most study lakes (as seen in satellite imagery). Because of the relatively small number of deep water depth points, models were able to map bathymetry less accurately at deep central basins, and therefore the bathymetry maps contain underpredicted deep water depths. In contrast to the skew in depth points, lakes were evenly divided into shallow and deep classes. A total of 9 out of 17 lakes had some measured depths >10 m, and all of the study lakes had measured depths <2.2 m (Table 1).

Lake volumes ranged from 1.056 × 10−3 km3 at the smallest lake (total surface area = 1.089 km2) to 57.416 × 10−3 km3 at the largest lake (total surface area = 18.998 km2) with a median volume of 7.20 × 10−3 km3 (Table 4). Volume and surface area were strongly correlated (R2=0.90) for the 14 lakes for which complete volumes could be modeled (Fig. 7). Linear models predicted negative depths across much of the lakes' shallow littoral shelves; thus, the modeled volumes of the three lakes for which linear models produced the most accurate results are an incomplete representation of the lake's water storage. Pixels for which models predicted negative depths were reclassified to a secondary NoData value of −1 and ignored when calculating water volume (i.e., water volume was calculated for the surface area with predicted depths greater than zero; Fig. 8). Ground truth lake volume data do not exist for the study lakes at a similar scale of analysis, rendering error metrics unfeasible (aside from those implicitly contained in the depth model error).

Figure 7A strong correlation exists between surface area and modeled volume for the 17 lakes we analyzed.


Table 4Modeled lake volumes. Individual lake volumes were estimated by multiplying the modeled depth for each pixel by a constant factor of 900 m2 (Landsat spatial resolution). Depths were modeled by applying the best spectral-depth model for the lake (Table 3). Linear depth models predicted negative depths for some pixels; volume estimates derived from such models (namely the models applied at lakes 2964, 4365, and 6199) include only those pixels with modeled depths greater than zero. The percent of the surface area for which depth estimates at a lake were positive (in contrast to the total surface area of a given lake derived using the NDWI mask) is quantified.

Download Print Version | Download XLSX

Figure 8Modeled lake bathymetry at a representative lake (a) reveals the tendency of linear depth models to drastically underestimate the depths of the littoral shelves when not calibrated to shallow depths. Conversely, the exponential depth models applied to other lakes are promising across both littoral shelves and central basins (b, c). The products of three different spectral-depth model variations are overlain on the Landsat imagery from which the products were derived. Adjacent to each depth product is the original Landsat imagery of the lake. Color bars indicate the depths predicted by the model variants at each lake, while the gray area (a) represents the pixels at which negative depths were modeled (these negative depths have been reclassified to −1 in the published bathymetry raster dataset; Simpson, 2019). Landsat-8 image is courtesy of the US Geological Survey.

4 Discussion

4.1 Depth analysis

Our measured depth points capture the deep water depths on the ACP that many other studies neglect. Furthermore, depth was accurately derived from Landsat OLI imagery for individual lakes (the average R2 value of the selected models for each lake was 0.82). Our R2 values are consistent with those found in the literature (e.g., Jagalingam et al., 2015; Stumpf et al., 2003), and thus our selected models and derived maps can be considered successful. The regional-scale model, however, was unsuccessful, and regional volume analysis and mapping were rejected. This lack of model portability between lakes may be due to the fact that the blue band, most useful in determining depth, is the most susceptible to contamination from atmospheric aerosols. This finding is consistent with Smith and Pavelsky (2009) who found surprisingly high variability in a collection of remotely sensed lake storage volumes on the Peace–Athabasca Delta, Canada, despite their having a similar physiographic setting and morphology.

4.2 Limitations

The depth estimates are only tuned to the extrema of depths measured at each lake. Although gathering data across a lake's full bathymetric profile was attempted, it is likely that the depth minima and maxima were not captured at all lakes. Collecting data with sonar attached to a float plane limited the measurement of depths approaching 0 m. Few pixels were sampled at the minimum depth that was able to be measured (0.2 m), and thus there is insufficient tuning to accurately model the littoral shelves of lakes. Furthermore, while we attempted to gather depths across the deep central basins, it is impossible at present to know whether we sampled the deepest point without measuring the entirety of the basin. Thus, depth maps may not accurately depict a lake's maximum depth.

The limited spatial resolution of Landsat imagery, in comparison with sonar depth data, constitutes the primary limitation to this work. As depths had to be averaged to conform to the assumption that each spectral signature corresponded to a discrete depth, the spatial resolution and depth precision of the sonar depths were greatly degraded, potentially accounting for some of the inaccuracies in the model variants. Modeling bathymetry with satellite imagery of a higher spatial resolution would allow for the use of more training points and thus likely improve the accuracy of depth and volume predictions. Furthermore, samples were taken from a small fraction (in terms of surface area) of the lake (i.e., the entire lake's bathymetry was not mapped, rather data points were collected along discrete and irregular transects). Thus, a mismatch exists regarding the validation data and the natural phenomenon being modeled. Data at such a small spatial scale can never confirm with total accuracy the detailed nature of lake bed bathymetry. Constrained by cost and time, however, collecting data at 17 remote lakes is an important step towards understanding sand sea lake bathymetries on Alaska's Arctic Coastal Plain.

4.3 Implications and future directions

Lakes on the Pleistocene sand sea may be categorized based on depth, littoral substrate, and water clarity, as seen in the study lakes, with such categories providing candidates for different model variations. Future projects may use this work to semiautomatically derive depths across the region, first manually classifying target lakes and then applying different model variations to each class. Furthermore, subregions of each lake (e.g., deep basins, shallow shelves) may be classified in future studies and a different model variant applied to each subregion (e.g., variants that incorporate the red band applied to littoral shelves). Methods of lake subregion differentiation may include either (1) manual delineation based on spectral signatures or (2) automatic delineation with the aid of synthetic aperture radar (SAR) to determine regions of floating versus bedfast ice (which correspond with deep and shallow water, respectively; demonstrated by Engram et al., 2018; Jeffries et al., 1996). Additional future work may include validation of lake water volumes as additional bathymetric datasets become available.

5 Data availability

We present a dataset to greatly increase the number of in situ measurements of lake depth on the little-studied inner Arctic Coastal Plain of Alaska. The dataset contains 13 735 point measurements of bathymetric depth measured across 19 lakes and is freely available through the National Science Foundation Arctic Data Center: (Simpson and Arp, 2018). The second dataset created for this project is comprised of 17 bathymetry rasters, one for each lake for which a sufficient number of depth points were collected. These rasters represent the depth predictions of the best performing model for each individual lake and are also freely available through the National Science Foundation Arctic Data Center: (Simpson, 2019).

6 Conclusions

This work provides a unique in situ depth dataset for lakes on the ACP and leverages these data alongside satellite remote sensing to map lake bathymetries and estimate volume. Lake volumes can be monitored using remote sensing; however, at least one field visit must be made in order to select the best model for a given lake. As of yet, it is still challenging to universally model the bathymetry of lakes across northern Alaska. Instead, field data continue to be necessary to train and calibrate models on a per-lake basis.

Furthermore, lake morphology may evolve in glaciated regions such as northern Alaska in response to hydroclimatic changes and permafrost degradation (Arp et al., 2011; Liljedahl et al., 2011; Nitze et al., 2017). This implies that individual field surveys and static modeling efforts such as this one may not accurately represent ground conditions ad infinitum, particularly in the presence of a rapidly warming Arctic climate (Nitze et al., 2017). In addition to the persistent need for field data to address modeling limitations to spatial scale, field data collection and/or dynamic models will be important components if we are to model bathymetry across a longer temporal scale.

Despite these limitations, the simplicity of the depth modeling and bathymetry mapping approach has important benefits. The models can be tuned very rapidly and require relatively few data points for training in comparison to machine learning models (e.g., Sagawa et al., 2019), a useful feature when training data must be collected in a relatively inaccessible region such as northern Alaska. In addition, the comparative nature of the demonstrated modeling facilitates analysis of individual lake characteristics. Overall, this work provides an effective dataset and methodology for mapping the bathymetry of individual lakes in a unique geologic setting on the ACP.

Author contributions

CES and CDA designed the sampling. CDA secured the funding and instrumentation for field work. CES, CDA, and BMJ conducted the field work. CES processed and analyzed the data and prepared the figures and tables. CES prepared the paper with contributions from BMJ, CDA, LCS, MLC, and YS.

Competing interests

The authors declare that they have no conflict of interest.


Special thanks to Jim Webster for flight support.

Financial support

This research has been supported by the National Sciences Foundation, NSF project no. 1560372, REU: Understanding the Arctic as a System, NSF project no. 1417300, ALISS: Arctic Lake Ice Systems Science, NSF project no. 1806213, and the Bureau of Land Management Arctic District Office. Additional funding for this project was provided by the NASA Arctic-Boreal Vulnerability Experiment (ABoVE) grant NNX17AC60A to LCS and the University of California, Los Angeles, honors program’s Irving and Jean Stone Research Award.

Review statement

This paper was edited by Birgit Heim and reviewed by Ali P. Yunus and Ingmar Nitze.


Alaska North Slope LiDAR Data (Project Code ALCC2012-05): Arctic Landscape Conservation Cooperative, available at:, last access: 30 October 2018. 

Arp, C. D., Jones, B. M., Liljedahl, A. K., Hinkel, K. M., and Welker, J. A.: Depth, ice thickness, and ice-out timing cause divergent hydrologic responses among Arctic lakes, Water Resour. Res., 51, 9379–9401,, 2015. 

Arp, C. D., Jones, B. M., Urban, F. E., and Gross, G.: Hydrogeomorphic processes of thermokarst lakes with grounded-ice and floating-ice regimes on the Arctic coastal plain, Alaska, Hydrol. Proc., 25, 2422–2438,, 2011. 

Carson, C. E.: Radiocarbon Dating of Lacustrine Strands in Arctic Alaska, Arctic, 21, 12–26, 1968. 

Carson, C. E. and Hussey, K. M.: The Oriented Lakes of Arctic Alaska, J. Geol., 70, 417–439, 1962. 

Carter, L. D.: A Pleistocene Sand Sea on the Alaskan Arctic Coastal Plain, Science, 211, 381–383,, 1981. 

Clark, R. K., Fay, T. H., and Walker, C. L.: Bathymetry calculations with Landsat 4 TM imagery under a generalized ratio assumption, Appl. Opt., 26, 4036_1–4038,, 1987. 

Engram, M., Arp, C. D., Jones, B. M., Ajadi, O. A., and Meyer, F. J.: Analyzing floating and bedfast lake ice regimes across Arctic Alaska using 25 years of space-borne SAR imagery, Remote Sens. Environ., 209, 660–676,, 2018. 

Hinkel, K. M., Frohn, R. C., Nelson, F. E., Eisner, W. R., and Beck, R. A.: Morphometric and spatial analysis of thaw lakes and drained thaw lake basins in the western Arctic Coastal Plain, Alaska, Permafrost Periglac., 16, 327–341,, 2005. 

Hinkel, K. M., Sheng, Y., Lenters, J. D., Lyons, E. A., Beck, R. A., Eisner, W. R., and Wang, J.: Thermokarst Lakes on the Arctic Coastal Plain of Alaska: Geomorphic Controls on Bathymetry, Permafrost Periglac., 23, 218–230,, 2012. 

Hodúl, M., Bird, S., Knudby, A., and Chénier, R.: Satellite derived photogrammetric bathymetry, ISPRS J. Photogramm., 142, 268–277,, 2018. 

Hopkins, D. M.: Thaw Lakes and Thaw Sinks in the Imuruk Lake Area, Seward Peninsula, Alaska, J. Geol., 57, 119–131,, 1949. 

Jagalingam, P., Akshaya, B. J., and Hegde, A. V.: Bathymetry Mapping Using Landsat 8 Satellite Imagery, Procedia Eng., 116, 560–566,, 2015. 

Jeffries, M. O., Morris, K., and Liston, G. E.: A Method To Determine Lake Depth and Water Availability on the North Slope of Alaska with Spaceborne Imaging Radar and Numerical Ice Growth Modelling, Arctic, 49, 367–374,, 1996. 

Jeffries, M. O., Zhang, T., Frey, K., and Kozlenko, N.: Estimating late-winter heat flow to the atmosphere from the lake-dominated Alaskan North Slope, J. Glaciol., 45, 315–324,, 1999. 

Jones, B. M., Arp, C. D., Hinkel, K. M., Beck, R. A., Schmutz, J. A., and Winston, B.: Arctic Lake Physical Processes and Regimes with Implications for Winter Water Availability and Management in the National Petroleum Reserve Alaska., Environ. Manage., 43, 1071–1084, 2009. 

Jones, B. M., Arp, C. D., Whitman, M. S., Nigro, D., Nitze, I., Beaver, J., Gädeke, A., Zuck, C., Liljedahl, A., Daanen, R., Torvinen, E., Fritz, S., and Grosse, G.: A lake-centric geospatial database to guide research and inform management decisions in an Arctic watershed in northern Alaska experiencing climate and land-use changes, Ambio, 46, 769–786,, 2017. 

Jorgenson, M. T. and Shur, Y.: Evolution of lakes and basins in northern Alaska and discussion of the thaw lake cycle, J. Geophys. Res., 112, F02S17,, 2007. 

Jorgenson, M. T., Kanevskiy, M., Shur, Y., Grunblatt, J., Ping, C. L., and Michaelson, G.: Permafrost database development, characterization, and mapping for northern Alaska, Report for Arctic Landscape Conservation Cooperative by Alaska Ecoscience and University of Alaska Fairbanks, available at: (last access: July 2017), 45 pp., 2014. 

Kozlenko, N. and Jeffries, M. O.: Bathymetric Mapping of Shallow Water in Thaw Lakes on the North Slope of Alaska with Spaceborne Imaging Radar, Arctic, 53, 306–316,, 2000. 

Liljedahl, A. K., Boike, J., Daanen, R. P., Fedorov, A. N., Frost, G. V., Grosse, G., Hinzman, L. D., Iijma, Y., Jorgenson, J. C., Matveyeva, N., Necsoiu, M., Raynolds, M. K., Romanovsky, V. E., Schulla, J., Tape, K. D., Walker, D. A., Wilson, C. J., Yabuki, H., and Zona, D.: Pan-Arctic ice-wedge degradation in warming permafrost and its influence on tundra hydrology, Nat. Geosci., 9, 312–318,, 2016. 

Livingstone, D. A.: On the orientation of lake basins [Alaska], Am. J. Sci., 252, 547–554,, 1954. 

McFeeters, S. K.: The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features, Int. J. Remote Sens., 17, 1425–1432,, 1996. 

Nechad, B., Ruddick, K. G., and Park, Y.: Calibration and validation of a generic multisensor algorithm for mapping of total suspended matter in turbid waters, Remote Sens. Environ., 114, 854–866,, 2010. 

Nitze, I., Grosse, G., Jones, B. M., Arp, C. D., Ulrich, M, Fedorov, A., and Veremeeva, A.: Landsat-Based Trend Analysis of Lake Dynamics across Northern Permafrost Regions, Remote Sens., 9, 640,, 2017. 

Pacheco, A., Horta, J., Loureiro, C., and Ferreira, Ó. : Retrieval of nearshore bathymetry from Landsat 8 images: A tool for coastal monitoring in shallow waters, Remote Sens. Environ., 159, 102–116,, 2015. 

Pope, A., Scambos, T. A., Moussavi, M., Tedesco, M., Willis, M., Shean, D., and Grigsby, S.: Estimating supraglacial lake depth in West Greenland using Landsat 8 and comparison with other multispectral methods, The Cryosphere, 10, 15–27,, 2016. 

Rex, R. W.: Hydrodynamic Analysis of Circulation and Orientation of Lakes in Northern Alaska, in: Geology of the Arctic, edited by: Raasch, G. O., Toronto: University of Toronto Press, 1021–1043,, 2019. 

Sagawa, T., Yamashita, Y., Okumura, T., and Yamanokuchi, T.: Satellite derived bathymetry using machine learning and multi-temporal satellite images, Remote Sens., 11, 1155,, 2019. 

Sellman, P. V., Brown, J., Lewellen, R. I., McKim, H., and Merry, C.: The Classification and Geomorphic Implications of Thaw Lakes on the Arctic Coastal Plain, Alaska, Research Report, Cold Regions Research and Engineering Lab, Hanover, NH, 1975. 

Sibley, P. K., White, D. M., Cott, P. A., and Lilly, M. R.: Introduction to Water Use From Arctic Lakes: Identification, Impacts, and Decision Support1, J. Am. Water Resour. As., 44, 273–275,, 2008. 

Simpson, C.: Modeled Bathymetry Maps of 17 Lakes on the Arctic Coastal Plain of Alaska, 2017, Arctic Data Center, data set,, 2019. 

Simpson, C. and Arp, C.: Sonar Depth Measurements at Lakes on the Inner Arctic Coastal Plain of Alaska, July 2017, Arctic Data Center, data set,, 2018.  

Smith, L. C. and Pavelsky, T. M.: Remote sensing of volumetric storage changes in lakes, Earth Surf. Proc. Land., 34, 1353–1358,, 2009. 

Stumpf, R. P., Holderied, K., and Sinclair, M.: Determination of water depth with high-resolution satellite imagery over variable bottom types, Limnol. Oceanogr., 48, 547606,, 2003. 

van Everdingen, R.: Multi-language glossary of permafrost and related ground-ice terms, Natl. Snow and Ice Data Cent., Boulder, Colorado, 1998. 

Williams, J. R.: Engineering - geologic maps of northern Alaska, Meade River Quadrangle: U.S. Geological Survey Open-File Report 83-294, 29 p., 1 sheet, scale 1:250,000, 1983. 

Williams, J. R., Carter, L. D., and Yeend, W. E.: Coastal Plain Deposits of NPRA, The United States Geological Survey in Alaska, Accomplishments During 1977, U.S. Department of the Interior, Geological Survey, 1978. 

Yunus, A. P., Dou, J., Song, X., and Avtar, R.: Improved Bathymetric Mapping of Coastal and Lake Environments Using Sentinel-2 and Landsat-8 Images, Sensors, 19, 2788,, 2019. 

Short summary
Sonar depth point measurements collected at 17 lakes on the Arctic Coastal Plain of Alaska are used to train and validate models to map lake bathymetry. These models predict depth from remotely sensed lake color and are able to explain 58.5–97.6 % of depth variability. To calculate water volumes, we integrate this modeled bathymetry with lake surface area. Knowledge of Alaskan lake bathymetries and volumes is crucial to better understanding water storage, energy balance, and ecological habitat.