**Data description paper**
02 Jul 2021

**Data description paper** | 02 Jul 2021

# Coastal complexity of the Antarctic continent

Richard Porter-Smith John McKinlay Alexander D. Fraser and Robert A. Massom

^{1},

^{2},

^{1,3},

^{1,2}

**Richard Porter-Smith et al.**Richard Porter-Smith John McKinlay Alexander D. Fraser and Robert A. Massom

^{1},

^{2},

^{1,3},

^{1,2}

^{1}Antarctic Climate & Ecosystems Corporative Research Centre, University of Tasmania, Hobart, Tasmania, Australia^{2}Australian Antarctic Division, Kingston, Tasmania, Australia^{3}Institute for Marine and Antarctic Studies, University of Tasmania, Hobart, Tasmania, Australia

^{1}Antarctic Climate & Ecosystems Corporative Research Centre, University of Tasmania, Hobart, Tasmania, Australia^{2}Australian Antarctic Division, Kingston, Tasmania, Australia^{3}Institute for Marine and Antarctic Studies, University of Tasmania, Hobart, Tasmania, Australia

**Correspondence**: Richard Porter-Smith (r.smith@utas.edu.au) and Alexander D. Fraser (alexander.fraser@utas.edu.au)

**Correspondence**: Richard Porter-Smith (r.smith@utas.edu.au) and Alexander D. Fraser (alexander.fraser@utas.edu.au)

Received: 13 Aug 2019 – Discussion started: 05 Sep 2019 – Revised: 21 Apr 2021 – Accepted: 22 Apr 2021 – Published: 02 Jul 2021

The Antarctic outer coastal margin (i.e. the coastline itself or the terminus or front of ice shelves, whichever is adjacent to the ocean) is a key interface between the ice sheet and terrestrial environments and the Southern Ocean. Its physical configuration (including both length scale of variation, orientation, and aspect) has direct bearing on several closely associated cryospheric, biological, oceanographical, and ecological processes, yet no study has quantified the coastal complexity or orientation of Antarctica's coastal margin. This first-of-a-kind characterization of Antarctic coastal complexity aims to address this knowledge gap. We quantify and investigate the physical configuration and complexity of Antarctica's circumpolar outer coastal margin using a novel technique based on ∼ 40 000 random points selected along a vector coastline derived from the MODIS Mosaic of Antarctica dataset. At each point, a complexity metric is calculated at length scales from 1 to 256 km, giving a multiscale estimate of the magnitude and direction of undulation or complexity at each point location along the entire coastline. Using a cluster analysis to determine characteristic complexity “signatures” for random nodes, the coastline is found to comprise three basic groups or classes: (i) low complexity at all scales, (ii) most complexity at shorter scales, and (iii) most complexity at longer scales. These classes are somewhat heterogeneously distributed throughout the continent. We also consider bays and peninsulas separately and characterize their multiscale orientation. This unique dataset and its summary analysis have numerous applications for both geophysical and biological studies. All these data are referenced by https://doi.org/10.26179/5d1af0ba45c03 (Porter-Smith et al., 2019) and are available free of charge at http://data.antarctica.gov.au (last access: 7 June 2021).

Although substantial research has been undertaken on quantification of coastal complexity of terrestrial areas (Andrle, 1996a; Bartley et al., 2001; Jiang and Plotnick, 1998; Porter-Smith and McKinlay, 2012), equivalent attention has not yet been paid to the polar continents and ice sheets, i.e. the Antarctic and Greenland. Over its vast total length of > 30 000 km (Fig. 1), the coastline of Antarctica – the focus of this study – comprises only 5 % exposed rock – the remainder consists of ice at the seaward margins of (i) ice sheet grounded (resting) on bedrock (38 %) or (ii) floating extensions of the ice sheet in the form of ice shelves (44 %) and glacier tongues or snouts (13 %) up to several hundred metres thick (Drewry et al., 1982). As such, the Antarctic coastline is more dynamic than its mid-latitude terrestrial counterparts due to ice advance and iceberg calving, and its complexity is therefore more challenging to quantify (Porter-Smith, 2003). Given its direct contact with the high-latitude ocean and atmosphere, the floating outer margin of the ice sheet is also highly sensitive to climate and environmental change.

Characterizing the magnitude and direction of bays and peninsulas over a range of length scales and aspects is a necessary step to evaluating the important though poorly understood effects of coastal complexity on key physical, ecological, and biological processes and phenomena occurring around the circumpolar Antarctic margin. Localized case studies have shown coastline geometry and aspect to be a major determinant of the distribution and properties of sea ice in the Antarctic coastal zone (Fraser et al., 2012; Giles et al., 2008; Massom et al., 2001) and to affect important ice sheet margin processes, e.g. ice shelf–ocean interaction, melt, and iceberg calving, with implications for sea level rise.

Notably, coastal complexity (here termed *C*_{x}) is likely to be an important factor
determining the observed variability in patterns of spatial extent and persistence of landfast sea ice (fast ice) around the Antarctic coastal zone
(Fraser et al., 2012), where fast ice forms both dynamically through interception of pack ice by coastal protrusions (and grounded icebergs) and
thermodynamically in sheltered embayments (Fraser et al., 2012; Giles et al., 2008; Massom et al., 2001). Developing improved knowledge of factors
affecting fast-ice distribution and polynya behaviour – including coastline configuration – is a high priority, as it is a first step towards better
predicting the likely future trajectory of the vulnerable Antarctic coastal environment in a changing climate. Fast ice forms a crucially important
habitat, e.g. for emperor penguins (Massom et al., 2009), is a determinant of ice shelf stability
(Massom et al., 2010, 2018), and has a major impact on logistical operations, e.g. station resupply.

Moreover, coastline configuration and fast-ice distribution are primary determinants of the location and size of Antarctic coastal polynyas (Massom et al., 1998; Fraser et al., 2020). Antarctic polynyas are of regional to global significance as sites of high sea ice production and (in certain cases) associated Antarctic Bottom Water (AABW) formation (Rintoul, 1985) that drives global ocean thermohaline circulation. Polynyas are also areas of enhanced biological productivity and form key habitat for marine mammals and birds (Arrigo and van Dijken, 2003; Tynan et al., 2010).

A further motivation relates to improving model simulation of the complex and highly vulnerable Antarctic coastal environment and the processes
therein. Although model representation of coastlines is inherently smoother than reality due to limitations of model resolution (Hibler, 1979),
coastal complexity is a consideration for producing more accurate dynamic sea ice models, whereby “rougher” coasts (with higher *C*_{x}) tend to
favour production of shear margins or zones in the mobile offshore pack ice. For sea ice models with insufficient spatial resolution to resolve
*C*_{x} explicitly, parameterization of *C*_{x} is required, but baseline knowledge of *C*_{x} is currently lacking. Such a dataset can
provide a “roughness” boundary for sea ice models that currently have insufficient spatial resolution to explicitly resolve the
coastline. Characterization of coastline complexity magnitude, feature type (embayment or peninsula), and feature aspect could also feed into exposure
models for wave–ice shelf interaction (Manson et al., 2005; Massom et al., 2018) and studies quantifying wave exposure relative to coastline
features. This would naturally complement general fetch and exposure models (Hill et al., 2010; Reid and Massom, 2016).

The complexity of terrestrial coastlines is dependent on geological inheritance and surrounding ocean processes. For instance, each of Australia's
geological regions displays discrete complexity signatures, demonstrating a correlation between coastal complexity and geology – an analysis of which
revealed a close relation between *C*_{x}, lithological mix, and ocean processes (see, e.g. Porter-Smith and McKinlay, 2012). These signatures can vary
enormously between regions and over a range of length scales. Geological phenomena cannot be captured by a single value (Ringrose,
1994), and an attempt to do so may cause a process or form to be missed or misinterpreted. To capture
the true complexity of the coastline, it is necessary to adopt a method that accounts for scale variation, since geomorphological features and
associated processes can vary across several scales (Andrle, 1994, 1996a; Goodchild and Mark, 1987;
Lam and Quattrochi, 1992). Therefore, an appreciation of the variability of complexity evident at
different length scales is crucial (Porter-Smith and McKinlay, 2012).

Characterization of the complexity of terrestrial coastlines is a fundamental measure of the lithological mix. Coastlines of a homogeneous lithology tend to be straighter than coastlines of mixed lithology. Wave action promotes a straight coastline if the lithology is homogeneous and a complex one if the lithology is heterogeneous (Porter-Smith and McKinlay, 2012). The Antarctic coastline is a different challenge in that it is almost totally covered by glacier ice and surrounded by ice barriers that influence ocean processes acting on the continent and is likely to be more likely to be more temporally variable in nature than terrestrial coastlines. Additionally, knowledge of the underlying rock type is severely limited due to inability to access much of the geology through the ice (Stål et al., 2019).

However, even in this homogeneous environment, one might expect a relatively high complexity due to the presence of glacial valleys, an example
would include the Western Peninsula's fjord-like coast, where there are glacial erosion processes in motion. Glacial erosive processes have a distinct
signature (Anderson et al., 2006) that would result in a higher coastal complexity. Although the formative processes may differ between Antarctic and
terrestrial scenarios, the methodology does not assume prescriptive or formative processes but instead classifies purely based on differences in complexity over a
range of length scales. The analysis of *C*_{x} using this multiscale approach also allows the identification and analysis of morphologically
similar coastal environments and forms the basis for further research into their relationship to and synergy with natural processes.

Despite its importance, no study has quantified the coastline complexity of the Antarctic continent. Here, we address this critical gap by carrying
out a first quantification of the geometric configuration and complexity of the Antarctic coastline, using a novel technique to examine the spatial
distribution of both the magnitude and direction (aspect) of *C*_{x} over varying length scales. This new dataset (Porter-Smith et al., 2019) not
only highlights spatial differences but also serves as an important yardstick against which to gauge future change and variability in coastal complexity
and character around Antarctica. In this study, we derive methods for determining scale-dependent metrics describing coastal complexity of the
Antarctic continent, including the facility to classify points as belonging to bays or peninsulas at different scales. Using this metric at 40 000
random point locations around the coastal margin, we use clustering techniques to determine characteristic complexity “signatures” around the
continent.

## 2.1 Quantifying the complexity of the Antarctic coastline

To calculate *C*_{x} for the entire Antarctic continental coastline, 40 000 points were randomly chosen along the MODIS Mosaic of Antarctica
2008–2009 (MOA2009) coastline dataset (Haran et al., 2014), acquired from the US National Snow and Ice Data Center (NSIDC). The coastal margin is used
in the calculation of complexity since the outer margin is more relevant for the processes listed in the introduction here (e.g. ecological habitats,
fast-ice formation, polynya location, ice shelf–ocean interaction). Figure 2 illustrates the algorithm for determining *C*_{x}. At each random
target point *x* on the merged MOA dataset and for each length scale (of 1, 2, 4, 8, 16, 32, 64, 128, and 256 km), the Euclidean straight-line
distance was measured either side of the chosen point to find the corresponding points, *a* and *b*, that intersect the coastline. The two
vectors $\stackrel{\mathrm{\u203e}}{\mathit{x}\mathit{a}}$ and
$\stackrel{\mathrm{\u203e}}{\mathit{x}\mathit{b}}$ are vector-summed to give the quantity $\stackrel{\mathrm{\u203e}}{\mathit{x}\mathit{c}}$, indicating the magnitude of complexity and direction (both relative to
north and the local coastline) for the aspect. The maximum distance between successive random points was rarely greater than 1 km, thereby
giving a near-uniform and seamless representation of complexity around the continent (Fig. 2).

This approach varies from previous techniques employed to derive *C*_{x}, such as the angled measurement technique (AMT) where the length scale is
measured forward and backwards of a chosen point on the mapped coastline. In the AMT, the measure of complexity is the supplementary angle (Andrle,
1994, 1996b; Porter-Smith and McKinlay, 2012). The new approach presented here offers a measure not only of complexity (as magnitude) but also of
direction. Additionally, the new technique allowed qualification of the chosen section of coastline as either a bay or peninsula for a given length
scale (i.e. any angle less than 180^{∘} would be classed as a bay, and any angle over 180^{∘} would be classed as a peninsula). An
advantage of our technique is that it can be used to quantify coastal complexity at various scales to reflect the multiscale nature of features along
the coastline. Additionally, characterizing the orientation (i.e. aspect) of features is useful in that it can be compared to the directions of
other potential co-variates, allowing correlations and interactions to be examined.

Given that complexity magnitude varies as length scale changes, the resultant magnitudes of $\stackrel{\mathrm{\u203e}}{\mathit{x}\mathit{c}}$ were normalized to a range of 0 to 100 to give comparability between length scales. The spectrum of length scales examined was chosen to provide complexity measurements at scales relevant to known oceanic, cryospheric, and geomorphological processes and phenomena at kilometre-to-mesoscale levels, with individual lengths chosen as a series of base 2 powers to minimize the potential problem of spatial autocorrelation (Goodchild, 1986).

Data processing, spatial analysis, and mapping was carried out using the GIS and spatial analysis platforms Arc/Info (ESRI, 1996) and QGIS (Quantum GIS
Development Team, 2014). Statistical analysis was carried out using the R language for statistical computing (Ihaka and Gentleman, 1996; R Core Team,
2014) and the R package *cluster* (Maechler et al., 2018).

## 2.2 Clustering

Unsupervised classification (clustering) techniques were used to determine how many distinct complexity classes exist around the Antarctic coast.
Cluster analysis has a rich history in statistics and machine learning (Hastie et al., 2001; Kaufman and Rousseeuw, 1990). In both fields, it is
primarily used as an exploratory technique to identify *k* groups from *n* observations, such that observations within groups are more similar to one
another in their *p* multivariate responses than they are compared with those in other groups.

Given the large size of the dataset and the high computational burden of many clustering algorithms, two common and tractable methodologies were
selected: *k* means and partitioning around medoids (*k* medoids) (Kaufman and Rousseeuw, 1990; Maechler et al., 2018). These centroid-based
partitioning methods were applied to the *n*≈ 40 000 complexity magnitude values for *p* = 9 length scales (i.e. 1, 2, 4, 8, 16, 32, 64,
128, and 256 km). For both *k* means and *k* medoids, length scales were first standardized (0-4116100), and Euclidean distances were used as
the metric describing the similarity between observations. The primary difference between these clustering techniques is that while *k* means attempt
to group objects into *k* clusters based on minimizing the distance of observations to group means (i.e. minimizing the within-cluster
sums of squares), *k* medoids operate by minimizing distances to group medoids, where the latter are data points that are analogous to multivariate
medians. Thus, clustering by *k* medoids can be considered a robust alternative to *k* means that will be less influenced by outliers and noise in the
data. Given the size of the merged MOA coastline dataset, we employ the Clustering LARge Applications (CLARA) implementation of partitioning around
medoids, a method that subsets data in order to achieve an optimal solution that is linear (rather than quadratic) in *n*. The algorithm of Hartigan
and Wong (1979) was used for *k*-means clustering, and optimization was conducted over several random starts to ensure global optimization was
achieved.

For any given application, clustering should be carried out for the spatial extent and at spatial scales relevant to the phenomena under investigation. As the present study seeks a synoptic, Antarctic-wide summary of complexity, we first consider all data (Antarctic-wide, all length scales) in a single analysis. In this case, all length scales are afforded equal weight in the analysis. However, it is likely that many local- to regional-scale phenomena impacting oceanic and cryosphere processes may be relatively unaffected by smaller-scale complexity. For this reason, cluster analyses were repeated on complexity data restricted to length scales ≥ 8 km and results compared with those derived from analyses of all length scales considered simultaneously.

## 2.3 Gap statistic for determining number of clusters

A common problem when conducting unsupervised classification is that often the true number of groups, say *k*^{∗}, is unknown and must be estimated
from the data. Estimating *k*^{∗} is a difficult and somewhat ill-defined problem since there is no universal definition of what should constitute
a group, and this has led to a wide variety of approaches for estimating *k*^{∗} under different clustering scenarios (Charrad et al., 2014;
Milligan and Cooper, 1985). The gap statistic, which can be used in conjunction with many clustering techniques, is one of the more useful approaches
to objectively determining *k*^{∗} (Tibshirani et al., 2001). While it is known to perform imperfectly in a limited set of circumstances (Mohajer
et al., 2011), Tibshirani et al. (2001) use simulation experiments and analyses of real data to demonstrate that the technique outperforms a wide
range of alternate established methods. The technique determines the optimal number of groups by examining the within-cluster dispersion
*W*_{k} as a function of the number of clusters *k*. Obtaining separate clustering solutions for $k\in \mathit{\{}\mathrm{1},\mathrm{2},\mathrm{\dots},{k}_{max}\mathit{\}}$, along
with corresponding *W*_{k} values $\mathit{\{}{W}_{\mathrm{1}},{W}_{\mathrm{2}},\mathrm{\dots},{W}_{{\mathrm{k}}_{max}}\mathit{\}}$, shows that by itself *W*_{k} is uninformative
since it always decreases with increasing *k*, even for independent data with no structure. The gap statistic overcomes this problem by defining

where *E*_{n} denotes the expectation under a sample size of *n* from a reference (*H*_{0}) distribution. The latter is determined by resampling from
a uniform distribution on the *p* hypercube determined by the ranges of the data after first centring and rotating them to align with their principal
axes. The optimal cluster number *k*^{∗} is estimated as the value maximizing Gap_{n}(*k*) after considering sampling variability associated
with determining the reference distribution. In practice, this is achieved by choosing *k*^{∗} to provide the maximum gap statistic that is within
1 standard error (Breiman et al., 1984) of the first local maximum over the range of *k* (Tibshirani et al., 2001). For the present study, *E*_{n}*{*log (*W*_{k})*}* was estimated by an average of 100 separate Monte Carlo samples of the reference distribution. For both *k* means and
*k* medoids, Gap_{n}(*k*) was assessed over the range *k*=1 to 20. The gap statistic can be calculated for a range of clustering algorithms,
which allows the similarity in clustering solutions to be compared between methods.

## 3.1 Complexity and aspect around the continent

The total length of the outer merged MOA coastline is 39 593 km. The length of ice shelf and grounded ice coastline around the continent are
21 269 and 18 324 km, respectively and roughly proportional in western and eastern Antarctica. There is a strong positive skew in the
distribution of *C*_{x} at all length scales, and this skew is especially pronounced at shorter length scales, i.e. complexity is not
normally distributed, indicating that the Antarctic coastal margin has a tendency to be straighter rather than highly complex (Fig. 3).

A notable difference between the western and eastern sectors of Antarctica (−180–0 and 0–180^{∘}) is the orientation of both bays and
peninsulas. In East Antarctica, these features generally face directly offshore across all length scales (Fig. 4), with the higher *C*_{x}
magnitude generally facing directly offshore, i.e. a normal distribution of magnitudes and their orientations. In West Antarctica, on the other hand,
both bays and peninsulas have a general skew toward the west-of-offshore direction. This becomes particularly dominant at length scales
of > 16 km. This bias in the bay and peninsula feature orientation may have implications for key physical processes (e.g. formation and
persistence of fast ice) and biological processes highlighted in the Introduction. These variances could be used to examine and differentiate between regional and local areas and with other
co-variates to analyse specific phenomena.

## 3.2 Determining the number of complexity groups using clustering

Analysis of the gap statistics shows that omission of smaller length scales (≤ 8 km) produces a pronounced local maximum at *k*=3. This
suggests that the optimal number of complexity groups is three, as shown by the “elbow” in the gap statistic plots (see Fig. 5).

A projection of random point scores onto the first two principal component axes, accounting for 41 % of the total variation (Fig. 6), shows the three groups in relation to projections of the complexity length classes. As might be expected, arrows representing the complexity length classes appear in approximate order, in a fan shape, indicating that adjacent classes are most closely correlated with one another. Variances look approximately the same (i.e. arrows are approximately the same length) across length classes. In the two-dimensional approximation, the three groups show considerable overlap.

Figure 7 shows violin plots of *C*_{x} magnitude (for ≥ 8 km length scales), by the three-group structure determined by *k*-medoid
clustering. The red line joins adjacent medians. This plot reveals the multiscale complexity of each group: group 1 represents coastline with little
complexity (i.e. relatively smooth) at all length scales; group 2 represents coastline with more small-scale (≤ 32 km) complexity; and
group 3 represents coastline with more large-scale (≥ 64 km) complexity.

The *C*_{x} dataset (Porter-Smith et al., 2019) presented here allows spatially resolved characterization of normalized complexity as a function
of longitude for each length scale. This is shown in Fig. 8 as a polar plot. For simplicity, we show only the normalized complexity for 16 km
(representing the “class 2” short length-scale cluster) and 128 km (representing the “class 3” long length-scale cluster). For both bays
and peninsulas, the 16 km *C*_{x} is both larger and more homogenous as a function as longitude (bays: mean
*C*_{x} = 24.8 ± 16.7; peninsulas: mean *C*_{x} = 23.9 ± 15.0), whereas the 128 km *C*_{x} is more
heterogeneous or episodic in nature (bays: mean *C*_{x} = 13.0 ± 17.3; peninsulas: mean *C*_{x} = 15.1 ± 17.2).

Figure 9 shows the mix of coastline groups contained within a 64 km sliding window (chosen to allow as many data points as possible while still representing reasonably short length-scale variability) for the entire coastal margin. Although groups or typologies are observed to occur heterogeneously around the entire coastline, certain classes tend to dominate at specific scales and locations around the continent. To derive the dominant group within the heterogeneity, each of the three groups were totalled within the sliding window and proportionately normalized to 255. The dominance and heterogeneity could then be expressed and represented as a value within the RGB colour model.

As expected, the coastal margins of the Ronne, Ross, and Larsen C ice shelves are predominantly group 1. This reflects the very smooth nature of these ice shelf fronts, which tend to calve large, tabular icebergs. There are also several other ice shelves exhibiting group 1 dominance but which do not calve large tabular icebergs, including the Larsen D ice shelf on the eastern side of the peninsula, the Venable and Abbots ice shelves on the western side of the peninsula, and the ice shelves of the Sabrina Coast of East Antarctica. Several East Antarctic regions of grounded ice margin also exhibit group 1 dominance, including the Prince Olav, Mawson, Ingrid Christensen, Wilhelm II, Knox, Wilkes Land, and Adélie Land coasts.

Regions dominated by group 2 (indicating high *C*_{x} at small length scales) include the grounded ice coastal margin on the northern part of the
western Antarctic Peninsula (between Cape Roquemaurel at 63.5^{∘} S, 58.9^{∘} W and Cape Jeremy at 69.4^{∘} S, 68.8^{∘} W), a
mountainous stretch of Victoria Land on the coast of the western Ross Sea that is punctuated by glacier tongues of length 15 to 25 km
(between Cape Washington at 74.7^{∘} S, 165.5^{∘} E and Coulman Island at 73.3^{∘} S, 169.7^{∘} E), and the Sulzberger Ice
Shelf region (at 77^{∘} S, 150^{∘} W). The latter is characterized by a highly crevassed and rough (on a 25 km scale) ice shelf
margin resulting from severe dynamical constraints on outflowing glacial ice.

Regions exhibiting group 3 dominance, on the other hand, occur mainly at major coastal inflection points. Notable locations are where the Transantarctic Mountains meet the McMurdo Ice Shelf, at the tip of the Antarctic Peninsula, and along the coastline of Alexander Island and the Wilkins Ice Shelf, where coastal undulations occur on the large spatial scale captured by group 3 (64 to 256 km).

Enlargements of Fig. 9 around Enderby Land and Victoria Land are presented in Figs. 10 and 11, respectively. These enlargements highlight regions of
complex heterogeneity in *C*_{x}.

Underlying software code and metadata are freely available and can be accessed at https://doi.org/10.5281/zenodo.5044565 (last access: 30 June 2021, Porter-Smith, 2021).

These data are available free of charge from the Australian Antarctic Data Centre (http://data.antarctica.gov.au, last access: 7 June 2021) and are referenced by https://doi.org/10.26179/5d1af0ba45c03 (Porter-Smith et al., 2019).

This first-of-a-kind study of Antarctic coastal complexity has quantified and classified discreet morphology signals using a novel technique to produce a new dataset describing complexity for the entire circum-Antarctic coastal margin over a range of scales. To date, there has been no quantification of the physical configuration of this important interface, despite its central relevance to other research areas. Here, we show that the Antarctic coastal margin is generally straighter than the coastlines of typical terrestrial continents; this is likely due to the generally uniform mechanical strength of the ice compared to the mixed lithology and resultant higher complexity promoted by erosive processes of terrestrial landforms. Another key finding is that, based on the multiscale complexity characterization, the Antarctic coastal margin can be classified into three main groups: these are (i) low complexity, (ii) complex at short length scales, and complex at long length scales. While the Antarctic coastline is largely found to be spatially heterogeneous in its physical complexity, there are dominant groups along certain individual stretches. This study has also, for the first time, quantified and characterized specific Antarctic coastal features such as bays and peninsulas and their orientation at various length scales. Another key finding is that the aspect (orientation) of bay and peninsula features is different for western and eastern Antarctica.

Given the temporally variable nature of ice and the question of how frequently the complexity of the Antarctic coastline should be recalculated, most major change in margins happens with ice shelf advance or retreat (i.e. calving and ice front advance). Of these processes, retreat has by far a shorter timescale. Thus, one could argue that a re-assessment should happen in conjunction with major calving – but such events tend to be regionally limited (e.g. the calving of the Amery Ice Shelf in 2020). Ice shelf collapse (e.g. Wilkins in 2008/09) is a little more dramatic but is still geographically limited. Thereby, such re-evaluations are not needed frequently unless there is major change. Runaway grounding line retreat leading to major coastal margins changes might be sufficient grounds for re-evaluation, but this has not happened yet. Significance of changes could be assessed using standard change detection metrics (e.g. estimating the distribution of the current coastline features and see if the new coastline complexity falls outside of this distribution), thus justifying another evaluation.

Our complexity definition methodology provides a quantitative, repeatable approach to analysing coastline features and could be readily applied to other coastlines both in terrestrial and polar regions. This unique dataset and its analysis presented here also have numerous applications for both geophysical and biological studies and will contribute to Antarctic research requiring quantitative information on (and related to) coastal complexity and configuration. For instance, and in the crucially important field of modelling, a measure of coastal complexity provides a “roughness” boundary, thereby providing a parameterization that is currently missing, e.g. towards more accurate dynamic sea ice models. Similarly, and for general ocean fetch (wave) models, the characterization of coastline complexity magnitude, feature type (embayment or promontory), and their aspect could also feed into exposure models for the study of wave–ice shelf interaction, wave exposure, and high- and low-energy habitat types.

RPS designed the methodology, compilation of data, and analysis with contributions from all co-authors. JM provided statistical guidance. RPS prepared the manuscript with contributions from all co-authors.

The authors declare that they have no conflict of interest.

We would like to thank two reviewers for their highly constructive and insightful comments on an earlier version of this paper. We would like to acknowledge the National Snow and Ice Data Center (NSIDC) for their provision of MODIS Mosaic of Antarctica coastline dataset. This work was supported by the Australian Government's Cooperative Research Centre programme through the Antarctic Climate & Ecosystems Cooperative Research Centre and the Australian Research Council's Special Research Initiative for Antarctic Gateway Partnership.

This research has been supported by the Australian Government (Antarctic Climate & Ecosystems Cooperative Research Centre) and the Australian Research Council (grant no. SR140300001).

This paper was edited by Kirsten Elger and reviewed by Ted Scambos and one anonymous referee.

Anderson, R. S., Molnar, P., and Kessler, M. A.: Features of glacial valley profiles simply explained, J. Geophys. Res.-Earth Surf., 111, F01004, https://doi.org/10.1029/2005jF000344, 2006.

Andrle, R.: The Angle Measure Technique – a New Method for Characterizing the Complexity of Geomorphic Lines, Math. Geol., 26, 83–97, https://doi.org/10.1007/Bf02065877, 1994.

Andrle, R.: Complexity and scale in geomorphology: Statistical self-similarity vs characteristic scales, Math. Geol., 28, 275–293, https://doi.org/10.1007/Bf02083201, 1996a.

Andrle, R.: The west coast of Britain: Statistical self-similarity vs characteristic scales in the landscape, Earth Surf. Proc. Land., 21, 955–962, https://doi.org/10.1002/(SICI)1096-9837(199610)21:10<955::AID-ESP639>3.0.CO;2-Y, 1996b.

Arrigo, K. R. and van Dijken, G. L.: Phytoplankton dynamics within 37 Antarctic coastal polynya systems, J. Geophys. Res., 108, 3271, https://doi.org/10.1029/2002jc001739, 2003.

Bartley, J. D., Buddemeier, R. W., and Bennett, D. A.: Coastline complexity: a parameter for functional classification of coastal environments, J. Sea Res., 46, 87–97, https://doi.org/10.1016/S1385-1101(01)00073-9, 2001.

Breiman, L., Friedman, J., Olshen, R., and Stone, C.: Classification and decision trees, Wadsworth, Belmont, 378, 1984.

Charrad, M., Ghazzali, N., Boiteau, V., and Niknafs, A.: Package `NbClust', J. Stat. Softw., 61, 1–36, https://doi.org/10.18637/jss.v061.i06, 2014.

Drewry, D. J., Jordan, S. R., and Jankowski, E.: Measured properties of the Antarctic ice sheet: surface configuration, ice thickness, volume and bedrock characteristics, Ann. Glaciol., 3, 83–91, https://doi.org/10.3189/S0260305500002573, 1982.

ESRI: ARC/INFO Unix Version 7, Esri Inc, Redlands, California, available at: https://support.esri.com/en/products/legacy-products/legacy-products/arcinfo-workstation/10 (last access: 8 June 2021), 1996.

Fraser, A. D., Massom, R. A., Michael, K. J., Galton-Fenzi, B. K., and Lieser, J. L.: East Antarctic landfast sea ice distribution and variability, 2000–08, J. Climate, 25, 1137–1156, https://doi.org/10.1175/JCLI-D-10-05032.1, 2012.

Fraser, A. D., Massom, R. A., Ohshima, K. I., Willmes, S., Kappes, P. J., Cartwright, J., and Porter-Smith, R.: High-resolution mapping of circum-Antarctic landfast sea ice distribution, 2000–2018, Earth Syst. Sci. Data, 12, 2987–2999, https://doi.org/10.5194/essd-12-2987-2020, 2020.

Giles, A. B., Massom, R. A., and Lytle, V. I.: Fast‐ice distribution in East Antarctica during 1997 and 1999 determined using RADARSAT data, J. Geophys. Res.-Oceans, 113, C02S14, https://doi.org/10.1029/2007JC004139, 2008.

Goodchild, M. F.: Spatial autocorrelation, Geo Books, Norwich, 1986.

Goodchild, M. F. and Mark, D. M.: The fractal nature of geographic phenomena, Ann. Assoc. Am. Geogr., 77, https://doi.org/10.1111/j.1467-8306.1987.tb00158.x, 1987.

Haran, T., Bohlander, J., Scambos, T., Painter, T., and Fahnestock, M.: MODIS Mosaic of Antarctica 2008-2009 (MOA2009) Image Map, Version 1, NSIDC: National Snow and Ice Data Center, Boulder, Colorado, USA, available at: https://nsidc.org/data/NSIDC-0593/versions/1 (last access: 8 June 2021), 2014.

Hartigan, J. A. and Wong, M. A.: Algorithm AS 136: A k-means clustering algorithm, J. R. Stat. Soc. C-Appl., 28, 100–108, https://doi.org/10.2307/2346830, 1979.

Hastie, T., Tibshirani, R., and Friedman, J.: The elements of statistical learning, Springer Series in Statistics, Springer New York Inc, New York, NY, USA, https://web.stanford.edu/~hastie/Papers/ESLII.pdf (last access: 8 June 2021), 2001.

Hibler, W. D.: A dynamic thermodynamic sea ice model, J. Phys. Oceanogr., 9, 815–846, https://doi.org/10.1175/1520-0485(1979)009<0815:ADTSIM>2.0.CO;2, 1979.

Hill, N. A., Pepper, A. R., Puotinen, M. L., Hughes, M. G., Edgar, G. J., Barrett, N. S., Stuart-Smith, R. D., and Leaper, R.: Quantifying wave exposure in shallow temperate reef systems: applicability of fetch models for predicting algal biodiversity, Mar. Ecol. Prog. Ser., 417, 83–95, https://doi.org/10.3354/meps08815, 2010.

Ihaka, R. and Gentleman, R.: R: A Language for Data Analysis and Graphics, J. Comput. Graph. Stat., 5, 299–314, https://doi.org/10.1080/10618600.1996.10474713, 1996.

Jiang, J. W. and Plotnick, R. E.: Fractal analysis of the complexity of United States coastlines, Math. Geol., 30, 535–546, https://doi.org/10.1023/A:1021790111404, 1998.

Kaufman, L. and Rousseeuw, P. J.: Finding groups in data: an introduction to cluster analysis, Wiley Online Library, https://doi.org/10.1002/9780470316801, 1990.

Lam, N. S. and Quattrochi, A. A.: On the issue of scale, resolution, and fractal analysis in the mapping sciences, The Professional Geographer, 44, 88–98, https://doi.org/10.1111/j.0033-0124.1992.00088.x, 1992.

Maechler, M., Rousseeuw, P., Struyf, A., Hubert, M., and Hornik, K.: Cluster: cluster analysis basics and extensions, in: R package version 2.0.7-1, available at: https://CRAN.R-project.org/package=cluster (last access: 8 June 2021), 2018.

Manson, G. K., Solomon, S. M., Forbes, D. L., Atkinson, D. E., and Craymer, M.: Spatial variability of factors influencing coastal change in the western Canadian Arctic, Geo-Mar. Lett., 25, 138–145, https://doi.org/10.1007/s00367-004-0195-9, 2005.

Massom, R. A., Harris, P. T., Michael, K. J., and Potter, M. J.: The distribution and formative processes of latent-heat polynyas in East Antarctica, Ann. Glaciol., 27, 420–426, https://doi.org/10.3189/1998AoG27-1-420-426, 1998.

Massom, R. A., Hill, K. L., Lytle, V. I., Worby, A. P., Paget, M. J., and Allison, I.: Effects of regional fast-ice and iceberg distributions on the behaviour of the Mertz Glacier polynya, East Antarctica, Ann. Glaciol., 33, 391–398, https://doi.org/10.3189/172756401781818518, 2001.

Massom, R. A., Hill, K., Barbraud, C., Adams, N., Ancel, A., Emmerson, L., and Pook, M. J.: Fast ice distribution in Adélie Land, East Antarctica: interannual variability and implications for emperor penguins Aptenodytes forsteri, Mar. Ecol. Prog. Ser., 374, 243–257, https://doi.org/10.3354/meps07734, 2009.

Massom, R. A., Giles, A. B., Fricker, H. A., Warner, R. C., Legrésy, B., Hyland, G., Young, N., and Fraser, A. D.: Examining the interaction between multi‐year landfast sea ice and the Mertz Glacier Tongue, East Antarctica: Another factor in ice sheet stability?, J. Geophys. Res.-Oceans, 115, https://doi.org/10.1029/2009JC006083, 2010.

Massom, R. A., Scambos, T. A., Bennetts, L. G., Reid, P., Squire, V. A., and Stammerjohn, S. E.: Antarctic ice shelf disintegration triggered by sea ice loss and ocean swell, Nature, 558, 383–389, https://doi.org/10.1038/s41586-018-0212-1, 2018.

Milligan, G. W. and Cooper, M. C.: An examination of procedures for determining the number of clusters in a data set, Psychometrika, 50, 159–179, https://doi.org/10.1007/Bf02294245, 1985.

Mohajer, M., Englmeier, K.-H., and Schmid, V. J.: A comparison of Gap statistic definitions with and without logarithm function, arXiv [preprint] arXiv:1103.4767, 2011.

Porter-Smith, R.: Bathymetry of the George Vth Land shelf and slope, Deep-Sea Res. Pt. II, 50, 1337–1341, https://doi.org/10.1016/S0967-0645(03)00069-9, 2003.

Porter-Smith, R.: Coastal Complexity (*C*_{x}) calculation scripts (Version V1) [code], Zenodo, https://doi.org/10.5281/zenodo.5044565, 2021.

Porter-Smith, R. and McKinlay, J.: Mesoscale coastal complexity and its relationship to structure and forcing from marine processes, Mar. Geol., 323–325, 1–13, https://doi.org/10.1016/j.margeo.2012.07.011, 2012.

Porter-Smith, R., McKinlay, J., Fraser, A. D., and Massom, R. A.: Coastal complexity of the Antarctic continent, Australian Antarctic Data Centre, Australia, https://doi.org/10.26179/5d1af0ba45c03, 2019.

Quantum GIS Development Team: QGIS Geographic Information System, Open Source Geospatial Foundation Project, available at: http://qgis.osgeo.org (last access: 8 June 2021), 2014.

R Core Team: R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria, available at: http://www.R-project.org (last access: 8 June 2021), 2014.

Reid, P. and Massom, R.: Coastal exposure index of sea ice in Antarctica, Australian Antarctic Data Centre, https://doi.org/10.4225/15/57A13939A8312, 2016 (updated 2019).

Ringrose, P. S.: Structural and Lithological Controls on Coastline Profiles in Fife, Eastern Britain, Terr. Nova, 6, 251–254, https://doi.org/10.1111/j.1365-3121.1994.tb00492.x, 1994

Rintoul, S. R.: On the origin and influence of Adélie Land Bottom Water, Ocean, Ice, and Atmosphere: Interactions at the Antarctic Continental Margin, 75, 151–171, https://doi.org/10.1029/AR075p0151, 1985.

Scambos, T. A., Haran, T. M., Fahnestock, M. A., Painter, T. H., and Bohlander, J.: MODIS-based Mosaic of Antarctica (MOA) data sets: Continent-wide surface morphology and snow grain size, Remote Sens. Environ., 111, 242–257, https://doi.org/10.1016/j.rse.2006.12.020, 2007.

Stål, T., Reading, A. M., Halpin, J. A., and Whittaker, J. M.: A multivariate approach for mapping lithospheric domain boundaries in East Antarctica, Geophys. Res. Lett., 46, 10404–10416, https://doi.org/10.1029/2019gl083453, 2019.

Tibshirani, R., Walther, G., and Hastie, T.: Estimating the number of clusters in a data set via the gap statistic, J. R. Stat. Soc. B, 63, 411–423, https://doi.org/10.1111/1467-9868.00293, 2001.

Tynan, C. T., Ainley, D. G., and Stirling, I.: Sea ice: a critical habitat for polar marine mammals and birds, in: Sea Ice, 2nd edn., edited by: Thomas, D. N. and Dieckmann, G. S., Blackwell Publishing Ltd, https://doi.org/10.1002/9781444317145.ch11, 2010.

signaturesaround the Antarctic outer coastal margin, giving a multiscale estimate of the magnitude and direction of undulation or complexity at each point location along the entire coastline. It has numerous applications for both geophysical and biological studies and will contribute to Antarctic research requiring quantitative information about this important interface.

signaturesaround the Antarctic outer...