Comment on essd-2021-141

Response: We would like to thank the two anonymous referees for their kind and thorough comments on and suggestions for this manuscript. We appreciate that their constructive feedback has helped to improve the quality and clarity of the paper and its representation of the methane flux datasets we present. Our specific responses to each referee comment are presented below. A supplemental document with copies of updated figures is also attached.

where the delineations were.
Response: Here we wanted to highlight that acidic classes are on the very left, slightly acidic is on the transition from first to second box, neutral is the transition from second to third box and alkaline is on the very right-hand side. For clarity, we updated and simplified the figure to have acidic under the left most box, slightly acidic/neutral under the middle box, and Alkaline under the right most box.

RC1:
Line 231-232: This is a little confusing because first you say they have the driest soils, and then say their soils are moist to wet.
Response:We changed the sentence to the following to help clarify that permafrost bogs are the driest of the wetland classes, but still have relatively moist conditions compared to dry tundra and upland forests etc.
Response:"Excess ground-ice and ice expansion often elevate Permafrost Bogs up to a few meters above their surroundings, and as such, they are ombrotrophic and relatively well drained (Fig. 3). Permafrost Bogs have moist to wet soil conditions, often with a water table that follows the base of the seasonally developing a thawed soil layer." RC1: Line 251: Are rocklands a relatively small area compared with dry tundra? If so, it is logical to lump them together and assume their fluxes are similar. However, if rocklands are relatively large compared with dry tundra, you should note that lumping these two together could introduce significant uncertainty since rockland fluxes are unknown.

Response:
We do not include Rockland area in our Dry Tundra flux estimates/simply scaling exercise. We also updated the text to clarify that there are no explicitly Rockland flux measurements in the dataset. There are five sites that are described as polar desert and also as high tundra, thus we included those sites with the Dry Tundra sites.
Response: "No sites included in the database were described as Rocklands. There are five sites described as high polar desert or desert tundra, which were included as Dry Tundra sites." RC1: Section 265: You do not mention what maps you use to determine whether lakes exist in glacial, yedoma, or peatland environments. Is this information reported in the literature for every site with methane measurements? If not, how do you determine this information for each lake?
Response:Lake type determinations are based on information reported by the authors or in papers cited by the authors related to the study. In a handful of cases when the authors do not describe the lakes/lake sediments, we used the lake locations and yedoma/histel/histisol maps to determine the lake type. In a handful of cases, we could not determine the lake class type and left this field empty as an unknown. We have added this information to the text accordingly.

RC1:
Line 386: Please add a period to end of sentence.
Response:We added the constant as it allows for uptake fluxes to be included in the models. Model residuals did not change much (<5%) when comparing models with and without this constant. This is also a standard practice for methane models, as seen in Response:We added flux after CH4 on this line.

RC1:
Line 455: It's a stretch to say these are evenly distributed. Sites are heavily concentrated in Alaska and the Quebec/Ontario border area, although you do have sites sprinkled in other places. I suggest changing the wording here so you do not use the word "evenly".

Response:
We have changed the sentence to the following: "Aquatic sites were distributed throughout the Boreal-Arctic region with a greater density of sites in Alaska and eastern Canada (Fig. 1b)." RC1: Line 488: I suggest getting rid of commas to make it easier to read: "No other continuous variables were correlated with CH4 uptake; however, sites where shrubs were present had significantly higher…" Response:We made the suggested changes for this sentence. Figure 10: Please increase the contrast between the circle colors by making the ebullition circles darker (as in the color scheme for Figure 11).

Response:
We changed the color of the ebullition circles to a darker shade of blue for better contrast. Table 4: I would write out "Lakes -Diffusion" and "Lakes -Ebullition" in the top two rows, for clarity (assuming that is what D and E stand for?).

RC1:
Response:We replaced D and E with Diffusion and Ebullition, respectively.

RC1:
Line 634-635: Sometimes you use "water body" and sometimes "waterbody". Please choose one and make it consistent throughout the paper.

Response:
We have changed all of the terms to "waterbody" or "waterbodies" for consistency.

RC1:
Line 642-644: This sentence is confusing to me. Shouldn't having a broad range of depths make you MORE likely to see a depth effect than if you had a narrow range of depths? What makes your synthesis different from others such that you do not see these relationships? I also do not understand the second half of this sentence "…because it is likely that the temperature and depth influence is clearer over time and space, respectively, in each specific system." Do you mean that you do not have measurements across a range of temperatures (and spatial locations) within each given system? I assume this is the case, but then how did previous synthesis studies see these relationships?

Response:
We changed the text of this section to the following to help clarify: Response:"The best model for ebullition contained waterbody surface area as a predictor and explained 21% of the variation in the fluxes. Previous synthesis efforts have linked ebullition fluxes to both temperature (Aben et al. 2017) and waterbody depth (Wik et al. 2016a). There are a few potential explanations as to why we did not find similar relationships between ebullition and temperature or waterbody depth. First, Aben et al. include global data that encompass sites across broad temperature ranges from the north to the tropics (2017). It is possible that the range of temperatures represented by our dataset is not wide enough to capture this relationship. It is also possible that the summary data collected, including average temperature and average flux over the ice-free season, are too coarse to show a relationship. It is likely that temperature and also depth influence is clearer over time and space in each respective waterbody and that a higher resolution of data would show these relationships. Regarding waterbody depth, it is also possible that in the absence of detailed surveys, estimated mean and max depths may be less reliable. It is also possible the effects of depth are confounded with surface area as the two metrics are highly correlated (SI Fig. 5). While this dataset represents one of the largest collections of ebullitive emissions from northern lakes so far, this emission pathway is still largely underrepresented and waterbody depth and temperature are not always reported with the flux estimates. Furthermore, we collected information on surface water temperature for this dataset because it was the most widely available temperature metric. Sediment temperature is a better metric to collect in hand with ebullition due to production and transport directly from the sediments (Wik et al. 2013;Aben et al. 2017). Future studies should work to report sediment temperature and water column temperature alongside their flux measurements." RC1: Figure 12: Having the bar graphs be a single shade of green is mildly confusing since shades of green also represent Relative Land Cover. I would suggest either making your bar graph colors match the legend based on their relative land cover or making them all a non-green color.

Response:
We changed the colors of the bar graph to grey to reduce confusion with the relative land cover color.

RC1:
Line 680 -Do you mean Midsize Peatland lakes here instead of Midsize Glacial lakes? According to Figure 13, Midsize Glacial lakes are well represented compared to their relative flux, whereas midsize peatland lakes are not.
Response:Here we meant to say that Midsize Glacial Lake and Large Lake fluxes are not well represented across the Canadian Shield, despite their abundance. We have edited to text to reflect this. Figure 14: As with Figure 12, it is confusing here to have shades of blue mean two different things: either ebullition/diffusion, or relative land cover. I would suggest using a different color scheme for one of these.

Response:
We changed the colors of the bar graph to grey and black for diffusion and ebullition, respectively, to reduce confusion with the relative land cover color.

RC1:
Line 696-670: It is my understanding from the Wik et al. 2016 study that not only do you need 11/39 days for diffusive/ebullitive measurements (respectively), you also need these measurements in 3/11 depth-stratified locations (respectively). Please clarify this in your paper.
Response:We added the following sentence to include the depth-stratified spatial sampling suggestions from Wik et al.
Response:"Further, Wik et al. recommend that in addition to the number of sampling days, flux measurements should be distributed spatially across the waterbody using a depth-stratified approach included ~3 and ~11 locations for diffusion and ebullition, respectively (2016b). While we did not collect data on the number of sampling locations across each waterbody, it is likely that many of the average fluxes included the dataset also represent spatially under-sampled measurements." RC1: Line 715: I do not understand what you mean by "split CH4-emitting ecosystem characteristics" here.

Response:
We have changed the sentence to "split by CH 4 -emitting ecosystem characteristics" RC1:Line 718: Is it fair to say that MAAT has important implications for future scaling efforts if MAAT only accounts for 3% of the variability (per Line 615)? How important could MAAT be if it accounts for such a small (even if statistically significant) amount of the variability?
Response: While 3% looks like a relatively small amount of variation, and the majority of the variability is due to classes, the differences between classes can be large (from almost 0 to the ~150 mg CH4 m-1 d-1). So then even a minor influence of temperature suggest that individual classes might be relatively sensitive to temperature. For example, increasing MAAT from 10C to 15C for a small peatland lake increases the modeled flux from 35 mg CH4 m-1 d-1 to 50 mg CH4 m-1 d-1.

General Comments (RC2):
RC2: This paper describes data that combines terrestrial and aquatic CH 4 flux measurements and associated supporting information for northern boreal-arctic biomes. As the authors clearly state, methane-producing habitats are particularly abundant in this part of the world and current estimates of boreal and Arctic emissions are highly uncertain. This uncertainty is not unique to latitudes above 50â°, but (not mentioned by the authors) it is arguably particularly important in this case given predictions and early evidence of increasing rates of emissions associated with rapid climate change.

RC2:
A key 2-part feature of this paper is (a) its construction in tandem with a land cover data set and (b) the argument supported by preliminary analyses that land cover classes can be identified/assigned based on their CH 4 emission behaviors. The case for this argument is compelling for the boreal and Arctic region. But even if a researcher is skeptical about this argument, this dataset includes both flux data and the methanerelevant land cover data, thus providing the raw material for hypothesis testing as well as for cross-system comparisons and upscaling studies. The combination of terrestrial and aquatic fluxes and support data in the BAWLD-CH 4 dataset is a significant expansion of prior and separate terrestrial and aquatic data compilations, and when paired with the land cover data (BAWLD spatial data) it creates a 'one-stop shopping' resource for researchers studying high latitude GHG dynamics. I appreciate the clear delineation of what data are/are not included and potential limitations of the dataset. The limits of the current state of the data serve as useful raw material for ending the discussion with a section on directions for future research. There is much to like about this paper; it provides the best-available dataset for those interested in high latitude methane dynamics, which is a topic of substantial and rapidly increasing scientific interest, the paper is very well written, and it is thorough in the presentation and explanation of dataset assembly.

Response:
We would like to thank Referee #2 for taking the time to provide thoughtful and constructive comments on this manuscript. The suggested changes and comments have helped to strengthen the manuscript. Our responses to the specific comments are detailed below.

RC2:
Most of my specific comments emphasize aquatic data issues due to my familiarity with these ecosystems. However, I have two general comments. First, following a preliminary reading of this manuscript, I mistakenly thought that this paper was discussing two complementary data sets: BAWLD and BAWLD-CH 4 . I made this assumption because it makes sense that these datasets would be presented together given their construction, and also because a fair amount of text in this paper is dedicated to describing aspects of BAWLD (the land cover classes). Presumably the land cover categories are described in the Arctic Data Center (ADC) data publication, so could this part of the paper be streamlined to put the emphasis back on the CH 4 flux data?
Response: We feel it is necessary to include the descriptions of the classes in this manuscript. While the descriptions section is lengthy, the definitions are important to explain why methane emissions are different and how we defined each class for the purpose of their mapping.

RC2:
Second, given that a strength of this dataset is being able to link it to the BAWLD spatial dataset, it was surprising and disappointing to see several rows with missing or highly aggregated information on site location, including data records from papers written by co-authors on this effort. Often, many sites are lumped together and assigned the same very coarse-level lat/lon (e.g., 64, -148) despite including different types of ecosystems. This is disappointing, as it restricts future analyses that could have been done if and when new and better spatial data become available. I understand that nothing can be done about some of these records, but this is not the case for all records with vague/absent data.

Response:
We understand and sympathize with the frustrations presented here about missing data and aggregated data. For consistency, we only include data that can be accessed in the published papers and accompanying supplemental datasets. We believe this allows the reader to easily go back to the original papers and connect with the data. Furthermore, due to the large volume of data already present in this dataset, the time constraints on reaching out to every author for more detailed datasets was beyond the scope of this project.

RC2:
Aquatic storage flux (lines 392-395)-Can you provide any more detail about determining storage fluxes? I think of this flux as being estimated by quantifying the mass of CH 4 under ice and then assuming that this mass (or some fraction of this mass) is lost to the atmosphere following ice out-because of mixing. Thus, I am not clear on how storage flux is being differentiated from spring mixing flux.

Response:
The storage flux represents emissions upon ice-out, including spring turnover. We have corrected the text-Response: "Storage/ice-out flux includes the annual release of CH4 that accumulates within and under the ice over the winter and is released upon ice-melt and during spring turnover. Methodologically, thisalso includes estimates from ice bubble surveys (IBS). Our storage flux estimate does not include estimates of fall circulation fluxes, wherein CH4 that is stored in the deep portion of the water column is released upon seasonal turnover of the water column (Karlsson et al. 2013;Sepulveda-Jauregui et al. 2015). RC2:Section 3.3 figure citations-do you mean Fig. 10 (not fig 2.10) in this section? And re: fig. 10, while the 2 different blues are easily distinguishable in Fig. 9, they are less so in this figure."

Response:
We fixed the citations in this section to all be Fig. 10. We also changed this figure to make the colors contrast better with a darker blue color for ebullition.