the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
LakeBeD-US: a benchmark dataset for lake water quality time series and vertical profiles
Abstract. Water quality in lakes is an emergent property of complex biotic and abiotic processes that differ across spatial and temporal scales. Water quality is also a determinant of ecosystem services that lakes provide, and thus is of great interest to ecologists. Increasingly, machine learning and other computer science techniques are being used to predict water quality dynamics as well as to gain a greater understanding of water quality patterns and controls. To benefit both the sciences of ecology and computer science, we have created a benchmark dataset of lake water quality time series and vertical profiles. LakeBeD-US contains over 500 million unique observations of lake water quality collected by multiple long-term monitoring organizations across 17 water quality variables in 21 lakes in the United States. There are two published versions of LakeBeD-US: an "Ecology Edition" published in the Environmental Data Initiative repository, and a "Computer Science Edition" published in the Hugging Face repository. Each edition is formatted in a manner conducive to inquiries and analyses specific to each domain. For ecologists, LakeBeD-US provides an opportunity to study the spatial and temporal dynamics of several lakes with varying water quality, ecosystem, and landscape characteristics. For computer scientists, LakeBeD-US acts as a benchmark dataset that enables the advancement of machine learning for water quality prediction.
- Preprint
(1698 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 14 Mar 2025)
Data sets
LakeBeD-US: Ecology Edition - a benchmark dataset of lake water quality time series and vertical profiles Bennett J. McAfee, Mary E. Lofton, Adrienne Breef-Pilz, Keli J. Goodman, Robert T. Hensley, Kathryn K. Hoffman, Dexter W. Howard, Abigail S. L. Lewis, Diane M. McKnight, Isabella A. Oleksy, Heather L. Wander, Cayelan C. Carey, Anuj Karpatne, and Paul C. Hanson https://doi.org/10.6073/pasta/c56a204a65483790f6277de4896d7140
LakeBeD-US: Computer Science Edition - a benchmark dataset for lake water quality time series and vertical profiles Aanish Pradhan, Bennett McAfee, Abhilash Neog, Sepideh Fatemi, Mary E. Lofton, Cayelan C. Carey, Anuj Karpatne, and Paul C. Hanson https://doi.org/10.57967/hf/3771
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
82 | 18 | 2 | 102 | 3 | 1 |
- HTML: 82
- PDF: 18
- XML: 2
- Total: 102
- BibTeX: 3
- EndNote: 1
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1