<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="data-paper" specific-use="SMUR" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">ESSDD</journal-id>
<journal-title-group>
<journal-title>Earth System Science Data Discussions</journal-title>
<abbrev-journal-title abbrev-type="publisher">ESSDD</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Earth Syst. Sci. Data Discuss.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1866-3591</issn>
<publisher><publisher-name></publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/essd-2026-64</article-id>
<title-group>
<article-title>HyBEAR: A Hyperspectral Benchmark for Bare Soil Detection</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Wijata</surname>
<given-names>Agata M.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff6">
<sup>6</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ruszczak</surname>
<given-names>Bogdan</given-names>
<ext-link>https://orcid.org/0000-0003-1089-1778</ext-link>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<xref ref-type="aff" rid="aff6">
<sup>6</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Niepala</surname>
<given-names>Adriana</given-names>
<ext-link>https://orcid.org/0009-0005-8274-9852</ext-link>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Gumiela</surname>
<given-names>Michal</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Smykała</surname>
<given-names>Krzysztof</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Longépé</surname>
<given-names>Nicolas</given-names>
</name>
<xref ref-type="aff" rid="aff5">
<sup>5</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Nalepa</surname>
<given-names>Jakub</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Silesian University of Technology, Akademicka 2A, 44-100 Gliwice, Poland</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>KP Labs, Bojkowska 37J, 44-100 Gliwice, Poland</addr-line>
</aff>
<aff id="aff3">
<label>3</label>
<addr-line>Opole University of Technology, Prószkowska 76, 45-758 Opole, Poland</addr-line>
</aff>
<aff id="aff4">
<label>4</label>
<addr-line>QZ Solutions Sp. z o.o., Technologiczna 2, 45-839 Opole, Poland</addr-line>
</aff>
<aff id="aff5">
<label>5</label>
<addr-line>Φ-Lab, European Space Agency, Largo Galileo Galilei 1, 00044 Frascati, Italy</addr-line>
</aff>
<aff id="aff6">
<label>6</label>
<addr-line>These authors contributed equally to this work.</addr-line>
</aff>
<pub-date pub-type="epub">
<day>22</day>
<month>04</month>
<year>2026</year>
</pub-date>
<volume>2026</volume>
<fpage>1</fpage>
<lpage>26</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2026 Agata M. Wijata et al.</copyright-statement>
<copyright-year>2026</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://essd.copernicus.org/preprints/essd-2026-64/">This article is available from https://essd.copernicus.org/preprints/essd-2026-64/</self-uri>
<self-uri xlink:href="https://essd.copernicus.org/preprints/essd-2026-64/essd-2026-64.pdf">The full text article is available as a PDF file from https://essd.copernicus.org/preprints/essd-2026-64/essd-2026-64.pdf</self-uri>
<abstract>
<p>Detecting bare soil areas is an important step in the analysis of Earth observation data in a variety of Precision Agriculture (PA) applications focused on quantifying soil properties and assessing soil quality. In this paper, we introduce the HyBEAR benchmark &amp;ndash; a novel large-scale collection of high-resolution hyperspectral aerial images (with 2 m ground sampling distance) accompanied with manual bare soil annotations verified with domain experts. Usually, the bare soil detection problem is tackled at the pixel level, meaning that detection methods classify all pixels as either bare soil or background. In contrast to this approach, we provide pixel-level annotations for the entire agricultural parcels (if the parcel is labeled as bare soil, then all pixels within that parcel are labeled accordingly), and aim to support the development of methods that identify entire fields with no vegetation. Commonly, such fields undergo further analysis to determine specific soil parameters and characteristics that are important while planning various PA activities, such as fertilization. The HyBEAR🐻 benchmark includes (&lt;em&gt;i&lt;/em&gt;) the largest-to-date (108,064,591 pixels, corresponding to 43,225 hectares) and most heterogeneous dataset for bare soil detection, as well as (&lt;em&gt;ii&lt;/em&gt;) the validation procedure (training-test splits and quality metrics) and a set of baseline results, obtained for a set of machine learning bare soil detection models. From the FULL collection of 1954 images in HyBEAR, which we divided into 5 spatially-disjoint folds, we additionally selected a random, stratified subset (MINI) of the images which may be useful for designing and verifying bare soil detection algorithms. Overall, HyBEAR is a step toward standardizing the way the community builds and confronts bare soil detection algorithms in a thorough, reproducible, and unbiased way.</p>
</abstract>
<counts><page-count count="26"/></counts>
<funding-group>
<award-group id="gs1">
<funding-source>European Regional Development Fund</funding-source>
<award-id>FESL.10.25-IZ.01-07G5/23</award-id>
</award-group>
<award-group id="gs2">
<funding-source>European Regional Development Fund</funding-source>
<award-id>POIR.01.01.01-00-0287/21</award-id>
</award-group>
<award-group id="gs3">
<funding-source>Silesian University of Technology</funding-source>
<award-id>02/080/RGJ25/0052</award-id>
</award-group>
<award-group id="gs4">
<funding-source>Politechnika Opolska</funding-source>
<award-id>314/25</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body/>
<back>
</back>
</article>