Preprints
https://doi.org/10.5194/essd-2026-64
https://doi.org/10.5194/essd-2026-64
22 Apr 2026
 | 22 Apr 2026
Status: this preprint is currently under review for the journal ESSD.

HyBEAR: A Hyperspectral Benchmark for Bare Soil Detection

Agata M. Wijata, Bogdan Ruszczak, Adriana Niepala, Michal Gumiela, Krzysztof Smykała, Nicolas Longépé, and Jakub Nalepa

Abstract. Detecting bare soil areas is an important step in the analysis of Earth observation data in a variety of Precision Agriculture (PA) applications focused on quantifying soil properties and assessing soil quality. In this paper, we introduce the HyBEAR benchmark – a novel large-scale collection of high-resolution hyperspectral aerial images (with 2 m ground sampling distance) accompanied with manual bare soil annotations verified with domain experts. Usually, the bare soil detection problem is tackled at the pixel level, meaning that detection methods classify all pixels as either bare soil or background. In contrast to this approach, we provide pixel-level annotations for the entire agricultural parcels (if the parcel is labeled as bare soil, then all pixels within that parcel are labeled accordingly), and aim to support the development of methods that identify entire fields with no vegetation. Commonly, such fields undergo further analysis to determine specific soil parameters and characteristics that are important while planning various PA activities, such as fertilization. The HyBEAR? benchmark includes (i) the largest-to-date (108,064,591 pixels, corresponding to 43,225 hectares) and most heterogeneous dataset for bare soil detection, as well as (ii) the validation procedure (training-test splits and quality metrics) and a set of baseline results, obtained for a set of machine learning bare soil detection models. From the FULL collection of 1954 images in HyBEAR, which we divided into 5 spatially-disjoint folds, we additionally selected a random, stratified subset (MINI) of the images which may be useful for designing and verifying bare soil detection algorithms. Overall, HyBEAR is a step toward standardizing the way the community builds and confronts bare soil detection algorithms in a thorough, reproducible, and unbiased way.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Agata M. Wijata, Bogdan Ruszczak, Adriana Niepala, Michal Gumiela, Krzysztof Smykała, Nicolas Longépé, and Jakub Nalepa

Status: open (until 29 May 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Agata M. Wijata, Bogdan Ruszczak, Adriana Niepala, Michal Gumiela, Krzysztof Smykała, Nicolas Longépé, and Jakub Nalepa

Data sets

HyBEAR ? A. Wijata et al. https://zenodo.org/records/17607898

Model code and software

HyBEAR ? A. Wijata et al. https://zenodo.org/records/17607898

Interactive computing environment

HyBEAR ? A. Wijata et al. https://zenodo.org/records/17607898

Agata M. Wijata, Bogdan Ruszczak, Adriana Niepala, Michal Gumiela, Krzysztof Smykała, Nicolas Longépé, and Jakub Nalepa
Metrics will be available soon.
Latest update: 22 Apr 2026
Download
Short summary
The bare soil areas detection is an important step for Earth observation and Precision Agriculture applications. The HyBEAR is a novel large-scale dataset of high-res hyperspectral images, with 2 m ground sampling distance, and manual bare soil annotations verified by experts. It includes the largest-to-date dataset (108 mln pixels), the validation procedure, and initial results. The entire parcel's pixel-level labels enable developing the methods for entire fields with no vegetation detection.
Share
Altmetrics