The MArine Debris hyperspectral reference Library collection (MADLib)
Abstract. Marine debris is a ubiquitous and growing threat to environmental and human health. Efforts to monitor and mitigate marine debris pollution face many challenges. A primary limitation is the absence of standardized methodologies for monitoring capabilities due to the complex and diverse physical and chemical properties of marine debris. Variabilities include object size, apparent color, polymer type, weathering, and aqueous state. Despite the challenges in object characteristics, advances in remote sensing methods are showing promise for detecting marine debris across local to global scales. Algorithms are needed to link remotely sensed observations with relevant characteristics of marine debris to fully realize this potential. Although more optical measurements of marine debris reflectance are becoming available for algorithm development, inconsistencies in data curation remains an obstacle. Variations in data processing and inconsistent metadata hinder efforts to develop robust, generalizable algorithms for marine debris detection. To address this, we present the well-curated MArine Debris hyperspectral reference Library collection (MADLib) containing 24889 spectra from 3032 samples. All optical measurements are available in open access via https://doi.org/10.4121/059551d3-2383-4e20-af2d-011c9a59d3ac (Ohall et al., 2025). MADLib demonstrates the importance of open-science and open-access datasets, as it compiles and harmonizes spectral data collected from publicly accessible datasets and individual research projects. Consistent methods were applied for data standardization, quality assurance, and integration. We also propose a robust protocol for generating metadata tailored to marine debris and ocean color remote sensing applications. MADLib possesses spectra of a wide range of marine debris materials including different polymer types, color, size, weathering, and aqueous states. Here, we analyze the metadata associated with the spectra to identify sampling gaps and propose considerations for future work. By providing open-access and standardized data, MADLib is expected to support the development of robust marine debris detection algorithms.