Machine learning for efficient segregation and labeling of potential biological sounds in long-term underwater recordings

Parcerisas, C.; Schall, E.; te Velde, K.; Botteldooren, D.; Devos, P.; Debusschere, E.

doi:10.3389/frsen.2024.1390687

Parcerisas, C.; Schall, E.; te Velde, K.; Botteldooren, D.; Devos, P.; Debusschere, E. (2024). Machine learning for efficient segregation and labeling of potential biological sounds in long-term underwater recordings. Front. Remote Sens. 5: 1390687. https://dx.doi.org/10.3389/frsen.2024.1390687

In: Frontiers in Remote Sensing. Frontiers Media S.A.: Lausanne. ISSN 2673-6187; e-ISSN 2673-6187

Is related to:

Calonge, A.; Develter, R.; Muñiz, C.; Parcerisas, C.; Reubens, J.; Boone, W.; Deneudt, K.; Debusschere, E. (2026). Scalable low-cost seabed landers: The missing link for sustained, integrated, long-term observations in dynamic shallow seas. Remote Sensing in Ecology and Conservation Online first: 1-9. https://dx.doi.org/10.1002/rse2.70072

Available in
VLIZ: Open access 400536

Keyword

Marine/Coastal

Author keywords

transfer learning, object detection, underwater sound, unknown sounds, clustering, soundscape

Project
PhD: Marine Soundscapes in Shallow Water: Automated Tools for Characterization and Analysis

Authors
Parcerisas, C.; Schall, E.; te Velde, K.; Botteldooren, D.; Devos, P.; Debusschere, E.

Abstract

Studying marine soundscapes by detecting known sound events and quantifying their spatio-temporal patterns can provide ecologically relevant information. However, the exploration of underwater sound data to find and identify possible sound events of interest can be highly time-intensive for human analysts. To speed up this process, we propose a novel methodology that first detects all the potentially relevant acoustic events and then clusters them in an unsupervised way prior to manual revision. We demonstrate its applicability on a short deployment. To detect acoustic events, a deep learning object detection algorithm from computer vision (YOLOv8) is re-trained to detect any (short) acoustic event. This is done by converting the audio to spectrograms using sliding windows longer than the expected sound events of interest. The model detects every event present in that window and provides its time and frequency limits. With this approach, multiple events happening simultaneously can be detected. To explore how far the human input needed to create the training annotations can be reduced, we propose an active learning approach that iteratively selects the most informative audio files for subsequent manual annotation. The obtained detection models are trained and tested on a dataset from the Belgian Part of the North Sea, and then further evaluated for robustness on a freshwater dataset from major European rivers. The proposed active learning approach outperforms the random selection of files in both the marine and the freshwater datasets. Once the events are detected, they are converted to an embedded feature space using the BioLingual model, which is trained to classify different (biological) sounds. The obtained representations are then clustered in an unsupervised way, obtaining different sound classes. These classes are then manually revised.
This method can be applied to unseen data as a tool to help bioacousticians identify recurrent sounds and save time when studying their spatio-temporal patterns. This reduces the time researchers need to go through long acoustic recordings and allows them to conduct a more targeted analysis. It also provides a framework to monitor soundscapes regardless of whether the sound sources are known or not.
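
As an illustration of the detection step described in the abstract, the following minimal Python sketch shows how a recording might be cut into overlapping sliding windows and how a bounding box predicted on one window's spectrogram image could be mapped back to time and frequency limits. It is not the authors' implementation. The function names, the 20 s window, the 50% overlap, the 12 kHz upper frequency, the linear frequency axis starting at 0 Hz, and the placement of the highest frequency at the top of the image are all assumptions made for the example. The box uses the normalized (x_center, y_center, width, height) convention common to YOLO-style detectors.

```python
from dataclasses import dataclass


@dataclass
class Event:
    """Time and frequency limits of one detected acoustic event."""
    start_s: float
    end_s: float
    low_hz: float
    high_hz: float


def window_starts(duration_s, window_s, overlap):
    """Return start times (s) of sliding windows that fit fully in the recording.

    Hypothetical helper: `overlap` is the fraction shared by consecutive windows.
    """
    if not 0.0 <= overlap < 1.0:
        raise ValueError("overlap must be in [0, 1)")
    step_s = window_s * (1.0 - overlap)
    starts = []
    i = 0
    while i * step_s + window_s <= duration_s:
        starts.append(i * step_s)
        i += 1
    return starts


def box_to_event(box, window_start_s, window_s, max_hz):
    """Map a normalized (x_center, y_center, width, height) box to an Event.

    Assumes the image x-axis spans the window duration and the y-axis spans
    0..max_hz linearly, with image row 0 (top) at the highest frequency.
    """
    xc, yc, w, h = box
    start_s = window_start_s + (xc - w / 2) * window_s
    end_s = window_start_s + (xc + w / 2) * window_s
    high_hz = (1.0 - (yc - h / 2)) * max_hz  # top edge of the box
    low_hz = (1.0 - (yc + h / 2)) * max_hz   # bottom edge of the box
    return Event(start_s, end_s, low_hz, high_hz)


# Example with made-up values: a 60 s recording, 20 s windows, 50% overlap.
starts = window_starts(duration_s=60.0, window_s=20.0, overlap=0.5)
event = box_to_event((0.5, 0.5, 0.25, 0.5), window_start_s=starts[1],
                     window_s=20.0, max_hz=12000.0)
print(starts)
print(event)
```

Because each box carries its own time and frequency limits, several boxes on the same window can describe events that overlap in time but occupy different frequency bands, which is how simultaneous events remain separable in this kind of scheme.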

Datasets (3)

Parcerisas, C.; Schall, E.; te Velde, K.; Botteldooren, D.; Devos, P.; Debusschere, E.; Flanders Marine Institute (VLIZ); Ghent University (UGent): Belgium; Alfred Wegener Institute for Polar and Marine Research (AWI): Germany; Leiden University: The Netherlands (2024): Yolov8 model weights to detect unknown underwater sounds. Marine Data Archive.

PhD_Parcerisas: Parcerisas Clea, Dick Botteldooren, Paul Devos, Debusschere Elisabeth, Flanders Marine Institute (VLIZ); 2021; Broadband Acoustic Network dataset

Parcerisas, C.; Schall, E.; Aubach, J.; Te Velde, K.; Slabbekoorn, H.; Debusschere, E.; Flanders Marine Institute (VLIZ); Alfred Wegener Institute; Leiden University (2024): Acoustic salient event annotations. Marine Data Archive.
