2026-06-01
Data-driven sampling strategies for fine-tuning bird detection models
Publication
Publication
The Journal of the Acoustical Society of America , Volume 159 - Issue 6 p. 4891- 4903
Passive acoustic monitoring (PAM) has emerged as a promising tool for collecting ecological data, particularly in the context of bird population monitoring. Bird species can be automatically identified using pre-trained models, such as BirdNET. The performance of these models can be significantly improved through fine-tuning with annotated samples recorded in the specific acoustic conditions in which the microphones are deployed. However, passive acoustic monitoring (PAM) collects vast amounts of data, and annotating bird vocalizations requires specialized expertise. As a result, only a very small portion of the recordings can be effectively labeled. Selecting the most relevant samples to annotate in order to maximize performance in model fine-tuning remains a significant challenge. First, a regularization technique addresses the challenge of class imbalance during model fine-tuning. Next, a data-driven methodology is developed, introducing the influence score, which quantifies the impact of individual training samples on model performance to inform sampling strategies. A linear model is proposed to estimate the influence score for generalization to unseen data. Finally, several sampling strategies are compared, based on acoustic indices and predictions of the pre-trained model. Together, these contributions enable the identification of efficient annotation strategies to overcome the challenges of limited annotation resources in large-scale PAM.
| Additional Metadata | |
|---|---|
| doi.org/10.1121/10.0043947 | |
| The Journal of the Acoustical Society of America | |
| Organisation | Staff publications |
|
Bernard, Corentin, McEwen, Ben, Cretois, Benjamin, Glotin, Hervé, Stowell, D.& Marxer, Ricard. (2026). Data-driven sampling strategies for fine-tuning bird detection models. The Journal of the Acoustical Society of America, 159(6), 4891–4903.https://doi.org/10.1121/10.0043947 |
|