2017 25th European Signal Processing Conference (EUSIPCO) 2017
DOI: 10.23919/eusipco.2017.8081512
|View full text |Cite
|
Sign up to set email alerts
|

Two convolutional neural networks for bird detection in audio signals

Abstract: Abstract-We present and compare two approaches to detect the presence of bird calls in audio recordings using convolutional neural networks on mel spectrograms. In a signal processing challenge using environmental recordings from three very different sources, only two of them available for supervised training, we obtained an Area Under Curve (AUC) measure of 89% on the hidden test set, higher than any other contestant. By comparing multiple variations of our systems, we find that despite very different archite… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
60
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 69 publications
(61 citation statements)
references
References 4 publications
(9 reference statements)
0
60
0
1
Order By: Relevance
“…The precision and recall of CityBioNet was also compared to bulbul (Grill & Schlüter, ), an algorithm for detecting bird sounds in entire audio recordings in order to summarise avian acoustic activity which was the winning entry in the 2016–2017 Bird Audio Detection challenge (Stowell, Wood, Stylianou, & Glotin, ). Like CityNet, bulbul is a CNN‐based classifier which uses spectrograms as input.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The precision and recall of CityBioNet was also compared to bulbul (Grill & Schlüter, ), an algorithm for detecting bird sounds in entire audio recordings in order to summarise avian acoustic activity which was the winning entry in the 2016–2017 Bird Audio Detection challenge (Stowell, Wood, Stylianou, & Glotin, ). Like CityNet, bulbul is a CNN‐based classifier which uses spectrograms as input.…”
Section: Methodsmentioning
confidence: 99%
“…Rather than compare CityNet to the NDSI, we compared the biotic (NDSI bio ) and anthropogenic (NDSI anthro ) elements of the NDSI to the measures produced by CityBioNet and CityAnthroNet, respectively, as these were more comparable. As the Acoustic Indices are all designed to give a summary of acoustic activity for an entire file, they were analysed on the The precision and recall of CityBioNet was also compared to bulbul (Grill & Schlüter, 2017), an algorithm for detecting bird sounds in entire audio recordings in order to summarise avian acoustic activity which was the winning entry in the 2016-2017 Bird Audio Detection challenge (Stowell, Wood, Stylianou, & Glotin, 2016). Like CityNet, bulbul is a CNN-based classifier which uses spectrograms as input.…”
Section: Competing Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, Acoustic Indices (AIs) use the spectral and temporal characteristics of acoustic 78 energy in sound recordings to produce whole community measures of biotic and 79 anthropogenic sound (Sueur et al 2014). However, several commonly used AIs have been 80 shown to be biased by non-biotic sounds (Towsey et al 2014 (Kasten 122 et al 2012), and to bulbul, a state-of-the-art algorithm for detecting bird sounds in order to 123 summarise avian acoustic activity (Grill & Schlüter 2017). As the main focus of the study 124 was the development of algorithms for ecoacoustic assessment of biodiversity in cities, we 125 conducted further analysis on the two best performing algorithms for measuring biotic sound, 126 CityBioNet and bulbul, by investigating the effect of non-biotic sounds on the accuracy of the 127 algorithms.…”
mentioning
confidence: 99%
“…The precision and recall of CityBioNet was also compared to bulbul (Grill & Schlüter 2017), 249 an algorithm for detecting bird sounds in entire audio recordings in order to summarise avian 250 acoustic activity which was the winning entry in the 2016-7 Bird Audio Detection challenge 251 (Stowell et al 2016). Like CityNet, bulbul is a CNN-based classifier which uses 252 spectrograms as input.…”
mentioning
confidence: 99%