2020
DOI: 10.3390/app10093286
|View full text |Cite
|
Sign up to set email alerts
|

Bioacoustic Classification of Antillean Manatee Vocalization Spectrograms Using Deep Convolutional Neural Networks

Abstract: We evaluated the potential of using convolutional neural networks in classifying spectrograms of Antillean manatee (Trichechus manatus manatus) vocalizations. Spectrograms using binary, linear and logarithmic amplitude formats were considered. Two deep convolutional neural networks (DCNN) architectures were tested: linear (fixed filter size) and pyramidal (incremental filter size). Six experiments were devised for testing the accuracy obtained for each spectrogram representation and architecture combination. R… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 17 publications
(12 citation statements)
references
References 44 publications
0
8
0
Order By: Relevance
“…The paper from Merchan et al [2] deals with the classification of vocalization spectrograms of Antillean Manatee, a large endangered sea mammal. In order to estimate its population for scientific purposes, the authors turn to audio classification.…”
Section: Deep Learning For Applications In Acousticsmentioning
confidence: 99%
“…The paper from Merchan et al [2] deals with the classification of vocalization spectrograms of Antillean Manatee, a large endangered sea mammal. In order to estimate its population for scientific purposes, the authors turn to audio classification.…”
Section: Deep Learning For Applications In Acousticsmentioning
confidence: 99%
“…Investigations have already been developed to handle tasks such as infant crying motivation [22], music genre classification [23], and music mood classification [24]. The visual domain has also been used with animal vocalizations, in tasks such as species identification and detection [13,25].…”
Section: Signalmentioning
confidence: 99%
“…all their samples recorded in only two locations (see Table 1), increasing the number of developed to handle tasks such as infant cry motivation [7], music genre classification 165 [8] and music mood classification [9]. The visual domain has also been used with animal 166 vocalizations, in tasks as species identification and detection [4,10]. 191…”
Section: Introduction 18mentioning
confidence: 99%