2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7952639
|View full text |Cite
|
Sign up to set email alerts
|

Anuran call classification with deep learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
10
0
8

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(19 citation statements)
references
References 8 publications
1
10
0
8
Order By: Relevance
“…Therefore, gamma spectrogram is selected for the subsequent analysis. , CaffeNet is the worst, which is in consistent with [16].…”
Section: Rfsupporting
confidence: 75%
See 2 more Smart Citations
“…Therefore, gamma spectrogram is selected for the subsequent analysis. , CaffeNet is the worst, which is in consistent with [16].…”
Section: Rfsupporting
confidence: 75%
“…Multi-label learning Frog species Different from [15], we use a deep learning algorithm as a feature extractor. In [16], a pre-trained network is found to achieve higher classification accuracy than training a new network. Also, there are only 342 10-s recordings for the experiment, which are not enough for training.…”
Section: Feature Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…At present, deep learning techniques are being employed in frog acoustics classification [23][24][25], applying convolutional neural networks (CNN). However, most of these works also use MFCC as parameters, relying on the discriminatory capacity of the classifier without looking for a better representation of the acoustic signal information.…”
Section: Reptiles and Amphibiansmentioning
confidence: 99%
“…State of the art classification of sound relies on Convolutional Neural Networks (CNN) that take input from some form of the spectrogram [36] or even the raw waveform [37]. Moreover, CNN deep learning approaches have also been used in the identification of anuran sound [38]. In spite of that, studying and optimizing the process of extracting MFCC features is of great interest at least for three reasons.…”
Section: Introductionmentioning
confidence: 99%