2017
DOI: 10.1049/iet-cvi.2017.0075
|View full text |Cite
|
Sign up to set email alerts
|

Bird and whale species identification using sound images

Abstract: Image identification of animals is mostly centred on identifying them based on their appearance, but there are other ways images can be used to identify animals, including by representing the sounds they make with images. In this study, the authors present a novel and effective approach for automated identification of birds and whales using some of the best texture descriptors in the computer vision literature. The visual features of sounds are built starting from the audio file and are taken from images const… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 54 publications
0
8
0
Order By: Relevance
“…The performance of Convolutional Neural Network is optimized for image classification, so this method is not suitable for the feature set we used in this paper. We also used Recurrent Neural Network in the pilot test, but our DNN method showed better results [10,11].…”
Section: Introductionmentioning
confidence: 99%
“…The performance of Convolutional Neural Network is optimized for image classification, so this method is not suitable for the feature set we used in this paper. We also used Recurrent Neural Network in the pilot test, but our DNN method showed better results [10,11].…”
Section: Introductionmentioning
confidence: 99%
“…Manual segmentation of audio recordings is time consuming and tedious task. However, some approaches [5][6][7][8] have adopted manual segmentation while others have adopted automated segmentation of birdsong [9][10][11][12]. In general, detect any acoustic activity in audio and segment it as a syllable.…”
Section: Related Workmentioning
confidence: 99%
“…EE represents the Entropy of the energy that is used to measure the changes in the level of energy of birdsong. On the other hand, Equation (6) represents the FDFs that provide the spectral features of the birdsong, e.g., SE represents the spectral entropy, and F represents the spectral flux. The spectral shape is represented by spectral spread (S) and spectral centroid (C).…”
Section: Perceptual Descriptive and Harmonic Features (Pdhfs)mentioning
confidence: 99%
“…all their samples recorded in only two locations (see Table 1), increasing the number of developed to handle tasks such as infant cry motivation [7], music genre classification 165 [8] and music mood classification [9]. The visual domain has also been used with animal 166 vocalizations, in tasks as species identification and detection [4,10]. 191…”
Section: Introduction 18mentioning
confidence: 99%
“…In case of spectrograms 182 in particular, texture is a very prominent visual property. In this vein, the textural 183 content of spectrograms has been used in several audio classification tasks, such as 184 music genre classification[11], voice classification[12], birds species classification and 185 whales recognition[4].186In[13], the authors propose the Local Binary Pattern (LBP). The texture of an187 image is described with a histogram.…”
mentioning
confidence: 99%