2023
DOI: 10.1016/j.patcog.2022.109025
|View full text |Cite
|
Sign up to set email alerts
|

Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
37
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 32 publications
(37 citation statements)
references
References 26 publications
0
37
0
Order By: Relevance
“…Datasets on insects were few, with the majority having sounds for birds, frogs, cats, whales, and dogs. For general acoustics, the most popular dataset was the US8K (Urban Sound 8K) which contains 8732 labeled sound excerpts of urban sounds [ 119 , 127 , 131 , 132 , 138 , 141 , 146 , 149 ] as shown in Figure 7 . The ESC 50 and ESC 10 datasets were also among the popular datasets [ 119 , 130 , 131 , 132 , 141 , 143 , 144 , 146 , 147 , 148 , 149 ].…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Datasets on insects were few, with the majority having sounds for birds, frogs, cats, whales, and dogs. For general acoustics, the most popular dataset was the US8K (Urban Sound 8K) which contains 8732 labeled sound excerpts of urban sounds [ 119 , 127 , 131 , 132 , 138 , 141 , 146 , 149 ] as shown in Figure 7 . The ESC 50 and ESC 10 datasets were also among the popular datasets [ 119 , 130 , 131 , 132 , 141 , 143 , 144 , 146 , 147 , 148 , 149 ].…”
Section: Resultsmentioning
confidence: 99%
“…For general acoustics, the most popular dataset was the US8K (Urban Sound 8K) which contains 8732 labeled sound excerpts of urban sounds [ 119 , 127 , 131 , 132 , 138 , 141 , 146 , 149 ] as shown in Figure 7 . The ESC 50 and ESC 10 datasets were also among the popular datasets [ 119 , 130 , 131 , 132 , 141 , 143 , 144 , 146 , 147 , 148 , 149 ]. They contain a mixture of bioacoustics and general acoustics sounds.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We selected a CNN model, which is a deep neural network-based machine learning model, for three primary reasons. First, CNN-based models still achieve state-of-the-art in many audio classification tasks due to their ability to effectively extract local features from the audio input [45] [46]. As a result, combining them with transformers has been popular in several recent works to achieve SOTA accuracies in different speech recognition tasks [47] [48].…”
Section: Machine Learning Classifiermentioning
confidence: 99%
“…Although several studies have been carried out in the forest acoustic monitoring context, still, a standard benchmark dataset specific to forest sounds is unavailable. Therefore, most of the existing studies have utilized publicly available environmental sound datasets such as ESC-50 [ 4 , 13 , 14 , 15 , 16 , 17 ], UrbanSound8K (U8k) [ 14 , 18 , 19 , 20 , 21 ], FSD50K [ 22 , 23 ], and SONYC-UST [ 24 , 25 ]. These datasets contain a large quantity of audio data categorized into several groups covering a broad area of sound events.…”
Section: Introductionmentioning
confidence: 99%