2022
DOI: 10.1109/tits.2022.3158076
|View full text |Cite
|
Sign up to set email alerts
|

Listening for Sirens: Locating and Classifying Acoustic Alarms in City Scenes

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
2

Relationship

2
7

Authors

Journals

citations
Cited by 22 publications
(15 citation statements)
references
References 36 publications
0
15
0
Order By: Relevance
“…A wider perspective on acoustic traffic event detection was later offered by [51], [52], and [55]. In [55], the authors analyze five classes of audio events: several types of ambulance sirens, railroad crossing bells, tire screeches, car honks, and glass breaking.…”
Section: Alert Sound Detection and Recognitionmentioning
confidence: 99%
“…A wider perspective on acoustic traffic event detection was later offered by [51], [52], and [55]. In [55], the authors analyze five classes of audio events: several types of ambulance sirens, railroad crossing bells, tire screeches, car honks, and glass breaking.…”
Section: Alert Sound Detection and Recognitionmentioning
confidence: 99%
“…For humans, lower frequencies are perceptually much more important than higher frequencies and this can be represented in time-frequency representations. Gammatonegrams extend this biological inspiration, using filter banks modelled on the human cochlea and have been successfully used before in a robotics context [24].…”
Section: A Audio Classificationmentioning
confidence: 99%
“…to have Ψ h = T ), we use zero padding at the time dimension. Specifically, we use a padding equal to ξ h for kernel shape of (3, 3), a padding equal to 2 • ξ h for (5, 5) kernel, 3 • ξ h for the (7, 7) kernel, and 5 • ξ h for the (11,11) kernel. We use no padding at the feature dimension for the f dil .…”
Section: Baseline System and Modelsmentioning
confidence: 99%
“…Also, activities of classes can overlap (polyphonic SED) or not (monophonic SED). SED can be employed in a wide range of applications, like wildlife monitoring and bird activity detection [5,6], home monitoring [7,8], autonomous vehicles [9,10], and surveillance [11,12].…”
Section: Introductionmentioning
confidence: 99%