2019
DOI: 10.1016/j.patcog.2019.03.016
|View full text |Cite
|
Sign up to set email alerts
|

Learning representations of sound using trainable COPE feature extractors

Abstract: Sound analysis research has mainly been focused on speech and music processing. The deployed methodologies are not suitable for analysis of sounds with varying background noise, in many cases with very low signal-to-noise ratio (SNR).In this paper, we present a method for the detection of patterns of interest in audio signals. We propose novel trainable feature extractors, which we call COPE (Combination of Peaks of Energy). The structure of a COPE feature extractor is determined using a single prototype sound… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
16
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3

Relationship

3
4

Authors

Journals

citations
Cited by 25 publications
(17 citation statements)
references
References 66 publications
1
16
0
Order By: Relevance
“…3 ) and the need and operation of each block are explained. The algorithm is divided into two parts: configuration and application [36] . The configuration part deals with the preparation of a clean balanced dataset, and the application part explains the extraction and use of the proposed C-19CC features.…”
Section: Proposed Methodologymentioning
confidence: 99%
“…3 ) and the need and operation of each block are explained. The algorithm is divided into two parts: configuration and application [36] . The configuration part deals with the preparation of a clean balanced dataset, and the application part explains the extraction and use of the proposed C-19CC features.…”
Section: Proposed Methodologymentioning
confidence: 99%
“…This accounts for different resolution at low and high frequency, similarly to the way the auditory system processes the sound. We refer the reader to [21,27] for details.…”
Section: Gammatonegrammentioning
confidence: 99%
“…In this paper, we present a method for audio event detection that is based on trainable feature extractors, called COPE (Combination of Peaks of Energy), recently proposed in [27]. The COPE algorithm is based on the analysis of local maxima in a time-frequency representation of the input audio signal, which have been demonstrated to be robust to additive noise [31].…”
Section: Introductionmentioning
confidence: 99%
“…Parts of higher energy intensity correspond to regions of the cochlea membrane that vibrates more according to the energy of the mechanical sound pressure waves that hit the outer part of the auditory system. This model was exploited in [45][46][47] as input to a trainable feature extractor, the design of which was inspired by the activation of the inner hair cells, placed behind the cochlea, which convert the vibration into electrical stimuli on the auditory nerve.…”
Section: Introductionmentioning
confidence: 99%