Learning representations of sound using trainable COPE feature extractors

Strisciuglio, Nicola; Vento, Mario; Petkov, Nicolai

doi:10.1016/j.patcog.2019.03.016

Cited by 25 publications

(17 citation statements)

References 66 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…3 ) and the need and operation of each block are explained. The algorithm is divided into two parts: configuration and application [36] . The configuration part deals with the preparation of a clean balanced dataset, and the application part explains the extraction and use of the proposed C-19CC features.…”

Section: Proposed Methodologymentioning

confidence: 99%

Detection of COVID-19 from speech signal using bio-inspired based cepstral features

Dash¹,

Mishra²,

Panda³

et al. 2021

Pattern Recognition

View full text Add to dashboard Cite

Section: Proposed Methodologymentioning

confidence: 99%

Detection of COVID-19 from speech signal using bio-inspired based cepstral features

Dash¹,

Mishra²,

Panda³

et al. 2021

Pattern Recognition

View full text Add to dashboard Cite

“…This accounts for different resolution at low and high frequency, similarly to the way the auditory system processes the sound. We refer the reader to [21,27] for details.…”

Section: Gammatonegrammentioning

confidence: 99%

“…In this paper, we present a method for audio event detection that is based on trainable feature extractors, called COPE (Combination of Peaks of Energy), recently proposed in [27]. The COPE algorithm is based on the analysis of local maxima in a time-frequency representation of the input audio signal, which have been demonstrated to be robust to additive noise [31].…”

Section: Introductionmentioning

confidence: 99%

Trainable COPE Features for Sound Event Detection

Strisciuglio

Petkov

2019

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications

Self Cite

View full text Add to dashboard Cite

Systems for automatic analysis of sounds and detection of events are of great importance as they can be used as substitutes of or complement to video analytic systems. In this paper we describe a flexible system for the detection of audio events based on the use of trainable COPE (Combination of Peaks of Energy) features. The structure of a COPE feature is determined in an automatic configuration process on a single prototype example. Thus, they can be adapted to different kinds of sounds of interest. We configure a set of COPE features in order to account for robustness to variations of the characteristics of sounds within a specific class. The proposed system is flexible as new features (also configured on examples drawn from new classes) can be easily added to the feature set. We performed experiments on the MIVIA road events data set for road surveillance applications and compared the results that we achieved with the ones of other existing methods.

show abstract

“…Parts of higher energy intensity correspond to regions of the cochlea membrane that vibrates more according to the energy of the mechanical sound pressure waves that hit the outer part of the auditory system. This model was exploited in [45][46][47] as input to a trainable feature extractor, the design of which was inspired by the activation of the inner hair cells, placed behind the cochlea, which convert the vibration into electrical stimuli on the auditory nerve.…”

Section: Introductionmentioning

confidence: 99%

Brain-Inspired Algorithms for Processing of Visual Data

Strisciuglio

Petkov

2021

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

The study of the visual system of the brain has attracted the attention and interest of many neuro-scientists, that derived computational models of some types of neuron that compose it. These findings inspired researchers in image processing and computer vision to deploy such models to solve problems of visual data processing.In this paper, we review approaches for image processing and computer vision, the design of which is based on neuro-scientific findings about the functions of some neurons in the visual cortex. Furthermore, we analyze the connection between the hierarchical organization of the visual system of the brain and the structure of Convolutional Networks (ConvNets). We pay particular attention to the mechanisms of inhibition of the responses of some neurons, which provide the visual system with improved stability to changing input stimuli, and discuss their implementation in image processing operators and in ConvNets.

show abstract

Learning representations of sound using trainable COPE feature extractors

Cited by 25 publications

References 66 publications

Detection of COVID-19 from speech signal using bio-inspired based cepstral features

Detection of COVID-19 from speech signal using bio-inspired based cepstral features

Trainable COPE Features for Sound Event Detection

Brain-Inspired Algorithms for Processing of Visual Data

Contact Info

Product

Resources

About