2011
DOI: 10.1007/978-3-642-25085-9_54
|View full text |Cite
|
Sign up to set email alerts
|

Environmental Sounds Classification Based on Visual Features

Abstract: This paper presents a method aimed at classification of the environmental sounds in the visual domain by using the scale and translation invariance. We present a new approach that extracts visual features from sound spectrograms. We suggest to apply support vector machines (SVM's) in order to address sound classification. Indeed, in the proposed method we explore sound spectrograms as texture images, and extracts the time-frequency structures by using a translation-invariant wavelet transform and a patch trans… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2012
2012
2016
2016

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 12 publications
0
10
0
Order By: Relevance
“…The original application in [43] was texture classification, yet the plausible use for music instrument classification was mentioned. Souli and Lachiri subsequently used this method for ESR in [26]. They also proposed another set of nonlinear features in [27].…”
Section: Power-spectrum-based Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The original application in [43] was texture classification, yet the plausible use for music instrument classification was mentioned. Souli and Lachiri subsequently used this method for ESR in [26]. They also proposed another set of nonlinear features in [27].…”
Section: Power-spectrum-based Methodsmentioning
confidence: 99%
“…This method extracted features from the log-Gabor filtered spectrogram (instead of the raw spectrogram). Since no performance comparison was made between features obtained from the log-Gabor filtered spectrogram and the raw spectrogram in [26], the advantages and shortcomings of this approach need to be explored furthermore.…”
Section: Power-spectrum-based Methodsmentioning
confidence: 99%
“…This approach consists in computation of 12 log-Gabor filters from the environmental sounds spectrograms, with 2 different scales (1,2) and 6 different orientations (1,2,3,4,5,6), this extraction allows the best correlate of signal structures. Then, for each single filter result we calculated the magnitude, after that, we passed through on mutual information (MI) algorithm to find an optimal feature vector that can next be passed for classification phase [9].…”
Section: ) Single Log-gabor Filtersmentioning
confidence: 99%
“…To improve our work results realized in [5], the idea consists in elaborating other methods based also on time-frequency representation, but with new approaches.…”
Section: Introductionmentioning
confidence: 98%
See 1 more Smart Citation