2019 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) 2019
DOI: 10.1109/isspit47144.2019.9001814
|View full text |Cite
|
Sign up to set email alerts
|

Scalogram Neural Network Activations with Machine Learning for Domestic Multi-channel Audio Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
11
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(12 citation statements)
references
References 11 publications
0
11
0
1
Order By: Relevance
“…Because deep learning based on scalograms has been used to successfully classify other types of complex biometric signals, we investigated whether a similar approach can be used to automate HTR detection. According to recent studies, images containing time-frequency data can be very effectively analyzed using a multistage approach where features extracted using an off-the-shelf deep CNN are used to train a support vector machine (SVM) or another type of classifier [33][34][35][36][37] . Combining a pretrained deep CNN with an SVM often outperforms other architectures 33,34,[38][39][40] .…”
mentioning
confidence: 99%
See 2 more Smart Citations
“…Because deep learning based on scalograms has been used to successfully classify other types of complex biometric signals, we investigated whether a similar approach can be used to automate HTR detection. According to recent studies, images containing time-frequency data can be very effectively analyzed using a multistage approach where features extracted using an off-the-shelf deep CNN are used to train a support vector machine (SVM) or another type of classifier [33][34][35][36][37] . Combining a pretrained deep CNN with an SVM often outperforms other architectures 33,34,[38][39][40] .…”
mentioning
confidence: 99%
“…The 50-layer ResNet model is a deep CNN that is trained to classify 1,000 image types and includes residual learning, which improves performance by increasing the depth of representation 41 . There is considerable evidence that pretrained CNNs can be used to extract deep features from scalograms and other types of images that are distant from the original datasets used for training 33,34,[38][39][40]42 . Features were extracted from a fully connected layer of ResNet-50 because deeper network layers generate a rich semantic image representation 42 and are well suited for image recognition tasks 34,37,39 .…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…Scalogram is the modulus of the multiscale wavelet transform, which elucidate time-frequency visualization. The spectro-temporal nature of scalograms makes them desirable for neural networks due to the signal's mapping properties with minimalistic information loss [25]. Scalogram gives the insight into the frequency, energy in time which shown as a function ( , ) in Eq-6.…”
Section: B Scalogrammentioning
confidence: 99%
“…Esto es posible luego de obtener una imagen con información multidimencional de la señal procesada. Los espectrogramas (tiempo -frecuencia -amplitud) y escalogramas obtenidos a partir de la transformada de ondícula Wavelet Transform (WT) (tiempo -banda escalada de frecuencia -amplitud) son ejemplos de estas aplicaciones (Zeng et al, 2019;Copiaco et al, 2019).…”
Section: Dominio Tiempo -Frecuenciaunclassified