2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015
DOI: 10.1109/icassp.2015.7178320
|View full text |Cite
|
Sign up to set email alerts
|

A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks

Abstract: Acoustic novelty detection aims at identifying abnormal/novel acoustic signals which differ from the reference/normal data that the system was trained with. In this paper we present a novel unsupervised approach based on a denoising autoencoder. In our approach auditory spectral features are processed by a denoising autoencoder with bidirectional Long Short-Term Memory recurrent neural networks. We use the reconstruction error between the input and the output of the autoencoder as activation signal to detect n… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
138
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 184 publications
(138 citation statements)
references
References 22 publications
0
138
0
Order By: Relevance
“…NOVELTY DETECTION To detect novelty, we adopt an autoencoder-based approach proposed by Japkowicz et al [15], and focus on camera images as the domain of interest, although this general approach is also applicable to a wide variety of complex domains, including acoustic signals [22], network server anomalies [33], data mining [14], document classification [21] and others. Classification of novel data amounts to a one-class classification problem since we assume that we only have access to the set of images that have been observed and collected by our system, and no examples of any other "unfamiliar" images.…”
Section: Prior Estimate Of Collision Probabilitymentioning
confidence: 99%
“…NOVELTY DETECTION To detect novelty, we adopt an autoencoder-based approach proposed by Japkowicz et al [15], and focus on camera images as the domain of interest, although this general approach is also applicable to a wide variety of complex domains, including acoustic signals [22], network server anomalies [33], data mining [14], document classification [21] and others. Classification of novel data amounts to a one-class classification problem since we assume that we only have access to the set of images that have been observed and collected by our system, and no examples of any other "unfamiliar" images.…”
Section: Prior Estimate Of Collision Probabilitymentioning
confidence: 99%
“…The final decision of this multi-classifier ensemble learning is given by the maximum score based on Eqs. (15) and (16) …”
Section: Two-stage Ensemble Learning Frameworkmentioning
confidence: 99%
“…Notably, this DNN algorithm can effectively improve feature representation of original time-frequency features and then promote the classification performance [8]. Other DL algorithms, such as deep belief network (DBN) [13], convolutional neural networks (CNN) [14,15], and auto-encoder (AE) [16], have also been effectively used for SEC. However, it is still time-costing to train a deep network model by these DL algorithms for a large-scale sound dataset.…”
Section: Introductionmentioning
confidence: 99%
“…That is, we face this sort of problem when it is difficult to collect abnormal sounds to be detected. To detect abnormal sounds, some methods use a criterion for how different input sounds are from ''normal'' sounds [22][23][24].…”
Section: Abnormal Sound Detectionmentioning
confidence: 99%