2017 International Joint Conference on Neural Networks (IJCNN) 2017
DOI: 10.1109/ijcnn.2017.7966273
|View full text |Cite
|
Sign up to set email alerts
|

Acoustic novelty detection with adversarial autoencoders

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
43
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 40 publications
(43 citation statements)
references
References 25 publications
0
43
0
Order By: Relevance
“…We used data from 1,555 feeding executions collected from 24 able-bodied participants where we newly collected 1,203 non-anomalous feeding executions for this work. 16 participants were male and 8 were female, and the age range was [19][20][21][22][23][24][25][26][27][28][29][30][31][32][33][34][35]. We conducted the studies with approval from the Georgia Tech Institutional Review Board (IRB).…”
Section: B Data Collectionmentioning
confidence: 99%
“…We used data from 1,555 feeding executions collected from 24 able-bodied participants where we newly collected 1,203 non-anomalous feeding executions for this work. 16 participants were male and 8 were female, and the age range was [19][20][21][22][23][24][25][26][27][28][29][30][31][32][33][34][35]. We conducted the studies with approval from the Georgia Tech Institutional Review Board (IRB).…”
Section: B Data Collectionmentioning
confidence: 99%
“…The dataset was recorded by a binaural microphone at a sample rate of 16kHz. We converted each audio to 1 channel and then split it into sequences of 160-dimensional frames, each frame corresponds to 0.01s, as in [12] and [13]. [12] and [13] evaluated the detection at each frame instead of at the whole sequence, so we also applied the thresholding step to each log p(x t |x <t ), instead of log p(x 1:T ).…”
Section: Methodsmentioning
confidence: 99%
“…Both [12] and [13] used RNNs (LSTMs in particular) as an AutoEncoder (AE) which can reconstruct the original signal from a compressed representation (Compression AutoEncoders -CAEs) or from a corrupted version of it (Denoising AutoEncoders -DAEs). However, as discussed in [16], [17] and [20], the fact that the hidden states of RNNs are deterministic reduces their capacity to capture all data variabilities, especially for data that contain high levels of randomness.…”
Section: Related Workmentioning
confidence: 99%
“…Due to the inherent potential for capturing data distributions, there is a growing body of literature that recognizes the importance of AAE. In [41], Principi et al proposed an acoustic novelty detector based on AAE, and the results showed that the proposed approach provides a relative performance improvement equal to 0.26% compared to the standard autoencoder. A conditional difference adversarial autoencoder (CDAAE) [42] was proposed for facial expression synthesis to handle the problem of disambiguating changes.…”
Section: Related Workmentioning
confidence: 99%