2018 IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP) 2018
DOI: 10.1109/mlsp.2018.8516968
|View full text |Cite
|
Sign up to set email alerts
|

Analysing Replay Spoofing Countermeasure Performance Under Varied Conditions

Abstract: In this paper, we aim to understand what makes replay spoofing detection difficult in the context of the ASVspoof 2017 corpus. We use FFT spectra, mel frequency cepstral coefficients (MFCC) and inverted MFCC (IMFCC) frontends and investigate different backends based on Convolutional Neural Networks (CNNs), Gaussian Mixture Models (GMMs) and Support Vector Machines (SVMs). On this database, we find that IMFCC frontend based systems show smaller equal error rate (EER) for high quality replay attacks but higher E… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 14 publications
(31 reference statements)
0
6
0
Order By: Relevance
“…We train two SVMs using i-vectors (model I) and the longterm-average-spectrum (LTAS) feature (model J) since they have shown good performance on prior spoofing datasets [30,31,32]. Inspired from [12] we fuse multiple i-vectors in our approach, each based on complimentary hand-engineered features, and manage to improve performance over a single i-vector based SVM.…”
Section: Svmmentioning
confidence: 99%
“…We train two SVMs using i-vectors (model I) and the longterm-average-spectrum (LTAS) feature (model J) since they have shown good performance on prior spoofing datasets [30,31,32]. Inspired from [12] we fuse multiple i-vectors in our approach, each based on complimentary hand-engineered features, and manage to improve performance over a single i-vector based SVM.…”
Section: Svmmentioning
confidence: 99%
“…Mishra et al [5] proposed the first adaption of LIME for MIR, termed SoundLIME (SLIME), which segments the spectrogram into time, frequency, or time-frequency segments. SLIME was demonstrated on the task of singing voice detection [18] and used for analysing a replay spoofing detection system [6]. Haunschmid et al used other types of interpretable features (super pixels [7], source sep-aration estimates [8,9]) for explaining the predictions of a variety of models, including music taggers [8,9] and a content-based music recommender system [10].…”
Section: Local Interpretable Model-agnostic Explanationsmentioning
confidence: 99%
“…A plethora of explanation methods ("explainers") have been originally developed for text or image data and adapted to the audio domain [1,2], or specifically introduced for MIR systems [3]. Most notably, different versions of Local Interpretable Model-agnostic Explanations (LIME), a posthoc explainer [4], have been used to explain models in a variety of MIR tasks [5][6][7][8][9][10][11].…”
Section: Introductionmentioning
confidence: 99%
“…rectangular subparts of spectrograms) as an explainer, as this is the most prominent method in audio (cf. [19][20][21][22]). Then, in Sect.…”
Section: Introductionmentioning
confidence: 99%