Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1687
|View full text |Cite
|
Sign up to set email alerts
|

Novel Variable Length Energy Separation Algorithm Using Instantaneous Amplitude Features for Replay Detection

Abstract: Voice-based speaker authentication or Automatic Speaker Verification (ASV) system is now becoming practical reality after several decades of research. However, still this technology is very much susceptible to various spoofing attacks. Among various spoofing attacks, replay is the most challenging attack. In this paper, we propose a novel feature set based on our recently introduced Variable length Energy Separation Algorithm (VESA) during INTERSPEECH 2017. The key idea of this paper is to capture the Instanta… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 19 publications
(5 citation statements)
references
References 33 publications
0
5
0
Order By: Relevance
“…The short-time AM-FM features set obtained using Energy Separation Algorithm (ESA) were studied in [120,121] as shown in Fig. 11.…”
Section: ) Acoustic Featuresmentioning
confidence: 99%
“…The short-time AM-FM features set obtained using Energy Separation Algorithm (ESA) were studied in [120,121] as shown in Fig. 11.…”
Section: ) Acoustic Featuresmentioning
confidence: 99%
“…• Discrete Fourier transform (DFT) based features: which include Mel frequency cepstral coefficients (MFCC) [4,13,36], mel filterbank slope [10], linear filterbak slope [10], and Q-log domain DFT-based mean normalized log spectral [42]. • Variable length energy separation algorithm (VESA)-based features: which include instantaneous frequency cosine coefficients based on VESA [6] and instantaneous amplitude cosine coefficients based on VESA [43].…”
Section: Related Workmentioning
confidence: 99%
“…When speech is replayed through a playback device, or recorded on a recording device, its frequency attributes are changed [7]- [12]. Replay attack detection can be regarded as a task that distinguishes the difference in the frequency attributes between genuine and replayed speeches.…”
Section: Introductionmentioning
confidence: 99%