2019
DOI: 10.1109/access.2019.2923806
|View full text |Cite
|
Sign up to set email alerts
|

Toward Robust Audio Spoofing Detection: A Detailed Comparison of Traditional and Learned Features

Abstract: Automatic speaker verification, like every other biometric system, is vulnerable to spoofing attacks. Using only a few minutes of recorded voice of a genuine client of a speaker verification system, attackers can develop a variety of spoofing attacks that might trick such systems. Detecting these attacks using the audio cues present in the recordings is an important challenge. Most existing spoofing detection systems depend on knowing the used spoofing technique. With this research, we aim at overcoming this l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
18
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1
1

Relationship

3
7

Authors

Journals

citations
Cited by 58 publications
(24 citation statements)
references
References 50 publications
0
18
0
1
Order By: Relevance
“…Meanwhile, the false-negative rate (i.e., false alarm rate) refers to the ratio between misclassified genuine test samples and the total number of genuine test samples. Therefore, the lower the EER value, the better the developed countermeasure performs [58].…”
Section: Resultsmentioning
confidence: 99%
“…Meanwhile, the false-negative rate (i.e., false alarm rate) refers to the ratio between misclassified genuine test samples and the total number of genuine test samples. Therefore, the lower the EER value, the better the developed countermeasure performs [58].…”
Section: Resultsmentioning
confidence: 99%
“…The pre-processed audio signals were first segmented into frames of 100 ms, after which a Hamming window was applied, followed by the extraction of audio features. Mel-Frequency Cepstral Coefficients (MFCCs) were chosen for this investigation owing to their effectiveness when it comes to audio classification problems [34,35]. MFCCs are a set of features that focus on the perceptually relevant aspects of the audio spectrum, additionally the coefficients could contain information about the vocal tract characteristics [36,37].…”
Section: Cough Sound Processing and Audio Feature Extractionmentioning
confidence: 99%
“…have performed remarkably well for the spoof detection tasks, and for speech and speaker recognition tasks as well. These techniques can model the human vocal tract and human auditory system very well [15][16][17]. Human ear is proved to be deaf for the phase factor of sound.…”
Section: Introductionmentioning
confidence: 99%