ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9413501
|View full text |Cite
|
Sign up to set email alerts
|

Data Augmentation with Signal Companding for Detection of Logical Access Attacks

Abstract: The recent advances in voice conversion (VC) and textto-speech (TTS) make it possible to produce natural sounding speech that poses threat to automatic speaker verification (ASV) systems. To this end, research on spoofing countermeasures has gained attention to protect ASV systems from such attacks. While the advanced spoofing countermeasures are able to detect known nature of spoofing attacks, they are not that effective under unknown attacks. In this work, we propose a novel data augmentation technique using… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 19 publications
(7 citation statements)
references
References 28 publications
0
7
0
Order By: Relevance
“…The logarithm of the CQT power spectrum is then saved as extracted feature for each example. The parameters related to CQT extraction follows our previous work given in [31]. On the other hand, the ERB spectrum (erbSpec) is extracted with 43 number of gammatone filters using audioFeatureExtractor of MAT-LAB audio toolbox 7 .…”
Section: Methodsmentioning
confidence: 99%
“…The logarithm of the CQT power spectrum is then saved as extracted feature for each example. The parameters related to CQT extraction follows our previous work given in [31]. On the other hand, the ERB spectrum (erbSpec) is extracted with 43 number of gammatone filters using audioFeatureExtractor of MAT-LAB audio toolbox 7 .…”
Section: Methodsmentioning
confidence: 99%
“…For the replay attack detection, seven augmentation techniques were tested; out of these, dynamic value change and pitch change showed an 8% improvement in base model accuracy [30]. A data augmentation technique using a-law and mu-law based signal companding was explored in [31] for the detection of logical access attacks. For data augmentation, an Auxiliary Classifier Generative Adversarial Network (AC-GAN) was also proposed to generate more speech samples with diverse variants [32] combined with a post-selection quality frame selection based on CNN, giving more accuracy.…”
Section: Literature Reviewmentioning
confidence: 99%
“…When the evaluation trials from another data set are mounted, any mismatch could make the model fail to respond. This motivates a few studies using data augmentation, for example, augmentation based on waveform companding [21] and frequency mask [13], to alleviate potential mismatches between training and unseen test data. Another potential direction is to use self-supervised front end trained on various speech data.…”
Section: Remaining Issues and Challengesmentioning
confidence: 99%