ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9413467
|View full text |Cite
|
Sign up to set email alerts
|

Attack on Practical Speaker Verification System Using Universal Adversarial Perturbations

Abstract: In authentication scenarios, applications of practical speaker verification systems usually require a person to read a dynamic authentication text. Previous studies played an audio adversarial example as a digital signal to perform physical attacks, which would be easily rejected by audio replay detection modules. This work shows that by playing our crafted adversarial perturbation as a separate source when the adversary is speaking, the practical speaker verification system will misjudge the adversary as a ta… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
16
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 33 publications
(18 citation statements)
references
References 16 publications
1
16
0
Order By: Relevance
“…We use L ∞ and L 2 norms to quantify the perturbation magnitude in adversarial example generation, and adopt SNR and PESQ to measure the imperceptibility of crafted adversarial voices. These metrics have been widely adopted in the literature [9], [10], [11], [13], [14], [15], [16] and in general, can consistently reflect the degree of distortions according to our experimental results. Moreover, PESQ is an objective perceptual measure simulating the human auditory system [62].…”
Section: Discussion Of Limitationssupporting
confidence: 73%
See 4 more Smart Citations
“…We use L ∞ and L 2 norms to quantify the perturbation magnitude in adversarial example generation, and adopt SNR and PESQ to measure the imperceptibility of crafted adversarial voices. These metrics have been widely adopted in the literature [9], [10], [11], [13], [14], [15], [16] and in general, can consistently reflect the degree of distortions according to our experimental results. Moreover, PESQ is an objective perceptual measure simulating the human auditory system [62].…”
Section: Discussion Of Limitationssupporting
confidence: 73%
“…Recently, adversarial attacks on speaker recognition have been extensively studied [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17]. Results show that both state-of-the-art open-source and commercial SRSs can be fooled by adding small perturbations to the original voice, even playing over the air in the physical world.…”
Section: Motivationmentioning
confidence: 99%
See 3 more Smart Citations