2016
DOI: 10.1121/1.4969520
|View full text |Cite
|
Sign up to set email alerts
|

Voice livness detection based on pop-noise detector with phoneme information for speaker verification

Abstract: This paper proposes a pop-noise detector using phoneme information for a voice liveness detection (VLD) framework. In recent years, spoofing attacks (e.g., reply, speech synthesis, and voice conversion) have become a serious problem against speaker verification systems. Some techniques have been proposed to protect the speaker verification systems from these spoofing attacks. The VLD framework has been proposed as one of fundamental solutions. The VLD framework identifies that an input sample is uttered by an … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…We confirm the effectiveness of our system against replay attacks and impersonation attacks by comparing it with the baseline (VLD) [18] and the conference version of VoicePop [1]. In [18], Sayaka Shiota et al proposed the pop noise detector combined with the phoneme information to detect the existence of pop noises, but the replayed samples were easily recognized as legitimate samples under their proposed algorithm.…”
Section: Overall Performancementioning
confidence: 53%
See 2 more Smart Citations
“…We confirm the effectiveness of our system against replay attacks and impersonation attacks by comparing it with the baseline (VLD) [18] and the conference version of VoicePop [1]. In [18], Sayaka Shiota et al proposed the pop noise detector combined with the phoneme information to detect the existence of pop noises, but the replayed samples were easily recognized as legitimate samples under their proposed algorithm.…”
Section: Overall Performancementioning
confidence: 53%
“…We confirm the effectiveness of our system against replay attacks and impersonation attacks by comparing it with the baseline (VLD) [18] and the conference version of VoicePop [1]. In [18], Sayaka Shiota et al proposed the pop noise detector combined with the phoneme information to detect the existence of pop noises, but the replayed samples were easily recognized as legitimate samples under their proposed algorithm. However, VLD does not consider using the characteristics of the pop noise for further classification, nor does it consider the impersonation attack when the adversary replays the audio and mimics breathing at the √ Earise Al-101 √ This also shows that the combination of pop noise and its airflow pressure can improve the detection rate of the pop noise-only feature.…”
Section: Overall Performancementioning
confidence: 53%
See 1 more Smart Citation