2021
DOI: 10.1007/978-3-030-71711-7_18
|View full text |Cite
|
Sign up to set email alerts
|

Cross-lingual Speaker Verification: Evaluation on X-Vector Method

Abstract: Automatic Speaker Verification (ASV) systems accuracy is based on the spoken language used in training and enrolling speakers. Language dependency makes voice-based security systems less robust and generalizable to a wide range of applications. In this work, a study on language dependency of a speaker verification system and experiments are performed to benchmark the robustness of the x-vector based techniques to language dependency. Experiments are carried out on a smartphone multi-lingual dataset with 50 sub… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 16 publications
0
3
0
Order By: Relevance
“…The dependency of speaker recognition on the speaker's language has been observed in the recent works [82]. The mismatch of languages of speech samples in training, enrolling, and testing is a challenging problem in AV biometrics.…”
Section: ) Multi-lingual Speaker Recognitionmentioning
confidence: 98%
“…The dependency of speaker recognition on the speaker's language has been observed in the recent works [82]. The mismatch of languages of speech samples in training, enrolling, and testing is a challenging problem in AV biometrics.…”
Section: ) Multi-lingual Speaker Recognitionmentioning
confidence: 98%
“…The dependency of speaker recognition on the speaker's language has been observed in the recent works [83]. The mismatch of languages of speech samples in training, en-rolling, and testing is a challenging problem in AV biometrics.…”
Section: ) Multi-lingual Speaker Recognitionmentioning
confidence: 98%
“…The degradation of biometric recognition due to language mismatch is presented in some previous works [21], [16], [17]. Our dataset comprises of the same subjects speaking three different languages, therefore, providing scope for inter-language speaker recognition evaluation.…”
Section: ) Inter-language Speaker Recognitionmentioning
confidence: 99%