2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07 2007
DOI: 10.1109/icassp.2007.366898
|View full text |Cite
|
Sign up to set email alerts
|

Language Normalization for Bilingual Speaker Recognition Systems

Abstract: In this study, we focus on the problem of removing/normalizing the impact of spoken language variation in Bilingual Speaker Recognition (BSR) systems. In addition to environment, recording, and channel mismatches, spoken language mismatch is an additional factor resulting in performance degradation in speaker recognition systems. In today's world, the number of bilingual speakers is increasing with English becoming the universal second language. Data sparseness is becoming an important research issue to deploy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 10 publications
(7 reference statements)
0
6
0
Order By: Relevance
“…In [73], authors proposed two novel algorithms towards spoken language mismatch problem: the first algorithm merges language-dependent system outputs by using Language Identification (LID) scores of each utterance as fusion weights. And in the second algorithm, fusion is done at the segment-level via multilingual Phone Recognition (PR).…”
Section: B) Language Normalizationmentioning
confidence: 99%
“…In [73], authors proposed two novel algorithms towards spoken language mismatch problem: the first algorithm merges language-dependent system outputs by using Language Identification (LID) scores of each utterance as fusion weights. And in the second algorithm, fusion is done at the segment-level via multilingual Phone Recognition (PR).…”
Section: B) Language Normalizationmentioning
confidence: 99%
“…Factor analysis treats the language as a latent variable and marginalizes it out when conducting verification. For example, the method proposed in [6] infers the posterior probabilities of a test speech belonging to different languages. With these posteriors, the score of the test speech against the claimed speaker is computed as the fusion (summation) of the scores tested against the models of the speaker enrolled with each language.…”
Section: Introductionmentioning
confidence: 99%
“…Another solution for bilingual speaker recognition is training two separate speaker models for each target speaker one with Spanish data and the other using English data (Akbacak and Hansen, 2007). During the recognition phase, first a language detector is used to detect the language of test utterance for choosing the correct speaker model (Akbacak and Hansen, 2007). However, both of these two proposed solutions require knowledge about the languages of training and test utterances.…”
Section: Sectionmentioning
confidence: 99%
“…To alleviate this degradation, authors proposed to model each speaker using both languages (Ma and Meng, 2004). Another solution for bilingual speaker recognition is training two separate speaker models for each target speaker one with Spanish data and the other using English data (Akbacak and Hansen, 2007). During the recognition phase, first a language detector is used to detect the language of test utterance for choosing the correct speaker model (Akbacak and Hansen, 2007).…”
Section: Sectionmentioning
confidence: 99%