2016
DOI: 10.1109/taslp.2015.2499038
|View full text |Cite
|
Sign up to set email alerts
|

Mixture of PLDA for Noise Robust I-Vector Speaker Verification

Abstract: In real-world environments, noisy utterances with variable noise levels are recorded and then converted to i-vectors for cosine distance or PLDA scoring. This paper investigates the effect of noise-level variability on i-vectors. It demonstrates that noise-level variability causes the i-vectors to shift, causing the noise contaminated i-vectors to form clusters in the ivector space. It also demonstrates that optimal subspaces for discriminating speakers are noise-level dependent. Based on these observations, t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
38
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 66 publications
(40 citation statements)
references
References 33 publications
2
38
0
Order By: Relevance
“…In [8], a mixture of channel-dependent PLDA models are trained to account for the channel conditions of each test utterance presented at the detection phase. Another mixture of PLDA Models is presented in [9], where each one is trained with different levels of noise and used according to the signal-to-noise ratio of the test utterance. The work in [10] assessed one channel features-domain noise compensation combined with multi-condition training.…”
Section: Channel Synthesis and Related Workmentioning
confidence: 99%
“…In [8], a mixture of channel-dependent PLDA models are trained to account for the channel conditions of each test utterance presented at the detection phase. Another mixture of PLDA Models is presented in [9], where each one is trained with different levels of noise and used according to the signal-to-noise ratio of the test utterance. The work in [10] assessed one channel features-domain noise compensation combined with multi-condition training.…”
Section: Channel Synthesis and Related Workmentioning
confidence: 99%
“…In addition to comparing these histograms, a second measure of differences between the distribution of i-vectors from long and short utterances based on the Partition Coefficient [15,16] is employed in this paper. The partition coefficient is an index that indicates the clustering tendency in a dataset and lies in the range [1/݇, 1] , where ݇ is the number of clusters.…”
Section: Duration Mismatch In I-vectorsmentioning
confidence: 99%
“…The approach was shown to outperform standard PLDA with pooled training data when each class in the training data is seen under both considered conditions, frontal and profile, in a face recognition task. A similar approach is proposed by [12]; but in this case, the mixture component is not given during training but rather is dependent on a continuous metadata value. The approach is tested by adding noise to the training data at different signal-to-noise (SNR) levels, resulting in gains compared to pooling all the data to train a single PLDA model.…”
Section: Introductionmentioning
confidence: 99%