2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639150
|View full text |Cite
|
Sign up to set email alerts
|

Probabilistic linear discriminant analysis of i-vector posterior distributions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
43
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 53 publications
(43 citation statements)
references
References 4 publications
0
43
0
Order By: Relevance
“…Varying utterance duration was compensated for by calibrating the PLDA score in (Hasan et al, 2013), and by exploiting the uncertainty in the i-vector in (Cumani et al, 2013b). The effect of noise on PLDA-based systems is studied in (Mandasari et al, 2012).…”
Section: Introductionmentioning
confidence: 99%
“…Varying utterance duration was compensated for by calibrating the PLDA score in (Hasan et al, 2013), and by exploiting the uncertainty in the i-vector in (Cumani et al, 2013b). The effect of noise on PLDA-based systems is studied in (Mandasari et al, 2012).…”
Section: Introductionmentioning
confidence: 99%
“…Using the permutation property of the trace for the second term we get: (23) Although the dimension of is huge, we need only its diagonal because, for any feasible solution , matrix is diagonal. Moreover, since the atoms of the dictionary matrix are normalized, the diagonal elements of are .…”
Section: Matrix Optimizationmentioning
confidence: 99%
“…In [21] we have highlighted that the incidence of the time spent for i-vector computation in a system using large models and scoring long speaker segments is negligible compared to the importance of keeping the original accuracy and saving memory. However, the effectiveness of the i-vector extractor is more relevant for systems dealing with short utterances [22], [23], [24], [25] such as, for example, the text prompts in speaker verification [26], [27]. In this paper we propose a new approximate i-vector extraction approach particularly useful for applications that need to optimize their memory requirements without sensibly affecting their performance and speed.…”
Section: Introductionmentioning
confidence: 99%
“…The first is based on speaker modelling (SM) which uses the assumption that each individual has different voice characteristics. Traditionally, speaker models are constructed with Gaussian mixture models (GMMs) and i-vectors [6,7,8], but more recently deep learning has been proven effective for speaker modelling [9,10,11,12,13]. In many systems, the models are often pre-trained for the target speakers [14,15] and are not applicable to unknown participants.…”
Section: Introductionmentioning
confidence: 99%