2017
DOI: 10.1587/transinf.2016edl8182
|View full text |Cite
|
Sign up to set email alerts
|

Lexicon-Based Local Representation for Text-Dependent Speaker Verification

Abstract: SUMMARYA text-dependent i-vector extraction scheme and a lexicon-based binary vector (L-vector) representation are proposed to improve the performance of text-dependent speaker verification. I-vector and L-vector are used to represent the utterances for enrollment and test. An improved cosine distance kernel is constructed by combining i-vector and L-vector together and is used to distinguish both speaker identity and lexical (or text) diversity with back-end support vector machine (SVM). Experiments are condu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 6 publications
(6 reference statements)
0
3
0
Order By: Relevance
“…Furthermore, as we expected, the general performance is worse since the system suffers from the lexical similarity of the short commands. Thus, this part is more challenging than Part I as we can also see in other previous works [15,16].…”
Section: Experiments With Rsr-part IImentioning
confidence: 80%
See 1 more Smart Citation
“…Furthermore, as we expected, the general performance is worse since the system suffers from the lexical similarity of the short commands. Thus, this part is more challenging than Part I as we can also see in other previous works [15,16].…”
Section: Experiments With Rsr-part IImentioning
confidence: 80%
“…The application of DNNs and the same techniques as in text-independent models for text-dependent speaker verification tasks has produced mixed results. On the one hand, specific modifications of the traditional techniques have been shown to be successful for text-dependent tasks such as i-vector+PLDA/Support Vector Machines (SVM) [14][15][16], DNNs bottleneck as features for i-vector extractors [17] or posterior probabilities for i-vector extractors [17,18]. On the other hand, speaker embeddings obtained directly from a DNN have provided good results in tasks with large amounts of data and a single phrase [19], but they have not been as effective in tasks with more than one pass phrase and smaller database sizes [4,5].…”
Section: Introductionmentioning
confidence: 99%
“…The verification performance of 1.22% equal error rate (EER) is achieved. A lexicon-based local representation algorithm for text-dependent i-vector speaker verification system is presented in [18].The speaker recognition system based on Gaussian mixer model-based support vector machine (GMM-SVM) and the nuisance attribute projection (NAP) technique for channel compensation is presented in [19]. Time alignment of different utterances is a serious problem for distance measures and small shift would lead to incorrect identification in text-dependent speaker recognition.…”
Section: Review Of Related Workmentioning
confidence: 99%