Interspeech 2016 2016
DOI: 10.21437/interspeech.2016-1071
|View full text |Cite
|
Sign up to set email alerts
|

Mahalanobis Metric Scoring Learned from Weighted Pairwise Constraints in I-Vector Speaker Recognition System

Abstract: The i-vector model is widely used by the state-of-the-art speaker recognition system. We proposed a new Mahalanobis metric scoring learned from weighted pairwise constraints (WPCML), which use the different weights for the empirical error of the similar and dissimilar pairs. In the new i-vector space described by the metric, the distance between the same speaker's i-vectors is small, while that of the different speakers' is large. In forming the training set, we use the traditional way in random fashion and de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 10 publications
0
4
0
Order By: Relevance
“…Another study reported [32] that Cosine or Euclidean scoring methods provide a significant improvement than PLDA. The effectiveness of the Mahalanobis scoring method has been explored by [37,38] and presented an excellent performance for the i-vector system in the speaker recognition system. In this paper, we assess the effectiveness of the speaker verification system in different scoring methods, such as Cosine similarity scoring (CSS), Euclidean distance scoring (EDS), and Mahalanobis distance scoring (MDS).…”
Section: Scoring Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Another study reported [32] that Cosine or Euclidean scoring methods provide a significant improvement than PLDA. The effectiveness of the Mahalanobis scoring method has been explored by [37,38] and presented an excellent performance for the i-vector system in the speaker recognition system. In this paper, we assess the effectiveness of the speaker verification system in different scoring methods, such as Cosine similarity scoring (CSS), Euclidean distance scoring (EDS), and Mahalanobis distance scoring (MDS).…”
Section: Scoring Methodsmentioning
confidence: 99%
“…The effectiveness of the Mahalanobis metric for speaker detection scoring has been proven by [37,38]. The score between two i-vectors w target and w test is proportional to the log-probability that both i-vectors belong to a unique class following the covariance matrix τ.…”
Section: Scoring Methodsmentioning
confidence: 99%
“…The development data with short speech segments can be used to train the metric, and subsequently the learned metric can be used for measuring the similarity of i ‐vectors in a pairwise manner. Previous studies in distance metric method in SV do not focus on short‐utterance problem [116, 117]; however, the approach could be explored further to investigate the similarity between i ‐vectors of small speech segments. Sparse methods: The limited data conditions in SV lead to sparsity in sufficient statistics estimation which is successively used in i ‐vector estimation [48, 98]. Methods developed to handle the sparsity issue such as dictionary learning [118], sparse representation [119] can be investigated to effectively process and represent the speech data for short utterances.…”
Section: Future Research Directionsmentioning
confidence: 99%
“…The development data with short speech segments can be used to train the metric, and subsequently the learned metric can be used for measuring the similarity of i ‐vectors in a pairwise manner. Previous studies in distance metric method in SV do not focus on short‐utterance problem [116, 117]; however, the approach could be explored further to investigate the similarity between i ‐vectors of small speech segments.…”
Section: Future Research Directionsmentioning
confidence: 99%