2022
DOI: 10.1109/taslp.2022.3140558
|View full text |Cite
|
Sign up to set email alerts
|

A Principle Solution for Enroll-Test Mismatch in Speaker Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 67 publications
0
4
0
Order By: Relevance
“…This analytical view provides a powerful tool by which we can analyze how the performance reduction is caused by a particular imperfection, and design suitable algorithms to compensate for the impact. Recently, using this tool we provided a theoretically optimal solution for the enroll-test mismatch problem and achieved a big success [17,19].…”
Section: Discussionmentioning
confidence: 99%
“…This analytical view provides a powerful tool by which we can analyze how the performance reduction is caused by a particular imperfection, and design suitable algorithms to compensate for the impact. Recently, using this tool we provided a theoretically optimal solution for the enroll-test mismatch problem and achieved a big success [17,19].…”
Section: Discussionmentioning
confidence: 99%
“…The statistics change and the mean shift cause more severe problems in the cross-genre scenario, as the enroll data and test data in this scenario possess different statistical properties but they have to be represented in a single PLDA model. We presented a deep analysis on this enroll-test mismatch problem in our recent study [76], but mismatch caused by the cross-genres challenge is yet to be thoroughly studied.…”
Section: Discussionmentioning
confidence: 99%
“…Based on various tasks, SR is classified as speaker identification (SI) and speaker verification (SV). In order to identify an unknown speaker, SI [14][15][16][17][18][19][20][21][22][23] analyses their verbal output. From the set of all N registered speakers, it picks the right one, as shown in Fig.…”
Section: E Speaker Identification and Speaker Verificationmentioning
confidence: 99%