2022
DOI: 10.1109/taslp.2022.3205752
|View full text |Cite
|
Sign up to set email alerts
|

Task-Specific Optimization of Virtual Channel Linear Prediction-Based Speech Dereverberation Front-End for Far-Field Speaker Verification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 77 publications
0
4
0
Order By: Relevance
“…The achieved results show a major improvement in the speaker recognition domain compared to the current state of the art systems, as seen in Table I. Table I compares the proposed VBxVPE system against the top two systems [19], [20] in the SITW Speech Recognition Challenge 2016 [30] along with four state-of-the-art speaker verification and recognition systems [10], [24], [25], [27]. It can be observed that the VBxVPE speaker verification system demonstrates an improved performance on both the single and multi-speaker settings.…”
Section: Resultsmentioning
confidence: 97%
See 3 more Smart Citations
“…The achieved results show a major improvement in the speaker recognition domain compared to the current state of the art systems, as seen in Table I. Table I compares the proposed VBxVPE system against the top two systems [19], [20] in the SITW Speech Recognition Challenge 2016 [30] along with four state-of-the-art speaker verification and recognition systems [10], [24], [25], [27]. It can be observed that the VBxVPE speaker verification system demonstrates an improved performance on both the single and multi-speaker settings.…”
Section: Resultsmentioning
confidence: 97%
“…Since the VBxVPE system relies on a PLDA model, pre-trained on a large number of speaker-labeled x-vectors [26], the SITW development set [28] was not required at any stage. The SITW evaluation set is composed of a total of 180 different speakers across 2,883 audio files naturally containing overlapping utterances, noise, reverberation, and compression artifacts, making the dataset challenging from a speaker recognition perspective [28], [29].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations