2021 29th European Signal Processing Conference (EUSIPCO) 2021
DOI: 10.23919/eusipco54536.2021.9615983
|View full text |Cite
|
Sign up to set email alerts
|

Compensate multiple distortions for speaker recognition systems

Abstract: The performance of speaker recognition systems reduces dramatically in severe conditions in the presence of additive noise and/or reverberation. In some cases, there is only one kind of domain mismatch like additive noise or reverberation, but in many cases, there are more than one distortion. Finding a solution for domain adaptation in the presence of different distortions is a challenge. In this paper we investigate the situation in which there is none, one or more of the following distortions: early reverbe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
1

Relationship

3
1

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 17 publications
(22 reference statements)
0
8
0
Order By: Relevance
“…Despite having good results for simulated noises this work doesn't include real noise and reverberation. In another work two configurations are proposed to denoise different kinds of distortions such as noise, early reverberation, and late reverberation [7]. In this paper also the capability of doing noise compensation is not explored in real environments.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Despite having good results for simulated noises this work doesn't include real noise and reverberation. In another work two configurations are proposed to denoise different kinds of distortions such as noise, early reverberation, and late reverberation [7]. In this paper also the capability of doing noise compensation is not explored in real environments.…”
Section: Related Workmentioning
confidence: 99%
“…The problem of noise and reverberation is addressed at different levels of speaker recognition systems, including signal level [4], feature level [5], speaker modeling level [6], x-vector level [7] and scoring technique adaptation [8]. Data augmen-tation is another approach to making the speaker recognition systems robust against noise.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Although, the DNN-based speaker embedding systems have given a degree of robustness against acoustic noises, there is a significant degradation of their performance in the presence of background noise, reverberation and other variabilities [4] [5] [6]. Various approaches have been proposed to handle these variabilities in different parts of the system such as: signal level [7], feature level [8], speaker modeling level [9], xvector level [6] and scoring technique level [10]. Addressing the variabilities at each step has its own advantage and disadvantages in terms of data, computational resources, efficiency, etc.…”
Section: Introductionmentioning
confidence: 99%
“…The robustness of the DNN-based speaker recognition (SR) systems in general and specifically their robustness against environment variabilities such as additive noise, reverberation, and far-distance recording device has made them more promising. Several strategies such as data argumentation [4], and noise compensation [5], [6] are explored to make the TDNNbased SR systems more robust against noise and reverberation and other variabilities. The previous research shows the weakness of TDNN-based SRs against noise and reverberation distortions.…”
Section: Introductionmentioning
confidence: 99%