2020 28th European Signal Processing Conference (EUSIPCO) 2021
DOI: 10.23919/eusipco47968.2020.9287690
|View full text |Cite
|
Sign up to set email alerts
|

Data augmentation versus noise compensation for x-vector speaker recognition systems in noisy environments

Abstract: The explosion of available speech data and new speaker modeling methods based on deep neural networks (DNN) have given the ability to develop more robust speaker recognition systems. Among DNN speaker modelling techniques, x-vector system has shown a degree of robustness in noisy environments. Previous studies suggest that by increasing the number of speakers in the training data and using data augmentation more robust speaker recognition systems are achievable in noisy environments. In this work, we want to k… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
19
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(19 citation statements)
references
References 17 publications
(18 reference statements)
0
19
0
Order By: Relevance
“…In [4] the importance of using denoising techniques alongside data augmentation was explored to make the xvector system robust against additive noise. The 66% relative improvement of EER in the x-vector domain shows that noise compensation in speaker modeling level is very effective for speaker recognition systems.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…In [4] the importance of using denoising techniques alongside data augmentation was explored to make the xvector system robust against additive noise. The 66% relative improvement of EER in the x-vector domain shows that noise compensation in speaker modeling level is very effective for speaker recognition systems.…”
Section: Related Workmentioning
confidence: 99%
“…Using a huge amount of data makes the DNN based systems robust against noise and reverberation but in severe conditions the performance can degrade drastically [3]. Therefore, explicit denoising and dereverberation can make these systems more robust [4,5].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Deep Learning offers i-vector and x-vector features [15] [16]. These features are a fusion of MFCC, DWT and MFCC, Gammatone Frequency Cepstral Coefficients (GFCC) respectively.…”
Section: Introductionmentioning
confidence: 99%