2017
DOI: 10.1016/j.csl.2016.12.007
|View full text |Cite
|
Sign up to set email alerts
|

Fast i-vector denoising using MAP estimation and a noise distributions database for robust speaker recognition

Abstract: Once the i-vector paradigm has been introduced in the field of speaker recognition, many techniques have been proposed to deal with additive noise within this framework. Due to the complexity of its effect in the i-vector space, a lot of effort has been put into dealing with noise in other domains (speech enhancement, feature compensation, robust i-vector extraction and robust scoring). As far as we know, there was no serious attempt to handle the noise problem directly in the i-vector space without relying on… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
12
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 18 publications
(13 citation statements)
references
References 20 publications
0
12
0
Order By: Relevance
“…In the previous generation of speaker recognition systems (i.e., i-vector) the mapping from distorted to clean vectors is explored broadly. i-MAP [14] and joint i-MAP [15] are two statistical techniques used for noise compensation in the i-vector framework.…”
Section: Related Workmentioning
confidence: 99%
“…In the previous generation of speaker recognition systems (i.e., i-vector) the mapping from distorted to clean vectors is explored broadly. i-MAP [14] and joint i-MAP [15] are two statistical techniques used for noise compensation in the i-vector framework.…”
Section: Related Workmentioning
confidence: 99%
“…I-MAP is a statistical denoising method that is applied in the ivector space. The main advantage of this method is that it uses both information about the relation between noisy and clean speech and the clean speech distribution [5]. A nonparametric algorithm without considering the relation between corrupted and clean i-vector was proposed by [4], that utilizes the joint distribution of corrupted and clean i-vectors to denoise corrupted i-vector with an MMSE estimator.…”
Section: Related Workmentioning
confidence: 99%
“…Applying denoising techniques at the speaker modeling level has been done successfully in the i-vector space [4,5,6]. In this paper we apply statistical denoising techniques on xvectors that works effectively in i-vector domain.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The first methodology is to apply a denoising transformation to speaker embeddings. In [7,8,9], researchers use either statistical back end or neural network back end to transform noisy speaker embeddings into enhanced ones. A problem with this methodology is information loss.…”
Section: Introductionmentioning
confidence: 99%