ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683616
|View full text |Cite
|
Sign up to set email alerts
|

Speaker Verification Using End-to-end Adversarial Language Adaptation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
2

Relationship

2
8

Authors

Journals

citations
Cited by 40 publications
(23 citation statements)
references
References 5 publications
0
23
0
Order By: Relevance
“…Utilizing adversarial networks has also been explored for speaker recognition. Proposed solutions in [32,33,27,27,28] explore the use of adversarial networks and generative adversarial networks both as discriminative models for verification of the speaker as well as generative models. Such generative models were mostly used to transform the conditions of the utterance into more convenient environments in which to perform the speaker recognition task.…”
Section: Related Workmentioning
confidence: 99%
“…Utilizing adversarial networks has also been explored for speaker recognition. Proposed solutions in [32,33,27,27,28] explore the use of adversarial networks and generative adversarial networks both as discriminative models for verification of the speaker as well as generative models. Such generative models were mostly used to transform the conditions of the utterance into more convenient environments in which to perform the speaker recognition task.…”
Section: Related Workmentioning
confidence: 99%
“…Our approach of conditioning the reconstruction on the estimated phone sequence of each segment can be employed, enabling such approaches to be revisited in an end-to-end fashion. Other recent approaches aiming at enhancing the x-vector architecture with adversarial loss are also relevant, since they are propose joint training of the network with auxiliary losses and structures which are removed in runtime [8,9,10].…”
Section: Related Work 21 Speaker Recognition Using Autoencodersmentioning
confidence: 99%
“…To improve the performance of x-vectors, recently proposed methods for applying domain adaptation to the x-vector extractor (e.g. using Generative Adversarial Networks [41], [42]) are worth exploring, in order to reduce the mismatch in channel and accent between VoxCeleb and RSR2015.…”
Section: Comparison With X-vectormentioning
confidence: 99%