ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053485
|View full text |Cite
|
Sign up to set email alerts
|

A Discriminative Condition-Aware Backend for Speaker Verification

Abstract: We present a scoring approach for speaker verification that mimics the standard PLDA-based backend process used in most current speaker verification systems. However, unlike the standard backends, all parameters of the model are jointly trained to optimize the binary cross-entropy for the speaker verification task. We further integrate the calibration stage inside the model, making the parameters of this stage depend on metadata vectors that represent the conditions of the signals. We show that the proposed ba… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 18 publications
0
4
0
Order By: Relevance
“…In recent years, there are three kinds of representative methods to improve the backend. The first category is about PLDA, such as Neural PLDA [17], discriminative PLDA (DPLDA) [18], heavy-tailed PLDA (HT-PLDA) [19], multi-objective optimization training of PLDA (Mot-PLDA) [20] and etc. The second is to add an additional trainable neural network module, e.g., decision residual networks (Dr-vectors) [21], deep learning backend (DLB) [22] and tied variational autoencoder (TVAE) [23].…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, there are three kinds of representative methods to improve the backend. The first category is about PLDA, such as Neural PLDA [17], discriminative PLDA (DPLDA) [18], heavy-tailed PLDA (HT-PLDA) [19], multi-objective optimization training of PLDA (Mot-PLDA) [20] and etc. The second is to add an additional trainable neural network module, e.g., decision residual networks (Dr-vectors) [21], deep learning backend (DLB) [22] and tied variational autoencoder (TVAE) [23].…”
Section: Introductionmentioning
confidence: 99%
“…We proposed to capture the information about the conditions that affect calibration in a sideinformation vector, which was then used to determine the parameters of the calibration stage. In our first paper (Ferrer and McLaren, 2020a), the sideinformation vector was generated by a separate model, which was trained to predict labeled conditions in the training data. These condition labels were derived from the available metadata in each of the training datasets, which had different levels of detail depending on the dataset.…”
Section: Condition-aware Discriminative Backendmentioning
confidence: 99%
“…Such scores instability impairs SR systems in real applications. In order to deal with this problem different calibration strategies [8], [9] and compensation techniques [10], [11] were implemented.…”
Section: Introductionmentioning
confidence: 99%