2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6853887
|View full text |Cite
|
Sign up to set email alerts
|

A novel scheme for speaker recognition using a phonetically-aware deep neural network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
162
0
1

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 367 publications
(166 citation statements)
references
References 9 publications
3
162
0
1
Order By: Relevance
“…These are used to compute SS using the feature vectors of an utterance. This approach achieved significant improvements over a baseline i-vector system (Lei et al, 2014).…”
Section: Dnn Based Systemmentioning
confidence: 93%
See 3 more Smart Citations
“…These are used to compute SS using the feature vectors of an utterance. This approach achieved significant improvements over a baseline i-vector system (Lei et al, 2014).…”
Section: Dnn Based Systemmentioning
confidence: 93%
“…Training a PLDA model for the SV task uses speaker labels to define a set of classes to be discriminated. It is common to have multiple instances of speaker labelled i-vectors available for large text-independent datasets (Romero and McCree, 2014;Lei et al, 2014). For a text-dependent scenario, the outcome of the task is linked to identifying content and speaker.…”
Section: Plda Projection Featuresmentioning
confidence: 99%
See 2 more Smart Citations
“…The i-vector model plus various normalization approaches offers the standard framework for modern speaker verification systems [1], [2], [3], [4]. Basically, the i-vector model uses a Gaussian mixture model (GMM) or a deep neural network (DNN) to collect the Baum-Welch sufficient statistics of an utterance, and then projects it onto a low-dimensional total variability space.…”
Section: Introductionmentioning
confidence: 99%