International Conference on Acoustics, Speech, and Signal Processing
DOI: 10.1109/icassp.1990.115617
|View full text |Cite
|
Sign up to set email alerts
|

Optimisation of neural models for speaker identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
33
0
2

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 45 publications
(35 citation statements)
references
References 8 publications
0
33
0
2
Order By: Relevance
“…The i-vector-PLDA technique and its variants have also been successfully used in text-dependent speaker recognition tasks [8,9,10]. In past studies, neural networks have been investigated for speaker recognition [11,12]. Being nonlinear classifiers, neural networks can discriminate the characteristics of different speakers.…”
Section: Previous Workmentioning
confidence: 99%
“…The i-vector-PLDA technique and its variants have also been successfully used in text-dependent speaker recognition tasks [8,9,10]. In past studies, neural networks have been investigated for speaker recognition [11,12]. Being nonlinear classifiers, neural networks can discriminate the characteristics of different speakers.…”
Section: Previous Workmentioning
confidence: 99%
“…In the speaker verification mode, the input vectors of the unknown user are fed forward through the network belonging to the claimed speaker. If the average output value is bigger than a threshold, the speaker is accepted (Oglesby and Mason, 1990). Rudasi and Zahorian (1991) demonstrated that by using small binary networks for distinguishing between two speakers instead of one large network with one output for each known speaker, the performance in speaker recognition was much better, since the binary networks were much more specialised.…”
Section: Gaussian Mixture Modelsmentioning
confidence: 99%
“…Theoretically, any multicategory classification task can be decomposed into a set of binary classification subtasks, where each subtask is to discriminate between the data belonging to a specific class and all the others. By this fact, some connectionist methods have been proposed by constructing a set of neural networks with binary outputs for speaker identification [13], [21]. Indeed, those neural networks of binary outputs may work in a parallel way, which speeds up training.…”
Section: Introductionmentioning
confidence: 99%
“…In previous studies, connectionist approaches have been applied to build speaker identification systems [2], [3], [7], [9], [13], [21] where a neural-network model is typically used to characterize all the speakers' voice in a given set. In this circumstance, the input space of a neural network is composed of feature vectors extracted from acoustic signals belonging to all the 1045-9227/02$17.00 漏 2002 IEEE speakers, while the outputs are usually labels of corresponding speaker identities.…”
Section: Introductionmentioning
confidence: 99%