2023
DOI: 10.15276/hait.06.2023.7

The structural tuning of the convolutional neural network for speaker identification in mel frequency cepstrum coefficients space

Anastasiia D. Matychenko,
Marina V. Polyakova

Abstract: A literature analysis identified the main methods for speaker identification from speech signals: statistical methods based on a Gaussian mixture model and a universal background model, and neural network methods, in particular those using convolutional or Siamese neural networks. The main characteristics of these methods are recognition performance, the number of parameters, and training time. High recognition performance is achieved by usin…

Citations: cited by 1 publication (1 citation statement)
References: 25 publications

“…Suppose that for image classification, a CNN with architecture S and parameters P is presynthesized, CNN={S, P} [14,15]. This network has already learned earlier to extract features for solving the problem of image classification.…”
Section: Formulation of the Problem
Citation type: mentioning
confidence: 99%