2021
DOI: 10.1016/j.neunet.2021.03.004
|View full text |Cite
|
Sign up to set email alerts
|

Speaker recognition based on deep learning: An overview

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
92
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 243 publications
(116 citation statements)
references
References 201 publications
0
92
0
1
Order By: Relevance
“…Their research served as a short survey of the analytical inquiries and the explications of the speaker recognition domain. Zhongxin Bai et al [27] reviews various significant speaker recognition subdomains such as speaker identification, verification, diarization etc., focusing on deeplearning-based approaches. Modern and newly published deep learning-based feature extraction approaches, ASR algorithms are extensively explained in this paper.…”
Section: Reference Year Main Purpose Challengesmentioning
confidence: 99%
See 1 more Smart Citation
“…Their research served as a short survey of the analytical inquiries and the explications of the speaker recognition domain. Zhongxin Bai et al [27] reviews various significant speaker recognition subdomains such as speaker identification, verification, diarization etc., focusing on deeplearning-based approaches. Modern and newly published deep learning-based feature extraction approaches, ASR algorithms are extensively explained in this paper.…”
Section: Reference Year Main Purpose Challengesmentioning
confidence: 99%
“…In a stagewise speaker recognition systems, the recognition tasks such as speaker identification, speaker verification or speaker diarization are processed in two stages: front-end and back-end [27]. Various algorithms are employed in the front end and back end to complete the speaker recognition task.…”
Section: A Stagewise Speaker Recognitionmentioning
confidence: 99%
“…In the last five years, deep learning methods have been demonstrated to outperform most of the classical speech and speaker recognition systems such as GMM-Universal Background Model (UBM), SVM, and i -vector [ 32 , 33 ]. However, deep learning systems require huge speech databases to be labeled and trained; theses databases also need to include phonetically rich sentences or at least phonetically balanced sentences [ 31 ].…”
Section: Related Workmentioning
confidence: 99%
“…More specifically, a GMM-Universal Background Model (UBM) was used by [3] to predict PD severity in a longitudinal study. Yet the current trend has now shifted to the use of deep neural networks (DNN) [6]. Indeed, many recent performance advancements in speaker recognition and verification tasks are achieved through the use of x-vectors and other similar embedding approaches [73, 10].…”
Section: Introductionmentioning
confidence: 99%