2024
DOI: 10.1109/taslp.2024.3385277
|View full text |Cite
|
Sign up to set email alerts
|

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification

Tianchi Liu,
Kong Aik Lee,
Qiongqiong Wang
et al.

Abstract: The residual neural networks (ResNet) demonstrate the impressive performance in automatic speaker verification (ASV). They treat the time and frequency dimensions equally, following the default stride configuration designed for image recognition, where the horizontal and vertical axes exhibit similarities. This approach ignores the fact that time and frequency are asymmetric in speech representation. We address this issue and postulate Golden-Gemini Hypothesis, which posits the prioritization of temporal resol… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 95 publications
(176 reference statements)
0
0
0
Order By: Relevance