ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414877
|View full text |Cite
|
Sign up to set email alerts
|

Double Multi-Head Attention for Speaker Verification

Abstract: Most state-of-the-art Deep Learning systems for text-independent speaker verification are based on speaker embedding extractors. These architectures are commonly composed of a feature extractor front-end together with a pooling layer to encode variable-length utterances into fixed-length speaker vectors. In this paper we present Double Multi-Head Attention (MHA) pooling, which extends our previous approach based on Self MHA. An additional self attention layer is added to the pooling layer that summarizes the c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 23 publications
(35 reference statements)
0
10
0
Order By: Relevance
“…The results of [17] experiments are shown in Table 1. Performance was evaluated using Equal Error Rate (EER).…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…The results of [17] experiments are shown in Table 1. Performance was evaluated using Equal Error Rate (EER).…”
Section: Resultsmentioning
confidence: 99%
“…The DMHSA system has been assessed in [17] by VoxCeleb dataset [20,21]. VoxCeleb is a large multimedia database that contains more than one million 16kHz audio utterances for more than 6K celebrities and has two different versions with several evaluation protocols.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations