Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-448
|View full text |Cite
|
Sign up to set email alerts
|

Adapting Speaker Embeddings for Speaker Diarisation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 0 publications
0
4
0
Order By: Relevance
“…Therefore, instead of tuning the threshold for each domain data, we adopt clustering with a silhouette coefficient trick. Some studies [10,11,24,25] already composed their clustering-based SD systems using silhouette coefficient, and those systems show superior performance on various datasets without threshold tuning.…”
Section: Initial Clustering Phasementioning
confidence: 99%
See 3 more Smart Citations
“…Therefore, instead of tuning the threshold for each domain data, we adopt clustering with a silhouette coefficient trick. Some studies [10,11,24,25] already composed their clustering-based SD systems using silhouette coefficient, and those systems show superior performance on various datasets without threshold tuning.…”
Section: Initial Clustering Phasementioning
confidence: 99%
“…Then we extract speaker embeddings using a sliding window with a 1.5s window and a 0.5s shift. We utilise the H / ASP architecture [31] as our model and prepare the model under the training protocol described in [10].…”
Section: Implementation Detailsmentioning
confidence: 99%
See 2 more Smart Citations