2012
DOI: 10.1016/j.specom.2012.05.002

A review on speaker diarization systems and approaches

Cited by 85 publications (43 citation statements)
References 138 publications
“…Given any two i-vectors and , the cosine similarity score is given as (2) Equivalently, this means we can normalize the i-vectors by their respective magnitudes such that they all live on the unit hypersphere and the measure of the distance between two i-vectors is given by their angle.…”
Section: B. The Total Variability Approach (mentioning)
confidence: 99%
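
The equivalence described in this statement can be checked numerically. The following is a minimal sketch, not the cited authors' implementation: the function names and the toy three-dimensional vectors are assumptions made for illustration, and real i-vectors are much higher-dimensional. It computes the cosine similarity as the dot product of the length-normalized vectors, so the score depends only on the angle between them.

```python
import math

def normalize(v):
    """Scale a vector to unit length so it lies on the unit hypersphere."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]

def cosine_similarity(a, b):
    """Cosine of the angle between a and b (dot product of the unit vectors)."""
    ua, ub = normalize(a), normalize(b)
    return sum(x * y for x, y in zip(ua, ub))

def angle_between(a, b):
    """Angle in radians between a and b."""
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    c = max(-1.0, min(1.0, cosine_similarity(a, b)))
    return math.acos(c)

# Hypothetical toy vectors standing in for i-vectors.
w1 = [1.0, 2.0, 2.0]
w2 = [2.0, 4.0, 4.0]   # same direction as w1, different magnitude
w3 = [2.0, -1.0, 0.0]  # orthogonal to w1
```

Because `w2` is a scaled copy of `w1`, the two map to the same point on the unit hypersphere and score as maximally similar, while `w1` and `w3` are orthogonal and score zero.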
“…Speaker diarization is the "who spoke when" problem, also called speaker indexing (Wellekens, 2001; Miró et al., 2012; Moattar and Homayounpour, 2012). The input is audio features sampled at a 100 Hz frame rate, and the output is frame labels indicating speaker identity for each frame.…”
Section: Speaker Diarization (mentioning)
confidence: 99%
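
To make the input/output description in this statement concrete, the sketch below turns per-frame speaker labels into timed "who spoke when" segments. It is an illustration only: the function name, the label values, and the tuple layout are assumptions, and only the 100 Hz frame rate comes from the quoted text.

```python
FRAME_RATE_HZ = 100  # frame rate stated in the quoted passage

def frames_to_segments(labels, frame_rate=FRAME_RATE_HZ):
    """Collapse per-frame speaker labels into (start_s, end_s, speaker) runs."""
    segments = []
    start = 0
    for i in range(1, len(labels) + 1):
        # Close the current run at the end of input or when the label changes.
        if i == len(labels) or labels[i] != labels[start]:
            segments.append((start / frame_rate, i / frame_rate, labels[start]))
            start = i
    return segments

# Hypothetical labels: 1.5 s of speaker "A", 0.5 s of "B", then 1.0 s of "A".
labels = ["A"] * 150 + ["B"] * 50 + ["A"] * 100
segments = frames_to_segments(labels)
```

Each tuple gives a start time, an end time, and a speaker label in seconds, which is the segment-level view usually reported by a diarization system.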
“…Speaker diarization research mainly tackles the simultaneous estimation of speaker segmentation (voice activity detection) and clustering (estimation of the number of speakers). Besides methods based on monaural signals [1], [2], microphone array technologies tackle this by introducing spatial information about the speakers. However, most of the existing methods assume that the microphone location is given in order to estimate the direction of arrival of speakers [3]-[6].…”
Section: Introduction (mentioning)
confidence: 99%