Interspeech 2009 2009
DOI: 10.21437/interspeech.2009-538
|View full text |Cite
|
Sign up to set email alerts
|

An improved speech segmentation quality measure: the r-value

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 36 publications
(8 citation statements)
references
References 12 publications
(39 reference statements)
0
8
0
Order By: Relevance
“…Evaluation. To evaluate segmentation performance, we use precision, recall, F1 and R-value [51,23]. For the calculation of above metrics, we use a tolerance window of 50ms for SpokenCOCO and Estonian following [17], and 30ms for the Zerospeech Challenge [13].…”
Section: Implementation Detailsmentioning
confidence: 99%
“…Evaluation. To evaluate segmentation performance, we use precision, recall, F1 and R-value [51,23]. For the calculation of above metrics, we use a tolerance window of 50ms for SpokenCOCO and Estonian following [17], and 30ms for the Zerospeech Challenge [13].…”
Section: Implementation Detailsmentioning
confidence: 99%
“…Other performance metrics can be defined for specific segmentation tasks, for example, R$$ R $$‐value 82 or path accuracy; 73 however, they are rarely used and are not therefore utilizable for comparison with most other studies.…”
Section: Methodsmentioning
confidence: 99%
“…We report the agreement between segment boundaries learned by the downsampling strategy and the boundaries of human-defined information-bearing units, such as phones. We evaluate phone segmentation performance through precision, recall, F1-score and over-segmentation robust R-value [Räsänen et al, 2009] of the predicted pseudo-unit boundaries with respect to phone boundaries obtained from the aligned frame labels. We also evaluate boundary prediction on a processed version of the TIMIT dataset [Garofolo et al, 1993] in which non-speech events have been trimmed to a maximum of 20 ms.…”
Section: Methodsmentioning
confidence: 99%