2012
DOI: 10.1186/1687-4722-2012-19

Speaker diarization of broadcast news in Albayzin 2010 evaluation campaign

Abstract: In this article, we present the evaluation results for the task of speaker diarization of broadcast news, which was part of the Albayzin 2010 evaluation campaign of language and speech technologies. The evaluation data consists of a subset of the Catalan broadcast news database recorded from the 3/24 TV channel. The description of five submitted systems from five different research labs is given, marking the common as well as the distinctive system features. The diarization performance is analyzed in the conte…

Cited by 29 publications (23 citation statements)
References 15 publications (17 reference statements)
“…These evaluation campaigns provide an objective mechanism to compare different systems and are a powerful way to promote research on different speech technologies [56][57][58][59][60][61][62][63].…”
Section: Motivation and Organization Of This Paper (mentioning)
confidence: 99%
“…We created an open recipe to enable participants to build a baseline ASR system using the Kaldi toolkit [17], as well as XMLStarlet, and the SRILM and IRSTLM toolkits. This baseline system simplified and automated the data pre-processing tasks, thus allowing participants to focus on more advanced aspects of ASR model building.…”
Section: Baseline System (mentioning)
confidence: 99%
“…There have been evaluations of, and corpora for, the rich transcription and diarization of broadcast speech since the mid-1990s [1,2,3,4,5], but all have been limited domain, typically broadcast news. The MediaEval evaluation of multimodal search and hyperlinking [6] used, but did not evaluate, automatic transcriptions of multi-genre broadcast data (in fact the same acoustic data used in the MGB challenge).…”
Section: Introduction (mentioning)
confidence: 99%
“…This campaign is an internationally open set of evaluations supported by the Spanish Network of Speech Technologies (RTTH [32]) and the ISCA Special Interest Group on Iberian Languages (SIG-IL [33]), which have been held every 2 years since 2006. The evaluation campaigns provide an objective mechanism to compare different systems and are a powerful way to promote research on different speech technologies (e.g., speech segmentation [34], speaker diarization [35], language recognition [36], query-by-example spoken term detection [37], and speech synthesis [38] in the ALBAYZIN 2010 and 2012 evaluation campaigns). This year, this campaign has been held during the IberSPEECH 2014 conference [39].…”
Section: Introduction (mentioning)
confidence: 99%