2011
DOI: 10.1186/1687-4722-2011-1

Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion

Abstract: Recently, audio segmentation has attracted research interest because of its usefulness in several applications like audio indexing and retrieval, subtitling, monitoring of acoustic scenes, etc. Moreover, a previous audio segmentation stage may be useful to improve the robustness of speech technologies like automatic speech recognition and speaker diarization. In this article, we present the evaluation of broadcast news audio segmentation systems carried out in the context of the Albayzín-2010 evaluation campai…

Cited by 30 publications (31 citation statements)
References 12 publications
“…Using this database an audio segmentation task was proposed, where the systems were required to identify the presence of speech, music and/or noise, either isolated or overlapped. The Albayzín-2014 Audio Segmentation Evaluation contributed to the evolution of the audio segmentation technology in broadcast news domains by providing a more general and realistic database, compared to those used in the Albayzín-2010 and -2012 Audio Segmentation Evaluations [10,30]. The main features of the approaches and the results attained by seven segmentation systems from four different research groups have been presented and briefly analyzed.…”
Section: Discussion (mentioning)
confidence: 99%
“…However, some classes are better described by the statistics computed over longer periods of time (from 0.5 to 5 s long). These characteristics are referred in the literature as segment-based features [29,30]. For example, in [31], a content-based speech discrimination algorithm is designed to exploit the long-term information inherent in the modulation spectrum; and in [32], authors propose two segment-based features: the variance of the spectrum flux (VSF) and the variance of the zero crossing rate (VZCR).…”
Section: General Description Of Audio Segmentation Systems (mentioning)
confidence: 99%
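The segment-based features named in the statement above can be illustrated with a small sketch. The excerpt does not give the exact definition used in [32], so the code below assumes a common reading of VZCR: the zero crossing rate is computed per short frame, and its variance is then taken over all frames in one segment. The function names, the non-overlapping framing, and the `frame_len` parameter are hypothetical choices made for illustration only.

```python
from statistics import pvariance


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def vzcr(samples, frame_len):
    """Variance of the per-frame zero crossing rate over one segment.

    Assumption: frames are non-overlapping and a trailing partial
    frame is dropped; the cited work may frame the signal differently.
    """
    frames = [
        samples[i:i + frame_len]
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]
    rates = [zero_crossing_rate(f) for f in frames]
    # Population variance; a single frame has no spread to measure.
    return pvariance(rates) if len(rates) > 1 else 0.0
```

Under this reading, a segment whose frames all behave alike yields a variance near zero, while a segment that alternates between rapidly oscillating and steady frames yields a larger value.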
“…However, there is an important amount of long segments (longer than 60 s). More details about the database and the labeling process can be found in [19].…”
Section: Database (mentioning)
confidence: 99%
“…A complete description of the Albayzin 2010 audio segmentation and classification evaluation can be found in [19] where the participant's approaches and the results are presented. We describe the database and the metric used in the evaluation in the next subsections.…”
Section: Albayzin Audio Segmentation Evaluations and Database Descrip… (mentioning)
confidence: 99%