2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2011
DOI: 10.1109/icassp.2011.5947650
|View full text |Cite
|
Sign up to set email alerts
|

Robust speaker turn role labeling of TV Broadcast News shows

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 22 publications
(15 citation statements)
references
References 9 publications
0
15
0
Order By: Relevance
“…In [1], we have proposed a multi-stage process for speaker turn role labeling. In this work, we pursue this features (basically word n-grams, with the possibility of DGGLQJ QXPHULFDO IHDWXUHV VXFK DV HORFXWLRQ VSHHG HWF« The aim is to find the label among (polit,¬ polit) which gives the highest probability given the feature vector.…”
Section: Role Labeling Of Speaker Turnsmentioning
confidence: 99%
See 2 more Smart Citations
“…In [1], we have proposed a multi-stage process for speaker turn role labeling. In this work, we pursue this features (basically word n-grams, with the possibility of DGGLQJ QXPHULFDO IHDWXUHV VXFK DV HORFXWLRQ VSHHG HWF« The aim is to find the label among (polit,¬ polit) which gives the highest probability given the feature vector.…”
Section: Role Labeling Of Speaker Turnsmentioning
confidence: 99%
“…In previous work [1], we proposed a multi-view approach applied to TV Broadcast News (TVBN) shows, where three categories of speaker turns were distinguished (anchor speaker, reporters and other speakers). Beyond this 3-fold distinction, characterization of non-journalist speakers can be extended to special categories of people which have specific speaking style and lexical fields, such as politicians, VSRUWVPHQ ODZ\HUV« 7KLV ZRUN SURSRVHV WR IRFXV RQ RQH particular category, namely politician speakers.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…[1][2][3][4][5][6][7][8][9] Concerning the audio data, the automatic analysis of the audio signals can offer the users useful information. In the case of broadcast news, automatic processing is related to tasks such as sound recognition, 10,11 speaker recognition, 12 anchor detection, 13 role detection, [14][15][16] story boundary detection, 2,17,18 summary construction from anchor talking, 9,19 channel's quality detection, 20 sound event detection, 21,22 non-linguistic humanproduced sounds detection, 5,6,[23][24][25] audio type segmentation in sport games, 4,26,27 highlight scene extraction from sports games, 3 violence scene detection, 28 music characteristics classification, 29,30 jingle detection, 1 commercial block detection, 8 voice activity detection, 31 language recognition, 32 emotion recognition 33 and speech recognition. 34 Sound recognition is the cornerstone of analysis as typically precedes the other stages.…”
Section: Introductionmentioning
confidence: 99%
“…Typical roles considered in BN audio are formal roles (also referred as functional roles), i.e., roles imposed from the news format and related to the task each speaker performs in the show like anchorman, journalists, interviewees or soundbites. Common features used to train statistical classifiers consist of lexical features [1] as well as structural features from the recording, prosodic features and Dialog Acts [2,3,4]. More recently, automatic role labeling has also been studied in spontaneous conversations including Broadcast Conversations (BC) [3,5,6] as well as meeting recordings [7,8].…”
Section: Introductionmentioning
confidence: 99%