Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
DOI: 10.1109/icslp.1996.607962
|View full text |Cite
|
Sign up to set email alerts
|

Dialog act classification with the help of prosody

Abstract: This paper presents automatic methods for the segmentation and classication of dialog acts (DA). In Verbmobil it is often sucient to recognize the sequence of DAs occurring during a dialog between the two partners. Since a turn can consist of one or more successive D As we conduct the classication of DAs in a two step procedure: First each turn has to be segmented into units which correspond to a DA and second the DA categories have to be identied. For the segmentation we use polygrams and multi{layer perceptr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 38 publications
(36 citation statements)
references
References 8 publications
0
36
0
Order By: Relevance
“…A crucial aspect of our work, as well as that of some other researchers [6,5] is that the dependence between prosodic features and target classes (e.g., dialog acts, phrase boundaries) is modeled directly in a statistical classifier-without the use of intermediate abstract phonological categories, such as pitch accent or boundary tone labels. This bypasses the need to hand-annotate such labels for training purposes, avoids problems of annotation reliability, and allows the model to choose the level of granularity of the representation that is best suited for the task [2].…”
Section: Direct Modeling Of Target Classesmentioning
confidence: 99%
“…A crucial aspect of our work, as well as that of some other researchers [6,5] is that the dependence between prosodic features and target classes (e.g., dialog acts, phrase boundaries) is modeled directly in a statistical classifier-without the use of intermediate abstract phonological categories, such as pitch accent or boundary tone labels. This bypasses the need to hand-annotate such labels for training purposes, avoids problems of annotation reliability, and allows the model to choose the level of granularity of the representation that is best suited for the task [2].…”
Section: Direct Modeling Of Target Classesmentioning
confidence: 99%
“…For example, House carried out extensive studies on Swedish (e.g., [14]), extending them with some multimodal aspects (e.g., [15]). Much hope was risen by early works on the prosodic properties of the realisations of selected dialogue acts [16,17,18].…”
Section: Dialogue Acts and Prosodymentioning
confidence: 99%
“…In [46], prosody is used to segment utterances. The duration, pause, F0-contour and energy features are used in [13] and [47].…”
Section: Related Workmentioning
confidence: 99%