2019
DOI: 10.1007/978-981-13-9443-0_8
|View full text |Cite
|
Sign up to set email alerts
|

Generating Fillers Based on Dialog Act Pairs for Smooth Turn-Taking by Humanoid Robot

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 14 publications
0
5
0
Order By: Relevance
“…Several studies investigated the prosodic features of fillers [26,27], but there are only a limited studies on prediction of fillers [28,29].…”
Section: Filler Predictionmentioning
confidence: 99%
“…Several studies investigated the prosodic features of fillers [26,27], but there are only a limited studies on prediction of fillers [28,29].…”
Section: Filler Predictionmentioning
confidence: 99%
“…Over the years, numerous scholars have extensively researched the prediction of pause fillers using various methods, which has resulted in more natural and authentic text generation. For instance, Nakanishi R. et al proposed a method based on analyzing humanrobot interaction data and machine learning models to predict the occurrence and appropriate forms of pause fillers, aiming to generate them at the beginning of system utterances in humanoid robot spoken dialog systems, indicating turn-taking or turn-holding intentions [1]. Balagopalan A. et al compared two common methods for AD detection on a matched dataset, assessing the advantages of domain knowledge and BERT pre-trained transfer models in predicting pauses and interruptions [2].…”
Section: Prediction Of Pause Fillersmentioning
confidence: 99%
“…2) Unlike the detection task, where audio alone can achieve reasonable performance (although lexical cues often help), the synthesis task requires suitable generation of both acoustic and lexical behaviors. When synthesizing fillers and backchannels, the meanings largely depend on their morphological forms [15,21,22,32]. To address this challenge, the majority of the synthesis task is still performed following a rule-based method.…”
Section: Introductionmentioning
confidence: 99%