2015
DOI: 10.1080/01691864.2015.1009164
|View full text |Cite
|
Sign up to set email alerts
|

A cloud robotics approach towards dialogue-oriented robot speech

Abstract: Robot utterances generally sound monotonous, unnatural and unfriendly because their Text-to-Speech systems are not optimized for communication but for text reading. Here, we present a non-monologue speech synthesis for robots. The key novelty lies in speech synthesis based on Hidden Markov models (HMMs) using a non-monologue corpus: we collected a speech corpus in a non-monologue style in which two professional voice talents read scripted dialogues, and HMMs were then trained with the corpus and used for speec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(3 citation statements)
references
References 15 publications
0
3
0
Order By: Relevance
“…As a result of our screening, we obtained 720 short-dialogue lines. No UUDB [22] Task-oriented Yes 2 14 Yes OGVC [23] Voice chat while playing game Yes 14 17 Yes NICT-VADC [24] Task-oriented Yes 7…”
Section: Crowdsourcing Setting and Resultsmentioning
confidence: 99%
“…As a result of our screening, we obtained 720 short-dialogue lines. No UUDB [22] Task-oriented Yes 2 14 Yes OGVC [23] Voice chat while playing game Yes 14 17 Yes NICT-VADC [24] Task-oriented Yes 7…”
Section: Crowdsourcing Setting and Resultsmentioning
confidence: 99%
“…As a consequence, remote control of robot systems can be performed in a cloud platform through data connection with a robot manipulator. Important applications of cloud robotics can be found in space exploration, remote surgery, intelligent housing systems, unmanned vehicles, and so on [4][5][6][7][8][9].…”
Section: Introductionmentioning
confidence: 99%
“…Most available ASRs are trained with transcribed data that need to be prepared separately from the learning process (Sugiura et al, 2015;Kawaharay et al, 2000;Dahl et al, 2012). By using certain supervised learning methods and certain model architectures, an ASR can be developed with a very large amount of transcribed speech data corpus, i.e., a set of pairs of text data and acoustic data.…”
Section: Introductionmentioning
confidence: 99%