This paper proposes a fully neural, dialogue-context-aware method for online end-of-turn detection that can exploit long-range interactive information extracted from both the target speaker's and the interlocutor's utterances. The proposed method combines multiple time-asynchronous long short-term memory recurrent neural networks (LSTM-RNNs), which capture the target speaker's and the interlocutor's sequential features as well as the interactions between them. Assuming deployment in spoken dialogue systems, we use the target speaker's acoustic sequential features and the interlocutor's linguistic sequential features, each of which can be extracted in an online manner. Our evaluation confirms the effectiveness of taking into account the dialogue context formed by the target speaker's and the interlocutor's utterances.
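The core idea of the abstract can be illustrated with a minimal sketch: two recurrent streams of different lengths and time scales (acoustic frames from the target speaker, linguistic tokens from the interlocutor) are each summarized by an LSTM and then fused to score the end-of-turn probability. This is not the paper's architecture; the single-unit LSTM cell, the feature values, and the fusion weights below are all hypothetical, and a real system would use trained, vector-valued models.

```python
import math
import random

random.seed(0)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

class LSTMCell:
    """Minimal scalar LSTM cell (one input, one hidden unit), for illustration only."""
    def __init__(self):
        # For each gate: input weight, recurrent weight, bias (random, untrained).
        self.w = {g: [random.uniform(-0.5, 0.5) for _ in range(3)]
                  for g in ("i", "f", "o", "c")}

    def step(self, x, h, c):
        pre = {g: wx * x + wh * h + b for g, (wx, wh, b) in self.w.items()}
        i = sigmoid(pre["i"])          # input gate
        f = sigmoid(pre["f"])          # forget gate
        o = sigmoid(pre["o"])          # output gate
        c_new = f * c + i * math.tanh(pre["c"])
        h_new = o * math.tanh(c_new)
        return h_new, c_new

def run(cell, seq):
    """Feed a whole sequence through the cell, return the final hidden state."""
    h = c = 0.0
    for x in seq:
        h, c = cell.step(x, h, c)
    return h

# Two time-asynchronous streams of different lengths: acoustic frames from the
# target speaker and word-level linguistic features from the interlocutor.
acoustic = [0.2, 0.5, 0.1, 0.7, 0.3, 0.6]   # hypothetical frame-level features
linguistic = [1.0, 0.0, 1.0]                # hypothetical token-level features

h_ac = run(LSTMCell(), acoustic)
h_lx = run(LSTMCell(), linguistic)

# Late fusion: combine the two streams' final states into an end-of-turn score.
w_ac, w_lx, b = 0.8, 0.6, -0.1              # hypothetical fusion weights
p_eot = sigmoid(w_ac * h_ac + w_lx * h_lx + b)
print(f"P(end of turn) = {p_eot:.3f}")
```

Because the two LSTMs run independently, each stream can advance at its own rate as new frames or words arrive, which is what makes the combination usable online.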