Abstract: A major problem in task-oriented conversational agents is the lack of support for the repair of conversational breakdowns. Prior studies have shown that current repair strategies for these kinds of errors are often ineffective due to: (1) the lack of transparency about the state of the system's understanding of the user's utterance; and (2) the system's limited capabilities to understand the user's verbal attempts to repair natural language understanding errors. This paper introduces SOVITE, a new multi-modal …
“…Recently, Li et al. [37] explored multi-modal strategies in the context of existing mobile app Graphical User Interfaces (GUIs) for fixing Natural Language Understanding (NLU) breakdowns and command disambiguations [36]. In particular, one of their system solutions (Figure 2.1.a)…”
Section: User Perceptions Of Task-oriented Chatbots (mentioning, confidence: 99%)
“…We hypothesize that explaining the competencies and limitations of the chatbot using the identified intent and entity will not only help users recognize the breakdown but also improve transparency. Furthermore, within the breakdown decision, an indication of where the problem occurred and its possible causes would help users more clearly understand the cause of the breakdown and repair their queries [37].…”
Section: Structuring the Explanation With Intent And Entity (mentioning, confidence: 99%)
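The intent-and-entity structure hypothesized above can be sketched as a simple explanation generator over NLU output. This is a hypothetical illustration, not the studied chatbot's implementation: the result fields (`intent`, `confidence`, `entities`), the slot lists, and the 0.5 confidence threshold are all assumptions made for the sketch.

```python
def explain_breakdown(nlu_result, supported_intents):
    """Turn an NLU result into a breakdown explanation.

    supported_intents maps each intent name to its required slots,
    so the message can say *where* the problem occurred: an unsupported
    intent, a low-confidence match, or missing entities.
    """
    intent = nlu_result.get("intent")
    confidence = nlu_result.get("confidence", 0.0)
    entities = nlu_result.get("entities", {})

    if intent not in supported_intents:
        # Expose the chatbot's competencies and limitations.
        return ("I can't handle '%s' requests. I can help with: %s."
                % (intent, ", ".join(sorted(supported_intents))))
    if confidence < 0.5:
        # Low-confidence match: surface the guessed intent for repair.
        return ("I think you want to '%s', but I'm not sure. "
                "Could you rephrase?" % intent)
    missing = [slot for slot in supported_intents[intent]
               if slot not in entities]
    if missing:
        # Point at the specific entities still needed.
        return "To '%s', I still need: %s." % (intent, ", ".join(missing))
    return "OK, doing '%s' with %s." % (intent, entities)
```

For example, a `book_flight` query with only a destination would yield a message naming the missing `date` slot, which is the kind of cause-specific indication the snippet argues helps users repair their queries.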
“…Consequently, inspired by the "research through design" [67] paradigm, we contributed a minimum viable implementation of a chatbot that allowed us to explore how visual explanations should be designed. Through our implementation, we were able to augment the chatbot with different visual in-context explanations that could support a range of infeasible and disambiguation tasks in the user study and to assess them with users; however, more work remains to be done at the intersection of ML and HCI to build upon recent work [37]. Nevertheless, our designs could be used to map user intents to specific portions of GUIs and to interaction examples from other users, and could therefore be adapted to feature-rich applications besides spreadsheets that have similar UIs and menu structures.…”
Section: Future Work: Designing a Hybrid Of Visual Tour And Nontour Mode (mentioning, confidence: 99%)
“…However, the complexities of natural language interactions [3, 51], limited training sets, and poor conversational understanding [2] remain key obstacles to fully realizing the potential of human-chatbot interaction. For example, when interacting with task-oriented chatbots, a key challenge for users is dealing with conversational dead-ends or breakdowns [5, 36, 37]. In fact, during a conversational breakdown, as many as 70% of users may opt to quit the task or abandon the chatbot completely, while others may try to rephrase their queries with little or no success [51].…”
“…The combination of voice and touch enhanced the experience on mobile devices. In addition, multimodal methods were also implemented to improve the performance of disambiguation interfaces [32, 35, 45].…”
Editing operations such as cut, copy, paste, and correcting errors in typed text are often tedious and challenging to perform on smartphones. In this paper, we present VT, a voice- and touch-based multi-modal text editing and correction method for smartphones. To edit text with VT, the user glides over a text fragment with a finger and dictates a command, such as “bold” to change the format of the fragment, or taps inside a text area and speaks a command such as “highlight this paragraph.” To correct text, the user taps approximately on the erroneous text fragment and dictates the new content for substitution or insertion. VT combines touch and voice inputs with language context, such as a language model and phrase similarity, to infer the user's editing intention, which lets it handle ambiguities and noisy input signals. This is a major advantage over existing error-correction methods (e.g., iOS's Voice Control), which require precise cursor control or text selection. Our user studies showed that VT significantly improves the efficiency of text editing and correction on smartphones: it reduced text editing time by 30.80% and text correcting time by 29.97% over the touch-only method, and reduced text editing time by 30.81% and text correcting time by 47.96% over iOS's Voice Control.
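The fusion of touch location and language context that VT describes can be illustrated with a toy scorer: candidate words are ranked by a weighted combination of closeness to the tap position and string similarity to the dictated replacement. This is a hedged sketch, not VT's actual algorithm; the linear weighting, the word-level granularity, and the use of `difflib` similarity (standing in for VT's language model and phrase similarity) are all assumptions.

```python
import difflib

def correct_text(text, tap_index, dictated, w_touch=0.5, w_sim=0.5):
    """Replace the word that best matches a noisy tap plus dictation.

    tap_index is a character offset (the approximate tap location);
    dictated is the spoken replacement. Each word is scored by
    proximity to the tap and similarity to the dictation, so neither
    signal alone has to be precise.
    """
    words = text.split()
    # Recover each word's character offset in the original string.
    offsets, pos = [], 0
    for w in words:
        start = text.index(w, pos)
        offsets.append(start)
        pos = start + len(w)

    def score(i):
        center = offsets[i] + len(words[i]) / 2
        proximity = 1.0 / (1.0 + abs(center - tap_index))  # near the tap
        similarity = difflib.SequenceMatcher(
            None, words[i].lower(), dictated.lower()).ratio()
        return w_touch * proximity + w_sim * similarity

    best = max(range(len(words)), key=score)
    words[best] = dictated
    return " ".join(words)
```

For instance, tapping near the misspelled word in `"the quick brwn fox"` and dictating `"brown"` substitutes the right word even though the tap offset is approximate, which mirrors the paper's point about tolerating imprecise cursor placement.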