ALOHA: Artificial Learning of Human Attributes for Dialogue Agents

Li, Aaron W.; Jiang, Veronica; Feng, Steven Y.; Sprague, Julia; Zhou, Wei; Hoey, Jesse

doi:10.1609/aaai.v34i05.6328

Cited by 10 publications

(11 citation statements)

References 22 publications

(28 reference statements)


“…Another solution could be to combine a data-driven model with another approach to compensate for the deficiencies in the models, such as combining a generative model (e.g., Sequence-to-Sequence) with a Memory Network (Madotto et al, 2018; Zhang B. et al, 2020) or with transformers (Vaswani et al, 2017), such as in the work of Roller et al (2020), Generative Pre-trained Transformer (GPT) (Radford et al, 2018, 2019; Brown et al, 2020; Zhang Y. et al, 2020), Bidirectional Encoder Representations from Transformers (BERT) (Devlin et al, 2019; Song et al, 2021), and Poly-encoders (Humeau et al, 2020; Li et al, 2020). Data-driven models can also be combined with graphical models (Zhou et al, 2020; Song et al, 2019; Moon et al, 2019; Shi et al, 2020; Wu B. et al, 2020; Xu et al, 2020), rule-based or slot-filling systems (Tammewar et al, 2018; Zhang Z. et al, 2019), a knowledge-base (Ganhotra and Polymenakos, 2018; Ghazvininejad et al, 2018; Luo et al, 2019; Yavuz et al, 2019; Moon et al, 2019; Wu et al, 2019; Lian et al, 2019; Zhang B. et al, 2020; Majumder et al, 2020; Tuan et al, 2021) or with automatic extraction of attributes from dialogue (Tigunova et al, 2019, 2020; Wu C.-S. et al, 2020, 2021; Ma et al, 2021) to improve the personalised entity selection in responses.…”

Section: Discussion (mentioning)

confidence: 99%

Coffee With a Hint of Data: Towards Using Data-Driven Approaches in Personalised Long-Term Interactions

Irfan, Hellou, Belpaeme

2021

Front. Robot. AI


While earlier research in human-robot interaction predominantly uses rule-based architectures for natural language interaction, these approaches are not flexible enough for long-term interactions in the real world due to the large variation in user utterances. In contrast, data-driven approaches map the user input to the agent output directly and hence provide more flexibility with these variations without requiring any set of rules. However, data-driven approaches are generally applied to single dialogue exchanges with a user and do not build up a memory over long-term conversation with different users, whereas long-term interactions require remembering users and their preferences incrementally and continuously and recalling previous interactions with users to adapt and personalise the interactions, known as the lifelong learning problem. In addition, it is desirable to learn user preferences from a few samples of interactions (i.e., few-shot learning). These are known to be challenging problems in machine learning, while they are trivial for rule-based approaches, creating a trade-off between flexibility and robustness. Correspondingly, in this work, we present the text-based Barista Datasets generated to evaluate the potential of data-driven approaches in generic and personalised long-term human-robot interactions with simulated real-world problems, such as recognition errors, incorrect recalls and changes to the user preferences. Based on these datasets, we explore the performance and the underlying inaccuracies of the state-of-the-art data-driven dialogue models that are strong baselines in other domains of personalisation in single interactions, namely Supervised Embeddings, Sequence-to-Sequence, End-to-End Memory Network, Key-Value Memory Network, and Generative Profile Memory Network. The experiments show that while data-driven approaches are suitable for generic task-oriented dialogue and real-time interactions, no model performs sufficiently well to be deployed in personalised long-term interactions in the real world, because of their inability to learn and use new identities, and their poor performance in recalling user-related data.


“…In the second category, Li et al (2020) introduced an approach to construct human-level attributes from movie character tropes and used them in a response selection task. They learned the language styles of movie characters associated with several traits, and then retrieved the suitable response associated with the same traits as the target character.…”

Section: Related Work (mentioning)

confidence: 99%

“…As our intention is to capture the characteristics of the users, that is, the persons who respond to the given utterance, we argue that using user-related information would be useful to achieve that objective. In contrast to Li et al (2020), which classified movie characters into several associated traits, we wanted the model to learn the style at the user level. Additionally, because we adopt LSTM in our model, we argue that using a simpler mechanism would be more effective.…”

Section: Response Generation With Attention To Speaker Information (mentioning)

confidence: 99%

“…To address the interestingness and response diversity issues, some studies have focused on improving the style or response characteristics to resemble human-produced ones. One approach is to integrate user-specific information/features (Li et al 2016b; Bak and Oh 2019; Wu et al 2020), or some specific persona characteristics (Chu et al 2018; Li et al 2020) to establish the association between the responses and the corresponding persona. Another approach is to transfer the specific style or latent information from additional texts to the responses (Herzig et al 2017; Zhang et al 2018; Gao et al 2019).…”

mentioning

confidence: 99%


Stylistically User-specific Response Generation

Fikri, Takamura, Okumura

2021

Journal of Natural Language Processing


The ability to capture the conversation context is a necessity to build a good conversation model. However, a good model must also provide interesting and diverse responses to mimic actual human conversations. Given that different people can respond differently to the same utterance, we believe that using user-specific attributes can be useful for a conversation task. In this study, we attempt to drive the style of generated responses to resemble the style of real people using user-specific information. Our experiments show that our method applies to both seen and unseen users. Human evaluation also shows that our model outperforms the baselines in terms of relevance and style similarity.


“…One such task is to ground open-domain chit-chat dialogue agents to minimize inconsistencies in their language use (e.g., I like cabbage → (next turn) → Cabbage is disgusting) and make them engaging to talk with (Li et al 2016; Zhang et al 2018; Mazaré et al 2018; Qian et al 2018; Zheng et al 2020a,b; Li et al 2020; Majumder et al 2020). Thus far, personalization in chit-chat has made use of dense embeddings and natural language sentences.…”

Section: Introduction (mentioning)

confidence: 99%

Extracting and Inferring Personal Attributes from Dialogue

Wang, Zhou, Koncel-Kedziorski et al.

2021

Preprint


Personal attributes represent structured information about a person, such as their hobbies, pets, family, likes and dislikes. In this work, we introduce the tasks of extracting and inferring personal attributes from human-human dialogue. We first demonstrate the benefit of incorporating personal attributes in a social chit-chat dialogue model and task-oriented dialogue setting. Thus motivated, we propose the tasks of personal attribute extraction and inference, and then analyze the linguistic demands of these tasks. To meet these challenges, we introduce a simple and extensible model that combines an autoregressive language model utilizing constrained attribute generation with a discriminative reranker. Our model outperforms strong baselines on extracting personal attributes as well as inferring personal attributes that are not contained verbatim in utterances and instead require commonsense reasoning and lexical inferences, which occur frequently in everyday conversation.
