“…There has recently been a surge of interest in generating coherent and consistent dialogues grounded on pre-defined persona profile information from the PersonaChat dataset (Zhang et al, 2018;. Approaches to enforce consistent personas on this dataset have included retrieving relevant profile facts (Zhang et al, 2018), retrieving and refining relevant utterances , increasing the probability of copying a word from the profile (Yavuz et al, 2019), tuning to discourage inconsistent responses (Li et al, 2019a), reranking candidate responses (Welleck et al, 2019), and combining natural language inference with reinforcement learning (Song et al, 2019). Unfortunately, these methods fall short of generating responses that are as grammatical, diverse, engaging, and descriptive as natural human generated conversation (See et al, 2019;Roller et al, 2020).…”