2021
DOI: 10.32473/flairs.v34i1.128369
|View full text |Cite
|
Sign up to set email alerts
|

Retrieval-Augmented Transformer-XL for Close-Domain Dialog Generation

Abstract: Transformer-based models have demonstrated excellent capabilities of capturing patterns and structures in natural language generation and achieved state-of-the-art results in many tasks. In this paper we present a transformer-based model for multi-turn dialog response generation. Our solution is based on a hybrid approach which augments a transformer-based generative model with a novel retrieval mechanism, which leverages the memorized information in the training data via k-Nearest Neighbo… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 6 publications
(1 reference statement)
0
2
0
Order By: Relevance
“…More recently, RETRO Borgeaud et al (2021) enhances the model architecture not by increasing the number of parameters or the size of training data, but rather through the retrieval of information relevant for each sample. Similarly, Bonetta et al (2021) uses memorized similarity information from the training data for retrieval at at inference time.…”
Section: Retrieval Mechanismsmentioning
confidence: 99%
“…More recently, RETRO Borgeaud et al (2021) enhances the model architecture not by increasing the number of parameters or the size of training data, but rather through the retrieval of information relevant for each sample. Similarly, Bonetta et al (2021) uses memorized similarity information from the training data for retrieval at at inference time.…”
Section: Retrieval Mechanismsmentioning
confidence: 99%
“…The idea in such systems is to retrieve suitable responses from the dataset of recorded dialogs and adapt them to the context of the current dialog if needed. Retrieval-based approaches have been commonly applied in the domains of question-answering (Q&A) systems and general purpose conversational systems [17,18,19,20]. However, these approaches are not very common in the research literature for building CRS, where nowadays language generation models are predominant.…”
Section: Introductionmentioning
confidence: 99%