2021
DOI: 10.48550/arxiv.2105.11174
Preprint

Retrieval Enhanced Model for Commonsense Generation

Abstract: Commonsense generation is a challenging task of generating a plausible sentence describing an everyday scenario using provided concepts. Its requirement of reasoning over commonsense knowledge and compositional generalization ability even puzzles strong pre-trained language generation models. We propose a novel framework using retrieval methods to enhance both the pre-training and fine-tuning for commonsense generation. We retrieve prototype sentence candidates by concept matching and use them as auxiliary inp…
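The core retrieval step the abstract describes can be illustrated with a short sketch. The Python snippet below is a minimal illustration under assumptions, not the authors' implementation: it scores candidate sentences by how many of the input concepts they cover, keeps the top-k as prototypes, and concatenates them to the concept set as auxiliary input for a pre-trained seq2seq generator. The toy corpus, the coverage score, and the "[SEP]"-style input layout are all assumptions made for the example.

import re

def concept_coverage(concepts, sentence):
    # Fraction of the input concepts that appear as surface tokens in the sentence.
    tokens = set(re.findall(r"[a-z]+", sentence.lower()))
    return sum(c in tokens for c in concepts) / len(concepts)

def retrieve_prototypes(concepts, corpus, k=3):
    # Rank corpus sentences by concept coverage and keep the top-k as prototypes.
    return sorted(corpus, key=lambda s: concept_coverage(concepts, s), reverse=True)[:k]

def build_generator_input(concepts, prototypes):
    # Concatenate the concept set with the retrieved prototypes as auxiliary context.
    return " ".join(concepts) + " [SEP] " + " [SEP] ".join(prototypes)

# Toy usage; the paper retrieves from external caption and NLI corpora instead.
corpus = [
    "A dog jumps to catch a frisbee in the park.",
    "The boy throws a ball to his dog.",
    "People sit around a table eating dinner.",
]
concepts = ["dog", "catch", "frisbee"]
print(build_generator_input(concepts, retrieve_prototypes(concepts, corpus, k=2)))

The augmented string would then be fed to the pre-trained generator in place of the bare concept list; the exact separator tokens and the number of prototypes are design choices the sketch does not pin down.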

Cited by 1 publication (3 citation statements)
References 18 publications
“…Baselines (1) Concept2Sentence: We consider several recent submissions to the leaderboard of CommonGen that leverage auxiliary information for GCSR. KFCNet (Li et al., 2021), Re-T5 (Wang et al., 2021), and EKI-BART (Fan et al., 2020) are prototype-based models, which retrieve sentences containing as many input concepts as possible from external caption and NLI datasets, and then use these sentences as auxiliary inputs. VisCTG (Feng et al., 2021b) is an image-augmented model which retrieves images from Google by using the concepts as a query, followed by an image captioning model that generates captions as auxiliary inputs.…”
Section: Methods
mentioning confidence: 99%
“…This is because LMs have no intrinsic mechanism to reason over high-level relations between concepts. To close the knowledge gap, recent work augments the LM input with knowledge graph triples (e.g., (dog, CapableOf, catch)) retrieved from ConceptNet (Li et al., 2020), or with prototype sentences that cover the input concepts, retrieved from external text corpora (Fan et al., 2020; Wang et al., 2021). However, despite the input augmentation, GCSR skills are implicitly learned based on the concept-text pairs in the training data, without explicit supervision.…”
Section: Learning To Verbalize
mentioning confidence: 99%
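For the knowledge-graph variant mentioned in this citation statement, a comparable input augmentation can be sketched as follows. This is an illustrative assumption rather than any cited system's code: ConceptNet-style triples are verbalized with simple relation templates and prepended to the concept set before generation. The template wording and the "|" separator are made up for the example.

RELATION_TEMPLATES = {
    "CapableOf": "{h} is capable of {t}",
    "UsedFor": "{h} is used for {t}",
    "AtLocation": "{h} is at {t}",
}

def verbalize_triple(head, relation, tail):
    # Turn a (head, relation, tail) triple into a short natural-language phrase.
    template = RELATION_TEMPLATES.get(relation, "{h} " + relation + " {t}")
    return template.format(h=head, t=tail)

def augment_input(concepts, triples):
    # Prepend the verbalized triples to the concept set as auxiliary knowledge.
    facts = "; ".join(verbalize_triple(*t) for t in triples)
    return facts + " | " + " ".join(concepts)

# Example with the triple given in the statement: (dog, CapableOf, catch).
print(augment_input(["dog", "catch", "frisbee"], [("dog", "CapableOf", "catch")]))

Prototype-sentence augmentation, the approach of the paper indexed on this page, differs only in what is prepended: retrieved sentences instead of verbalized triples.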