Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1645

Learning to Discover, Ground and Use Words with Segmental Neural Language Models

Abstract: We propose a segmental neural language model that combines the generalization power of neural networks with the ability to discover word-like units that are latent in unsegmented character sequences. In contrast to previous segmentation models that treat word segmentation as an isolated task, our model unifies word discovery, learning how words fit together to form sentences, and, by conditioning the model on visual context, how words' meanings ground in representations of nonlinguistic modalities. Experiments…
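The core idea behind a segmental language model is to marginalize over every possible segmentation of a character sequence with a forward dynamic program. The sketch below illustrates this with a toy, hand-fixed segment probability table; in the paper's model those probabilities would instead come from a character-level neural network conditioned on the preceding context, so everything here (names, probability values, the `max_seg_len` cutoff) is a hypothetical stand-in, not the authors' implementation.

```python
import math

NEG_INF = float("-inf")

def logaddexp(a, b):
    """Numerically stable log(exp(a) + exp(b))."""
    if a == NEG_INF:
        return b
    if b == NEG_INF:
        return a
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))

# Toy segment log-probabilities (illustrative values only; a segmental
# neural LM would score each candidate segment with a neural network).
SEG_LOGPROB = {
    "a": math.log(0.2),
    "b": math.log(0.2),
    "ab": math.log(0.3),
    "abb": math.log(0.05),
}

def marginal_logprob(chars, max_seg_len=3):
    """Forward DP over segmentations:
    alpha[t] = logsumexp over segment lengths k of
               alpha[t - k] + log p(chars[t-k:t]),
    giving the log-probability of `chars` marginalized over all ways
    of splitting it into known segments."""
    n = len(chars)
    alpha = [NEG_INF] * (n + 1)
    alpha[0] = 0.0  # log-prob of the empty prefix
    for t in range(1, n + 1):
        for k in range(1, min(max_seg_len, t) + 1):
            seg = chars[t - k:t]
            if seg in SEG_LOGPROB and alpha[t - k] > NEG_INF:
                alpha[t] = logaddexp(alpha[t], alpha[t - k] + SEG_LOGPROB[seg])
    return alpha[n]

# "ab" admits two segmentations, ("a", "b") and ("ab",),
# so its marginal probability is 0.2 * 0.2 + 0.3 = 0.34.
print(round(math.exp(marginal_logprob("ab")), 4))  # 0.34
```

Because the marginal is a sum of products over a lattice of segmentations, the same dynamic program (run inside the training loss) lets the model learn word boundaries without ever observing them directly.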

Cited by 27 publications (59 citation statements)
References 36 publications (31 reference statements)
“…In terms of grammar induction, they are competitive with recently proposed neural architectures that discover tree-like structures through gated attention (Shen et al., 2018). Our results, along with other recent work on joint language modeling and structure learning with deep networks (Shen et al., 2018, 2019; Wiseman et al., 2018; Kawakami et al., 2018), suggest that it is possible to learn generative models of language that model the underlying data well (i.e., assign high likelihood to held-out data) and at the same time induce meaningful linguistic structure.…”
Section: Introduction (supporting)
confidence: 76%
“…There is some work that presents a Bayesian probabilistic formulation to learn referential grounding in dialog (Liu et al., 2014), user preferences (Cadilhac et al., 2013), and color descriptions (McMahan and Stone, 2015; Andreas and Klein, 2014). A large body of work also focuses on leveraging attention mechanisms for grounding multimodal phenomena in images (Srinivasan et al., 2020; Chu et al., 2018; Fan et al., 2019; Vu et al., 2018; Kawakami et al., 2019; Dong et al., 2019), videos (Lei et al., 2020), and the navigation of embodied agents (Yang et al., 2020). Some approach this using data structures such as graphs, in the domains of grounding images (Chang et al., 2015; Liu et al., 2014), videos, text (Laws et al., 2010; Chen, 2012; Massé et al., 2008), entities (Zhou et al., 2018a), knowledge graphs and ontologies (Jauhar et al., 2015; Zhang et al., 2020), and interactive settings (Jauhar et al., 2015; Xu et al., 2020).…”
Section: Stratification (mentioning)
confidence: 99%
“…Unlike word-to-word alignment, we focus on learning the alignment between data records and text segments. Some works also integrate neural language models to jointly learn segmentation and correspondence, e.g., phrase-based machine translation (Huang et al., 2018), speech recognition (Wang et al., 2017), and vision-grounded word segmentation (Kawakami et al., 2019). Data-to-text naturally fits this scenario, since each data record is normally verbalized in one continuous text segment.…”
Section: Related Work (mentioning)
confidence: 99%