Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2016
DOI: 10.18653/v1/p16-1014
Pointing the Unknown Words

Abstract: The problem of rare and unknown words is an important issue that can potentially affect the performance of many NLP systems, including traditional count-based and deep learning models. We propose a novel way to deal with rare and unseen words in neural network models using attention. Our model uses two softmax layers to predict the next word in conditional language models: one predicts the location of a word in the source sentence, and the other predicts a word in the shortlist vocabulary. At…
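The two-softmax design the abstract describes is concrete enough to sketch. Below is a minimal, illustrative decoding step in a PyTorch-style setup; the tensor names, the dot-product attention, and the single-sigmoid switch are simplifications standing in for the paper's MLP-based components, not its actual implementation.

```python
# Illustrative sketch of a pointer-softmax decoding step (after Gulcehre et
# al., 2016). All names and shapes here are assumptions for illustration.
import torch
import torch.nn.functional as F

def pointer_softmax_step(dec_state, enc_states, W_vocab, w_switch):
    """One decoding step producing two distributions and a switch.

    dec_state:  (hidden,)          current decoder hidden state
    enc_states: (src_len, hidden)  encoder states of the source sentence
    W_vocab:    (vocab, hidden)    output projection for the shortlist softmax
    w_switch:   (hidden,)          parameters of a simplified switching gate
    """
    # Location softmax: attention scores over source positions (copy part).
    scores = enc_states @ dec_state                  # (src_len,)
    p_loc = F.softmax(scores, dim=0)

    # Shortlist softmax: distribution over the shortlist vocabulary.
    p_vocab = F.softmax(W_vocab @ dec_state, dim=0)  # (vocab,)

    # Switch deciding whether to generate or copy. The paper conditions an
    # MLP on the context; a single sigmoid unit stands in here.
    p_gen = torch.sigmoid(w_switch @ dec_state)      # scalar in (0, 1)

    return p_gen * p_vocab, (1 - p_gen) * p_loc
```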

Cited by 411 publications (337 citation statements)
References 14 publications
“…We accomplish this by using an attention-based copying mechanism (Jia and Liang, 2016; Gulcehre et al., 2016; Gu et al., 2016). At every time step, the decoder may either output a token from the training vocabulary or copy a word from the input sentence.…”
Section: Semantic Parsing (mentioning)
confidence: 99%
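The generate-or-copy behavior this quote describes can be illustrated with a greedy step on top of the two switch-weighted distributions from the sketch above; the helper and its arguments are assumptions for illustration, not any cited paper's code.

```python
# Illustrative greedy decode step for an attention-based copying mechanism:
# either emit a shortlist token or copy the attended source word.
import torch

def decode_token(p_vocab, p_loc, id2word, src_words):
    """Pick the next surface word from the two switch-weighted distributions.

    p_vocab:   (vocab,)   weighted shortlist (generate) distribution
    p_loc:     (src_len,) weighted location (copy) distribution
    id2word:   list mapping vocabulary ids to words
    src_words: the tokenized source sentence
    """
    best_vocab = int(torch.argmax(p_vocab))
    best_loc = int(torch.argmax(p_loc))
    if p_vocab[best_vocab] >= p_loc[best_loc]:
        return id2word[best_vocab]    # generate from the training vocabulary
    return src_words[best_loc]        # copy a word from the input sentence
```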
“…Particularly, we base our model on the attention mechanism of and the pointer-softmax copying mechanism of Gulcehre et al. (2016). In question generation, we can condition our encoder on two different sources of information (compared to the single source in neural machine translation (NMT)): a document that the question should be about and an answer that should fit the generated question.…”
Section: Encoder-Decoder Model for Question Generation (mentioning)
confidence: 99%
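As a rough illustration of conditioning on two input sources, the sketch below encodes a document and an answer separately and fuses their final states into one decoder initialization. The GRU encoders and the concatenate-and-project fusion are assumptions for the sketch, not the cited paper's exact design.

```python
# Illustrative dual-source conditioning for question generation: encode the
# document and the answer separately, then fuse into one decoder init state.
import torch
import torch.nn as nn

class DualSourceConditioner(nn.Module):
    def __init__(self, hidden=256):
        super().__init__()
        self.doc_enc = nn.GRU(hidden, hidden, batch_first=True)
        self.ans_enc = nn.GRU(hidden, hidden, batch_first=True)
        # Fuse the two final encoder states into one decoder state.
        self.fuse = nn.Linear(2 * hidden, hidden)

    def forward(self, doc_emb, ans_emb):
        # doc_emb: (batch, doc_len, hidden); ans_emb: (batch, ans_len, hidden)
        _, h_doc = self.doc_enc(doc_emb)           # (1, batch, hidden)
        _, h_ans = self.ans_enc(ans_emb)           # (1, batch, hidden)
        both = torch.cat([h_doc[0], h_ans[0]], dim=-1)
        return torch.tanh(self.fuse(both))         # initial decoder state
```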
“…When formulating questions based on documents, it is common to refer to phrases and entities that appear directly in the text. We therefore incorporate into our decoder a mechanism for copying relevant words from D. We use the pointer-softmax formulation (Gulcehre et al., 2016), which has two output layers: the shortlist softmax and the location softmax. The shortlist softmax places a distribution over words in a predefined output vocabulary.…”
Section: Decoder (mentioning)
confidence: 99%
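One practical consequence of the two output layers is how training targets are formed: a target word inside the predefined shortlist supervises the shortlist softmax, while an out-of-shortlist word that appears in the document supervises the location softmax. A small sketch of that mapping; the shortlist contents and the first-occurrence tie-break are assumptions for illustration.

```python
# Illustrative mapping of target words to pointer-softmax training targets:
# either a shortlist vocabulary id or a copy position in the document.
shortlist = {"<unk>": 0, "what": 1, "is": 2, "the": 3, "?": 4}  # made up

def make_targets(target_words, doc_words):
    """For each target word, return ('vocab', id) or ('copy', position)."""
    targets = []
    for w in target_words:
        if w in shortlist:
            targets.append(("vocab", shortlist[w]))
        elif w in doc_words:
            targets.append(("copy", doc_words.index(w)))  # first occurrence
        else:
            targets.append(("vocab", shortlist["<unk>"]))
    return targets

doc = "the treaty was signed in lisbon".split()
print(make_targets("what is lisbon ?".split(), doc))
# [('vocab', 1), ('vocab', 2), ('copy', 5), ('vocab', 4)]
```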
“…Thus, we expect to decisively prohibit excessive generation. Finally, we evaluate the effectiveness of our method on the well-studied ABS benchmark data provided by Rush et al. (2015) and evaluated in (Chopra et al., 2016; Nallapati et al., 2016b; Kikuchi et al., 2016; Takase et al., 2016; Ayana et al., 2016; Gulcehre et al., 2016).…”
Section: Introduction (mentioning)
confidence: 99%