Lexical Chains meet Word Embeddings in Document-level Statistical Machine Translation

Mascarell, Laura

doi:10.18653/v1/w17-4813

Cited by 10 publications

(8 citation statements)

References 29 publications


“…Statistical Machine Translation (SMT): Initial studies were based on cache memories (Tiedemann, 2010; Gong et al, 2011). However, most of the work explicitly models discourse phenomena (Sim Smith, 2017) such as lexical cohesion (Meyer and Popescu-Belis, 2012; Xiong et al, 2013; Loáiciga and Grisot, 2016; Pu et al, 2017; Mascarell, 2017), coherence (Born et al, 2017), and coreference (Rios Gonzales and Tuggener, 2017; Miculicich Werlen and Popescu-Belis, 2017a). Hardmeier et al (2013) introduced the document-level SMT paradigm.…”

Section: Related Work (mentioning)

confidence: 99%

Document-Level Neural Machine Translation with Hierarchical Attention Networks

Miculicich, Ram, Pappas et al. 2018

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing


Neural Machine Translation (NMT) can be improved by including document-level contextual information. For this purpose, we propose a hierarchical attention model to capture the context in a structured and dynamic manner. The model is integrated in the original NMT architecture as another level of abstraction, conditioning on the NMT model's own previous hidden states. Experiments show that hierarchical attention significantly improves the BLEU score over a strong NMT baseline with the state-of-the-art in context-aware methods, and that both the encoder and decoder benefit from context in complementary ways.


“…Conventional Document-level MT: These can further be classified into two main categories. The first, which use cache-based memories (Tiedemann, 2010; Gong et al, 2011) and the second, which focus on specific discourse phenomena like anaphora (Hardmeier and Federico, 2010), lexical cohesion (Xiong et al, 2013; Gong et al, 2015; Mascarell, 2017) and coreference (Miculicich Werlen and Popescu-Belis, 2017) to name a few. Most of these approaches are, however, restrictive as they mostly involve using handcrafted features similar to the conventional MT approaches.…”

Section: Related Work (mentioning)

confidence: 99%

Selective Attention for Context-aware Neural Machine Translation

Maruf, Martins, Haffari 2019

Proceedings of the 2019 Conference of the North


Despite the progress made in sentence-level NMT, current systems still fall short at achieving fluent, good quality translation for a full document. Recent works in context-aware NMT consider only a few previous sentences as context and may not scale to entire documents. To this end, we propose a novel and scalable top-down approach to hierarchical attention for context-aware NMT which uses sparse attention to selectively focus on relevant sentences in the document context and then attends to key words in those sentences. We also propose single-level attention approaches based on sentence or word-level information in the context. The document-level context representation, produced from these attention modules, is integrated into the encoder or decoder of the Transformer model depending on whether we use monolingual or bilingual context. Our experiments and evaluation on English-German datasets in different document MT settings show that our selective attention approach not only significantly outperforms context-agnostic baselines but also surpasses context-aware baselines in most cases.


“…These words are related sequentially in the text, defining the topic of the text segment that they cover and establishing associations between sentences. Following this observation, some researchers have obtained success in many NLP tasks such as word sense induction (Tao et al, 2014), machine translation (Mascarell, 2017) and text segmentation (Stokes et al, 2004). In the BB-rel dataset, the sentences where inter-sentence relations occur usually express the same topic or have semantic associations with each other.…”

Section: Contained Three (mentioning)

confidence: 98%

Bacteria Biotope Relation Extraction via Lexical Chains and Dependency Graphs

Xiong, Cheng et al. 2019

Proceedings of the 5th Workshop on BioNLP Open Shared Tasks


In this article, we describe our approach for the Bacteria Biotopes relation extraction (BBrel) subtask in the BioNLP Shared Task 2019. This task aims to promote the development of text mining systems that extract relationships between Microorganism, Habitat and Phenotype entities. In this paper, we propose a novel approach for dependency graph construction based on lexical chains, so one dependency graph can represent one or multiple sentences. After that, we propose a neural network model which consists of bidirectional long short-term memories and an attention graph convolution neural network to learn relation extraction features from the graph. Our approach is able to extract both intra- and inter-sentence relations, and meanwhile utilize syntax information. The results show that our approach achieved the best F1 (66.3%) in the official evaluation, in which 7 teams participated.
