“…Statistical Machine Translation (SMT). Initial studies were based on cache memories (Tiedemann, 2010; Gong et al., 2011). However, most of the work explicitly models discourse phenomena (Sim Smith, 2017) such as lexical cohesion (Meyer and Popescu-Belis, 2012; Xiong et al., 2013; Loáiciga and Grisot, 2016; Pu et al., 2017; Mascarell, 2017), coherence (Born et al., 2017), and coreference (Rios Gonzales and Tuggener, 2017; Miculicich Werlen and Popescu-Belis, 2017a). Hardmeier et al. (2013) introduced the document-level SMT paradigm.…”