“…Another direction, however, is to identify bitexts using only textual information, as the metadata associated with documents can often be sparse or unreliable (Uszkoreit et al., 2010). Some text-based approaches for identifying bitexts rely on methods such as n-gram scoring (Uszkoreit et al., 2010), named entity matching (Do et al., 2009), and cross-language information retrieval (Utiyama and Isahara, 2003; Munteanu and Marcu, 2005).…”