Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2016
DOI: 10.18653/v1/p16-2047

An Unsupervised Method for Automatic Translation Memory Cleaning

Abstract: We address the problem of automatically cleaning a large-scale Translation Memory (TM) in a fully unsupervised fashion, i.e. without human-labelled data. We approach the task by: i) designing a set of features that capture the similarity between two text segments in different languages, ii) using them to induce reliable training labels for a subset of the translation units (TUs) contained in the TM, and iii) using the automatically labelled data to train an ensemble of binary classifiers. We apply our method to cl…
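
To make the three steps in the abstract concrete, here is a minimal Python sketch of that kind of pipeline. It is not the authors' implementation: the abstract does not name the features, thresholds, or classifiers. The three toy features, the `low`/`high` cut-offs, the one-feature threshold classifiers, the majority vote, and the example TUs are all assumptions chosen for illustration.

```python
import re
from statistics import mean


def _end_mark(segment):
    """Return the final sentence punctuation mark, or '' if there is none."""
    last = segment.strip()[-1:]
    return last if last in ".?!:;" else ""


def features(src, tgt):
    """Three toy similarity cues in [0, 1] (hypothetical, not the paper's set)."""
    ls, lt = len(src.split()), len(tgt.split())
    length_ratio = min(ls, lt) / max(ls, lt) if max(ls, lt) else 0.0
    ns, nt = set(re.findall(r"\d+", src)), set(re.findall(r"\d+", tgt))
    number_overlap = len(ns & nt) / len(ns | nt) if ns | nt else 1.0
    end_match = 1.0 if _end_mark(src) == _end_mark(tgt) else 0.0
    return [length_ratio, number_overlap, end_match]


def induce_labels(units, low=0.4, high=0.9):
    """Step ii: label only TUs whose mean feature score is extreme."""
    labelled = []
    for src, tgt in units:
        f = features(src, tgt)
        score = mean(f)
        if score >= high:
            labelled.append((f, 1))  # confidently good TU
        elif score <= low:
            labelled.append((f, 0))  # confidently bad TU
        # TUs in between stay unlabelled and are left to the classifiers
    return labelled


def train_stump(labelled, index):
    """One binary classifier: threshold at the midpoint of the class means.

    Assumes at least one positive and one negative induced label.
    """
    pos = [f[index] for f, y in labelled if y == 1]
    neg = [f[index] for f, y in labelled if y == 0]
    threshold = (mean(pos) + mean(neg)) / 2
    return lambda f: 1 if f[index] >= threshold else 0


def train_ensemble(labelled):
    """Step iii: one stump per feature, combined by majority vote."""
    stumps = [train_stump(labelled, i) for i in range(3)]

    def classify(src, tgt):
        votes = [stump(features(src, tgt)) for stump in stumps]
        return 1 if sum(votes) * 2 > len(votes) else 0

    return classify


# Toy English-Italian TM (invented examples).
units = [
    ("The committee met on 12 May 2015.", "Il comitato si è riunito il 12 maggio 2015."),
    ("Article 3 applies to all members.", "L'articolo 3 si applica a tutti i membri."),
    ("See page 45 for the full list of approved suppliers and contact details.", "Grazie"),
    ("Total cost: 300 euros", "Il documento contiene 7 allegati e due tabelle riassuntive in fondo."),
    ("Please sign here", "Firmare qui per favore."),
]

labelled = induce_labels(units)
classify = train_ensemble(labelled)
kept = [tu for tu in units if classify(*tu) == 1]
```

In this sketch only the four clear-cut TUs receive induced labels; the fifth is ambiguous under the toy features and is decided by the ensemble vote instead. A realistic system would replace the stumps with proper learners and the surface cues with stronger cross-lingual features.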

citations
Cited by 2 publications
references
References 11 publications