Ioan Calapodescu scite author profile

Ioan Calapodescu

5Publications

98Citation Statements Received

77Citation Statements Given

How they've been cited

How they cite others

Affiliations

Naver (South Korea), Xerox (France)

Publications

Order By: Most citations

Naver Labs Europe’s Systems for the WMT19 Machine Translation Robustness Task

Bérard¹,

Calapodescu²,

Roux³

2019

View full text Add to dashboard Cite

This paper describes the systems that we submitted to the WMT19 Machine Translation robustness task. This task aims to improve MT's robustness to noise found on social media, like informal language, spelling mistakes and other orthographic variations. The organizers provide parallel data extracted from a social media website 1 in two language pairs: French-English and Japanese-English (in both translation directions). The goal is to obtain the best scores on unseen test sets from the same source, according to automatic metrics (BLEU) and human evaluation. We proposed one single and one ensemble system for each translation direction. Our ensemble models ranked first in all language pairs, according to BLEU evaluation. We discuss the preprocessing choices that we made, and present our solutions for robustness to noise and domain adaptation.

show abstract

Machine Translation of Restaurant Reviews: New Corpus for Domain Adaptation and Robustness

Bérard¹,

Calapodescu²,

Dymetman³

et al. 2019

View full text Add to dashboard Cite

We share a French-English parallel corpus of Foursquare restaurant reviews, and define a new task to encourage research on Neural Machine Translation robustness and domain adaptation, in a real-world scenario where better-quality MT would be greatly beneficial. We discuss the challenges of such usergenerated content, and train good baseline models that build upon the latest techniques for MT robustness. We also perform an extensive evaluation (automatic and human) that shows significant improvements over existing online systems. Finally, we propose taskspecific metrics based on sentiment analysis or translation accuracy of domain-specific polysemous words.

show abstract

Naver Labs Europe’s Systems for the Document-Level Generation and Translation Task at WNGT 2019

Saleh

Bérard

Calapodescu

et al. 2019

View full text Add to dashboard Cite

Recently, neural models led to significant improvements in both machine translation (MT) and natural language generation tasks (NLG). However, generation of long descriptive summaries conditioned on structured data remains an open challenge. Likewise, MT that goes beyond sentence-level context is still an open issue (e.g., document-level MT or MT with metadata). To address these challenges, we propose to leverage data from both tasks and do transfer learning between MT, NLG, and MT with source-side metadata (MT+NLG). First, we train document-based MT systems with large amounts of parallel data. Then, we adapt these models to pure NLG and MT+NLG tasks by fine-tuning with smaller amounts of domain-specific data. This end-toend NLG approach, without data selection and planning, outperforms the previous state of the art on the Rotowire NLG task. We participated to the "Document Generation and Translation" task at WNGT 2019, and ranked first in all tracks.

show abstract

Semi-Automatic De-identification of Hospital Discharge Summaries with Natural Language Processing: A Case-Study of Performance and Real-World Usability

Calapodescu

Rozier

Artemova

et al. 2017

View full text Add to dashboard Cite

Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task

Bérard¹,

Calapodescu²,

Roux³

2019

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.