Fethi Bougares scite author profile

In this paper, we propose a novel neural network model called RNN EncoderDecoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixedlength vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.

show abstract

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Cho¹,

Merriënboer²,

Gülçehre³

et al. 2014

Preprint

2,565

2,490

View full text Add to dashboard Cite

Findings of the Third Shared Task on Multimodal Machine Translation

Barrault¹,

Bougares²,

Specia³

et al. 2018

113

View full text Add to dashboard Cite

We present the results from the third shared task on multimodal machine translation. In this task a source sentence in English is supplemented by an image and participating systems are required to generate a translation for such a sentence into German, French or Czech. The image can be used in addition to (or instead of) the source sentence. This year the task was extended with a third target language (Czech) and a new test set. In addition, a variant of this task was introduced with its own test set where the source sentence is given in multiple languages: English, French and German, and participating systems are required to generate a translation in Czech. Seven teams submitted 45 different systems to the two variants of the task. Compared to last year, the performance of the multimodal submissions improved, but text-only systems remain competitive.

show abstract

Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description

Elliott¹,

Frank²,

Barrault³

et al. 2017

133

104

View full text Add to dashboard Cite

We present the results from the second shared task on multimodal machine translation and multilingual image description. Nine teams submitted 19 systems to two tasks. The multimodal translation task, in which the source sentence is supplemented by an image, was extended with a new language (French) and two new test sets. The multilingual image description task was changed such that at test time, only the image is given. Compared to last year, multimodal systems improved, but text-only systems remain competitive.

show abstract

LIUM-CVC Submissions for WMT17 Multimodal Translation Task

Çağlayan¹,

Aransa²,

Bardet³

et al. 2017

View full text Add to dashboard Cite

This paper describes the monomodal and multimodal Neural Machine Translation systems developed by LIUM and CVC for WMT17 Shared Task on Multimodal Translation. We mainly explored two multimodal architectures where either global visual features or convolutional feature maps are integrated in order to benefit from visual context. Our final systems ranked first for both En→De and En→Fr language pairs according to the automatic evaluation metrics METEOR and BLEU.

show abstract

Sentiment Analysis of Tunisian Dialects: Linguistic Ressources and Experiments

Medhaffar¹,

Bougares²,

Estève³

et al. 2017

117

View full text Add to dashboard Cite

Dialectal Arabic (DA) is significantly different from the Arabic language taught in schools and used in written communication and formal speech (broadcast news, religion, politics, etc.). There are many existing researches in the field of Arabic language Sentiment Analysis (SA); however, they are generally restricted to Modern Standard Arabic (MSA) or some dialects of economic or political interest. In this paper we focus on SA of the Tunisian dialect. We use Machine Learning techniques to determine the polarity of comments written in Tunisian dialect. First, we evaluate the SA systems performances with models trained using freely available MSA and Multi-dialectal data sets. We then collect and annotate a Tunisian dialect corpus of 17.000 comments from Facebook. This corpus shows a significant improvement compared to the best model trained on other Arabic dialects or MSA data. We believe that this first freely available 12 corpus will be valuable to researchers working in the field of Tunisian Sentiment Analysis and similar areas.

show abstract

Does Multimodality Help Human and Machine for Translation and Image Captioning?

Çağlayan¹,

Aransa²,

Wang³

et al. 2016

View full text Add to dashboard Cite

This paper presents the systems developed by LIUM and CVC for the WMT16 Multimodal Machine Translation challenge. We explored various comparative methods, namely phrase-based systems and attentional recurrent neural networks models trained using monomodal or multimodal data. We also performed a human evaluation in order to estimate the usefulness of multimodal data for human machine translation and image description generation. Our systems obtained the best results for both tasks according to the automatic evaluation metrics BLEU and ME-TEOR.

show abstract

NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems

Çağlayan¹,

García-Martínez²,

Bardet³

et al. 2017

View full text Add to dashboard Cite

In this paper, we present nmtpy, a flexible Python toolkit based on Theano for training Neural Machine Translation and other neural sequence-to-sequence architectures. nmtpy decouples the specification of a network from the training and inference utilities to simplify the addition of a new architecture and reduce the amount of boilerplate code to be written. nmtpy has been used for LIUM's topranked submissions to WMT Multimodal Machine Translation and News Translation tasks in 2016 and 2017.

show abstract

12 3 4 5 6

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Fethi Bougares

Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Findings of the Third Shared Task on Multimodal Machine Translation

Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description

LIUM-CVC Submissions for WMT17 Multimodal Translation Task

Sentiment Analysis of Tunisian Dialects: Linguistic Ressources and Experiments

Does Multimodality Help Human and Machine for Translation and Image Captioning?

NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems

Contact Info

Product

Resources

About