Vikas Bhardwaj scite author profile

Vikas Bhardwaj

5Publications

233Citation Statements Received

68Citation Statements Given

How they've been cited

210

223

How they cite others

Affiliations

Meta (Israel), Integrated Laboratory Systems, Inc., Thomson Reuters (United States)

Publications

Order By: Most citations

A neural interlingua for multilingual machine translation

Lu¹,

Keung²,

Ladhak³

et al. 2018

View full text Add to dashboard Cite

We incorporate an explicit neural interlingua into a multilingual encoder-decoder neural machine translation (NMT) architecture. We demonstrate that our model learns a languageindependent representation by performing direct zero-shot translation (without using pivot translation), and by using the source sentence embeddings to create an English Yelp review classifier that, through the mediation of the neural interlingua, can also classify French and German reviews. Furthermore, we show that, despite using a smaller number of parameters than a pairwise collection of bilingual NMT models, our approach produces comparable BLEU scores for each language pair in WMT15.

show abstract

Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER

Keung¹,

Lu²,

Bhardwaj³

2019

View full text Add to dashboard Cite

Contextual word embeddings (e.g. GPT, BERT, ELMo, etc.) have demonstrated stateof-the-art performance on various NLP tasks. Recent work with the multilingual version of BERT has shown that the model performs very well in cross-lingual settings, even when only labeled English data is used to finetune the model. We improve upon multilingual BERT's zero-resource cross-lingual performance via adversarial learning. We report the magnitude of the improvement on the multilingual ML-Doc text classification and CoNLL 2002/2003 named entity recognition tasks. Furthermore, we show that language-adversarial training encourages BERT to align the embeddings of English documents and their translations, which may be the cause of the observed performance gains.

show abstract

Multiplicity and word sense: evaluating and learning from multiply labeled word sense annotations

Passonneau

Bhardwaj

Salleb-Aouissi

et al. 2012

Lang Resources & Evaluation

View full text Add to dashboard Cite

Don’t Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings

Keung¹,

Lu²,

Salazar³

et al. 2020

View full text Add to dashboard Cite

Multilingual contextual embeddings have demonstrated state-of-the-art performance in zero-shot cross-lingual transfer learning, where multilingual BERT is fine-tuned on one source language and evaluated on a different target language. However, published results for mBERT zero-shot accuracy vary as much as 17 points on the MLDoc classification task across four papers. We show that the standard practice of using English dev accuracy for model selection in the zero-shot setting makes it difficult to obtain reproducible results on the MLDoc and XNLI tasks. English dev accuracy is often uncorrelated (or even anti-correlated) with target language accuracy, and zero-shot performance varies greatly at different points in the same fine-tuning run and between different fine-tuning runs. These reproducibility issues are also present for other tasks with different pre-trained embeddings (e.g., MLQA with XLM-R). We recommend providing oracle scores alongside zero-shot results: still fine-tune using English data, but choose a checkpoint with the target dev set. Reporting this upper bound makes results more consistent by avoiding arbitrarily bad checkpoints.

show abstract

A neural interlingua for multilingual machine translation

Lu¹,

Keung²,

Ladhak³

et al. 2018

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Vikas Bhardwaj

A neural interlingua for multilingual machine translation

Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER

Multiplicity and word sense: evaluating and learning from multiply labeled word sense annotations

Don’t Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings

A neural interlingua for multilingual machine translation

Contact Info

Product

Resources

About