Chenhui Chu scite author profile

In this paper, we propose a novel domain adaptation method named "mixed fine tuning" for neural machine translation (NMT). We combine two existing approaches namely fine tuning and multi domain NMT. We first train an NMT model on an out-of-domain parallel corpus, and then fine tune it on a parallel corpus which is a mix of the in-domain and out-ofdomain corpora. All corpora are augmented with artificial tags to indicate specific domains. We empirically compare our proposed method against fine tuning and multi domain methods and discuss its benefits and shortcomings.

show abstract

A Survey of Domain Adaptation for Machine Translation

Chu

Wang

2020

Journal of Information Processing

113

120

View full text Add to dashboard Cite

Neural machine translation (NMT) is a deep learning based approach for machine translation, which yields the state-of-the-art translation performance in scenarios where large-scale parallel corpora are available. Although the high-quality and domain-specific translation is crucial in the real world, domain-specific corpora are usually scarce or nonexistent, and thus vanilla NMT performs poorly in such scenarios. Domain adaptation that leverages both out-of-domain parallel corpora as well as monolingual corpora for in-domain translation, is very important for domainspecific translation. In this paper, we give a comprehensive survey of the state-of-the-art domain adaptation techniques for NMT.

show abstract

A Survey of Multilingual Neural Machine Translation

2020

View full text Add to dashboard Cite

We present a survey on multilingual neural machine translation (MNMT), which has gained a lot of traction in recent years. MNMT has been useful in improving translation quality as a result of translation knowledge transfer (transfer learning). MNMT is more promising and interesting than its statistical machine translation counterpart, because end-to-end modeling and distributed representations open new avenues for research on machine translation. Many approaches have been proposed to exploit multilingual parallel corpora for improving translation quality. However, the lack of a comprehensive survey makes it difficult to determine which approaches are promising and, hence, deserve further exploration. In this article, we present an indepth survey of existing literature on MNMT. We first categorize various approaches based on their central use-case and then further categorize them based on resource scenarios, underlying modeling principles, coreissues, and challenges. Wherever possible, we address the strengths and weaknesses of several techniques by comparing them with each other. We also discuss the future directions for MNMT. This article is aimed towards both beginners and experts in NMT. We hope this article will serve as a starting point as well as a source of new ideas for researchers and engineers interested in MNMT.

show abstract

KnowIT VQA: Answering Knowledge-Based Questions about Videos

García

Otani

Chu

et al. 2020

AAAI

View full text Add to dashboard Cite

We propose a novel video understanding task by fusing knowledge-based and video question answering. First, we introduce KnowIT VQA, a video dataset with 24,282 human-generated question-answer pairs about a popular sitcom. The dataset combines visual, textual and temporal coherence reasoning together with knowledge-based questions, which need of the experience obtained from the viewing of the series to be answered. Second, we propose a video understanding model by combining the visual and textual video content with specific knowledge about the show. Our main findings are: (i) the incorporation of knowledge produces outstanding improvements for VQA in video, and (ii) the performance on KnowIT VQA still lags well behind human accuracy, indicating its usefulness for studying current video modelling limitations.

show abstract

BERT Representations for Video Question Answering

Yang

García

Chu

et al. 2020

View full text Add to dashboard Cite

Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation

Dabre¹,

Fujita²,

Chu³

2019

View full text Add to dashboard Cite

This paper highlights the impressive utility of multi-parallel corpora for transfer learning in a one-to-many low-resource neural machine translation (NMT) setting. We report on a systematic comparison of multistage finetuning configurations, consisting of (1) pretraining on an external large (209k-440k) parallel corpus for English and a helping target language, (2) mixed pre-training or fine-tuning on a mixture of the external and low-resource (18k) target parallel corpora, and (3) pure finetuning on the target parallel corpora. Our experiments confirm that multi-parallel corpora are extremely useful despite their scarcity and content-wise redundancy thus exhibiting the true power of multilingualism. Even when the helping target language is not one of the target languages of our concern, our multistage finetuning can give 3-9 BLEU score gains over a simple one-to-one model.

show abstract

Adaptive Strategies for Coping with Product Variety Decisions

Yeh

Chu

1991

View full text Add to dashboard Cite

Under acute global competitive pressure, many companies have viewed product diversification as a strategic weapon with which to win market shares. Nevertheless, product diversification increases costs and sometimes degrades product quality. Many businesses today confront the dilemma of whether to broaden product lines or to focus production. This article explores both theoretically and empirically the possible impacts of product diversification. The emerging technologies and adaptive manufacturing strategies, potentially meeting a variety of customer needs while enhancing other competitive advantages, are examined.

show abstract

Information markets vs. opinion pools

Chen

Chu

Mullen

et al. 2005

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Chenhui Chu

An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation

A Survey of Domain Adaptation for Machine Translation

A Survey of Multilingual Neural Machine Translation

KnowIT VQA: Answering Knowledge-Based Questions about Videos

BERT Representations for Video Question Answering

Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation

Adaptive Strategies for Coping with Product Variety Decisions

Information markets vs. opinion pools

Contact Info

Product

Resources

About