2021
DOI: 10.48550/arxiv.2104.07078
Preprint

UDALM: Unsupervised Domain Adaptation through Language Modeling

Abstract: In this work we explore Unsupervised Domain Adaptation (UDA) of pretrained language models for downstream tasks. We introduce UDALM, a fine-tuning procedure, using a mixed classification and Masked Language Model loss, that can adapt to the target domain distribution in a robust and sample efficient manner. Our experiments show that performance of models trained with the mixed loss scales with the amount of available target data and the mixed loss can be effectively used as a stopping criterion during UDA trai…
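The abstract describes UDALM as fine-tuning with a mixed objective that combines a supervised classification loss on labeled source-domain data with a Masked Language Model loss on unlabeled target-domain data. The following is a minimal PyTorch sketch of such a mixed loss, assuming a shared BERT-style encoder from Hugging Face transformers; the class name, the simple linear MLM head, and the mlm_weight coefficient are illustrative assumptions, not the authors' exact implementation.

import torch
import torch.nn as nn
from transformers import AutoModel

class MixedLossModel(nn.Module):
    """Shared encoder with a classification head (labeled source data)
    and an MLM head (unlabeled target data). Hypothetical sketch only."""

    def __init__(self, model_name="bert-base-uncased", num_labels=2, mlm_weight=1.0):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)
        hidden = self.encoder.config.hidden_size
        self.classifier = nn.Linear(hidden, num_labels)
        self.mlm_head = nn.Linear(hidden, self.encoder.config.vocab_size)
        self.mlm_weight = mlm_weight  # assumed mixing coefficient

    def forward(self, src_batch, tgt_batch):
        # Classification loss on labeled source-domain examples ([CLS] token).
        src_out = self.encoder(input_ids=src_batch["input_ids"],
                               attention_mask=src_batch["attention_mask"])
        cls_logits = self.classifier(src_out.last_hidden_state[:, 0])
        clf_loss = nn.functional.cross_entropy(cls_logits, src_batch["labels"])

        # Masked-LM loss on unlabeled target-domain examples; labels hold the
        # original token ids at masked positions and -100 elsewhere.
        tgt_out = self.encoder(input_ids=tgt_batch["input_ids"],
                               attention_mask=tgt_batch["attention_mask"])
        mlm_logits = self.mlm_head(tgt_out.last_hidden_state)
        mlm_loss = nn.functional.cross_entropy(
            mlm_logits.view(-1, mlm_logits.size(-1)),
            tgt_batch["labels"].view(-1),
            ignore_index=-100,
        )

        # Mixed objective: supervised task loss plus weighted MLM loss.
        return clf_loss + self.mlm_weight * mlm_loss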

Cited by 1 publication (3 citation statements)
References 8 publications (10 reference statements)
“…Unsupervised domain adaptation (UDA) is an essential task in the realm of deep learning since it mitigates the expensive burden of manual annotation by focusing on cheap unlabeled data from target domains [Ramponi and Plank, 2020]. Among all existing approaches for UDA, pre-trained language model (PrLM) based approaches have become the de facto standard [Gururangan et al., 2020; Ben-David et al., 2020; Yu et al., 2021; Karouzos et al., 2021] since these PrLMs are equipped with generic knowledge learned from large corpora [Howard and Ruder, 2018] and lead to promising results. The primary focus of UDA methods is to capture transferable features for the target domain while preserving the knowledge learned from the source domain [Blitzer et al., 2006; Pan et al., 2010]. …”
Section: Introduction (mentioning)
confidence: 99%
“…The primary focus of UDA methods is to capture transferable features for the target domain while preserving the knowledge learned from the source domain [Blitzer et al., 2006; Pan et al., 2010]. However, most existing pre-training-based UDA approaches are carried out by fine-tuning the entire set of model parameters on domain-specific corpora [Gururangan et al., 2020; Yu et al., 2021; Karouzos et al., 2021], which are usually of limited size. Such a setting may easily drift the PrLM to a specific domain and distort the generic knowledge embedded in the original PrLM weights [He et al., 2021]. …”
Section: Introduction (mentioning)
confidence: 99%