“…Nevertheless, we focus on unsupervised contrastive learning and form positive pairs via data augmentation, since such methods are more cost-effective and applicable across different domains and languages. Along this line, many approaches have been developed recently, where the augmentations are obtained via sampling from surrounding or nearby contexts (Logeswaran and Lee, 2018; Giorgi et al., 2020), word- or feature-level perturbation (Yan et al., 2021), back-translation (Fang and Xie, 2020), sentence-level corruption using an auxiliary language model (Meng et al., 2021), intermediate representations of BERT (Kim et al., 2021), and dropout (Yan et al., 2021; Gao et al., 2021).…”
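To make the dropout-based augmentation concrete, the following is a minimal NumPy sketch, not the actual method of any cited paper: encoding the same input twice through an encoder with independent dropout masks yields two slightly different embeddings that serve as a positive pair, while a different input serves as a negative. The `encode` function and its fixed random projection are toy stand-ins for a real sentence encoder.

```python
import numpy as np

# Fixed "learned" projection, shared across all encoder calls (toy stand-in).
W = np.random.default_rng(0).standard_normal((4, 8))
rng = np.random.default_rng(42)

def encode(x, p=0.5):
    """Toy encoder: fixed projection followed by dropout.

    Because the dropout mask is resampled on every call, two passes over
    the SAME input produce two different views of it (hypothetical setup).
    """
    h = x @ W
    mask = (rng.random(h.shape) > p).astype(float)
    return h * mask / (1.0 - p)  # inverted-dropout scaling

def cos(a, b):
    """Cosine similarity with a small epsilon to avoid division by zero."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8))

x = np.ones(4)                    # one "sentence"
z1, z2 = encode(x), encode(x)     # two dropout views -> positive pair
y = -np.ones(4)                   # a different "sentence" -> negative
z3 = encode(y)

sim_pos, sim_neg = cos(z1, z2), cos(z1, z3)
```

In a full contrastive objective (e.g. InfoNCE), `sim_pos` would be pulled up and `sim_neg` pushed down over a batch of such pairs; the sketch only illustrates how the two views of a positive pair arise from dropout noise alone, with no discrete text edits.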