2018
DOI: 10.48550/arxiv.1804.02063
Preprint

Few-Shot Text Classification with Pre-Trained Word Embeddings and a Human in the Loop

Abstract: Most of the literature around text classification treats it as a supervised learning problem: given a corpus of labeled documents, train a classifier such that it can accurately predict the classes of unseen documents. In industry, however, it is not uncommon for a business to have entire corpora of documents where few or none have been classified, or where existing classifications have become meaningless. With web content, for example, poor taxonomy management can result in labels being applied indiscriminate…
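The abstract describes few-shot text classification built on pre-trained word embeddings with a human in the loop. A minimal sketch of that general idea (not the paper's exact FsText algorithm): average word vectors to embed each document, form a centroid per class from a handful of labeled examples, and classify by cosine similarity. The tiny embedding table and class names below are illustrative stand-ins for real pre-trained vectors such as GloVe or word2vec.

```python
import numpy as np

# Toy stand-in for pre-trained word embeddings; in practice these
# vectors would be loaded from a pre-trained model (GloVe, word2vec).
EMB = {
    "refund":  np.array([0.9, 0.1, 0.0]),
    "payment": np.array([0.8, 0.2, 0.1]),
    "invoice": np.array([0.7, 0.3, 0.0]),
    "crash":   np.array([0.0, 0.9, 0.8]),
    "error":   np.array([0.1, 0.8, 0.9]),
    "bug":     np.array([0.0, 0.7, 0.9]),
}

def embed(text):
    """Embed a document by averaging the vectors of its known words."""
    vecs = [EMB[w] for w in text.lower().split() if w in EMB]
    return np.mean(vecs, axis=0) if vecs else np.zeros(3)

def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0

# One or two labeled examples per class ("few-shot"); a human in the
# loop could confirm or correct predictions and grow this support set.
support = {
    "billing":   ["refund payment", "invoice payment"],
    "technical": ["crash error", "bug error"],
}
centroids = {c: np.mean([embed(t) for t in texts], axis=0)
             for c, texts in support.items()}

def classify(text):
    """Assign the class whose centroid is most similar to the document."""
    return max(centroids, key=lambda c: cosine(embed(text), centroids[c]))

print(classify("refund invoice"))  # billing
print(classify("crash bug"))       # technical
```

A human reviewer correcting low-confidence predictions and feeding them back into `support` is what makes the loop practical when almost no labels exist up front.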

Cited by 5 publications (6 citation statements); References 3 publications
“…Few-shot learning approach. We benchmark MFeEmb against prior work on conflict prediction, other embedding choices, and FsText, a few-shot model proposed by Bailey and Chopra (2018). Experiments were performed using a 300-dimensional version of MFeEmb in which all three embeddings have the same length, i.e., 100.…”
Section: Methods
confidence: 99%
“…Few-shot text classification entails performing classification after training or tuning a model on only a few examples. Several studies (Yu et al, 2018; Bailey and Chopra, 2018; Geng et al, 2020) have explored various approaches for few-shot text classification, which mainly involve traditional machine learning techniques for selecting the optimal category sub-samples.…”
Section: Few-shot Text Classification
confidence: 99%
“…Word embeddings are a powerful tool and are applied in a variety of Natural Language Processing tasks, such as text classification (Aydogan and Karci, 2020; Alwehaibi and Roy, 2018; Jo and Cinarel, 2019; Bailey and Chopra, 2018; Rescigno et al, 2020) and sentiment analysis (Araque et al, 2017; Rezaeinia et al, 2019; Fu et al, 2017; Ren et al, 2016; Tang et al, 2014). However, analogies such as "Man is to computer programmer as woman is to homemaker" (Bolukbasi et al, 2016a) contain worrisome biases that are present in society and hence embedded in language.…”
Section: Introduction
confidence: 99%
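The analogy test mentioned above is typically run as vector arithmetic over the embedding space: solve a : b :: c : ? by finding the word nearest to v(b) − v(a) + v(c). A toy illustration with hand-made 2-dimensional vectors (real bias studies use full pre-trained embeddings):

```python
import numpy as np

# Toy 2-dim embeddings chosen so the classic analogy works exactly;
# real studies use pre-trained vectors (e.g. word2vec, GloVe).
EMB = {
    "man":   np.array([1.0, 0.0]),
    "woman": np.array([0.0, 1.0]),
    "king":  np.array([1.0, 0.5]),
    "queen": np.array([0.0, 1.5]),
}

def analogy(a, b, c):
    """Solve a : b :: c : ?  via nearest neighbor to v(b) - v(a) + v(c)."""
    target = EMB[b] - EMB[a] + EMB[c]
    return min((w for w in EMB if w != c),
               key=lambda w: np.linalg.norm(EMB[w] - target))

print(analogy("man", "king", "woman"))  # queen
```

Running the same arithmetic on occupation words over real embeddings is how Bolukbasi et al. surfaced the "programmer/homemaker" bias the quote refers to.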