2021
DOI: 10.48550/arxiv.2101.08133
Preprint
Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

Abstract: Annotating training data for sequence tagging tasks is usually very time-consuming. Recent advances in transfer learning for natural language processing in conjunction with active learning open the possibility to significantly reduce the necessary annotation budget. We are the first to thoroughly investigate this powerful combination in sequence tagging. We find that taggers based on deep pre-trained models can benefit from Bayesian query strategies with the help of the Monte Carlo (MC) dropout. Results of exp…
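The MC dropout technique the abstract refers to can be sketched in a few lines. The following is a minimal illustration under stated assumptions, not the paper's implementation: it presumes a PyTorch token classifier whose forward pass returns per-token logits with the signature shown, and all names are hypothetical.

import torch
import torch.nn.functional as F

def mc_dropout_token_entropy(model, input_ids, attention_mask, n_samples=10):
    """Per-token predictive entropy estimated with MC dropout."""
    model.train()  # keep dropout layers active at inference time (MC dropout)
    samples = []
    with torch.no_grad():
        for _ in range(n_samples):
            logits = model(input_ids, attention_mask)  # (batch, seq, tags); assumed signature
            samples.append(F.softmax(logits, dim=-1))
    mean_probs = torch.stack(samples).mean(dim=0)      # average the stochastic passes
    # Entropy of the averaged predictive distribution, computed per token.
    return -(mean_probs * mean_probs.clamp_min(1e-12).log()).sum(dim=-1)

Tokens (or sentences built from them) with high entropy are the ones a Bayesian query strategy would send to annotators first.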

Cited by 2 publications (3 citation statements) | References 19 publications
“…Active learning for NER. Active learning for NER has seen a variety of methodologies being developed to address its unique challenges (Settles and Craven, 2008; Marcheggiani and Artieres, 2014; Shelmanov et al., 2021). The overarching goal is to reduce the budget for labeling sequence data by selectively querying informative samples.…”
Section: Related Work (mentioning)
confidence: 99%
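The excerpt above describes the generic pool-based active learning loop these methods share: score the unlabeled pool, query the most informative samples, label them, retrain. Below is an illustrative sketch, not any cited paper's implementation; train, score, and annotate are hypothetical helpers standing in for model training, an informativeness measure, and human annotation.

def active_learning_loop(labeled, unlabeled, train, score, annotate,
                         budget=200, batch_size=20):
    """Pool-based active learning: repeatedly query informative samples."""
    model = train(labeled)
    spent = 0
    while spent < budget and unlabeled:
        # Rank the pool by an informativeness score
        # (e.g. the MC-dropout uncertainty sketched earlier).
        ranked = sorted(unlabeled, key=lambda x: score(model, x), reverse=True)
        query, unlabeled = ranked[:batch_size], ranked[batch_size:]
        labeled.extend(annotate(query))  # oracle / human annotation step
        spent += len(query)
        model = train(labeled)           # retrain on the enlarged labeled set
    return model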
“…The primary challenge in applying active learning to sequence tagging lies in addressing data imbalance at the entity level. Traditional methods of active sequence tagging, such as those referenced by Shen et al. (2017), Zhang et al. (2020), and Shelmanov et al. (2021), typically generate scores for sentences by summing or averaging the scores of the tokens within them, thereby treating each token equally. Radmard et al. (2021) attempted to address this by segmenting sentences for token-level selections; however, this led to a loss of context and semantic meaning, impairing human understanding.…”
Section: Introduction (mentioning)
confidence: 99%
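The aggregation this excerpt criticizes, scoring a sentence by summing or averaging its per-token uncertainties so that every token counts equally, is easy to make concrete. A minimal sketch, assuming token_scores is a hypothetical NumPy array of per-token uncertainties (e.g. the MC-dropout entropies above):

import numpy as np

def sentence_score(token_scores: np.ndarray, mode: str = "mean") -> float:
    """Aggregate per-token uncertainties into one sentence-level score."""
    if mode == "sum":    # favors long sentences
        return float(token_scores.sum())
    if mode == "mean":   # length-normalized, but still weights tokens equally
        return float(token_scores.mean())
    raise ValueError(f"unknown mode: {mode}")

# Example: the first sentence contains one highly uncertain token and wins.
s1 = sentence_score(np.array([0.1, 0.9, 0.2]))  # 0.4
s2 = sentence_score(np.array([0.3, 0.3, 0.3]))  # 0.3 -> s1 is queried first

Because both variants treat tokens uniformly, a single uncertain entity can be diluted by many confident tokens, which is exactly the entity-level imbalance the excerpt points to.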
“…For example, Agrawal [15] gives a modified least-confidence sampling strategy, Marcheggiani [16] investigates the minimum token margin (MTM) strategy, a variant of margin sampling, and Balcan [17] offers the maximum token entropy (MTE) measure of the ambiguity about a token's label. In addition, a Bayesian uncertainty estimation method is applied in [18]. These techniques help select crucial samples for training and minimize labeling cost.…”
Section: Related Work (mentioning)
confidence: 99%
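The measures this excerpt names follow their standard definitions. A brief sketch, assuming probs is a hypothetical (seq_len, num_tags) NumPy array of per-token tag probabilities; all names are illustrative:

import numpy as np

def least_confidence(probs: np.ndarray) -> np.ndarray:
    # 1 minus the probability of each token's most likely tag.
    return 1.0 - probs.max(axis=-1)

def token_margin(probs: np.ndarray) -> np.ndarray:
    # MTM: gap between the two most probable tags; a small margin signals
    # high ambiguity, so a query strategy takes the minimum over tokens.
    top2 = np.sort(probs, axis=-1)[..., -2:]
    return top2[..., 1] - top2[..., 0]

def token_entropy(probs: np.ndarray) -> np.ndarray:
    # MTE: entropy of each token's tag distribution; queries take the maximum.
    return -(probs * np.log(np.clip(probs, 1e-12, None))).sum(axis=-1)

Each function returns one score per token; a sentence-level query strategy would then aggregate them, for instance with the sum or mean shown earlier.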