Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020)
DOI: 10.18653/v1/2020.acl-main.128
Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network

Abstract: In this paper, we explore slot tagging with only a few labeled support sentences (a.k.a. few-shot slot tagging). Few-shot slot tagging faces a unique challenge compared to other few-shot classification problems, as it calls for modeling the dependencies between labels. However, it is hard to apply previously learned label dependencies to an unseen domain, due to the discrepancy of label sets. To tackle this, we introduce a collapsed dependency transfer mechanism into the conditional random field (CRF) to transfer abstract…
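To make the collapsed dependency transfer idea concrete: instead of learning transitions between concrete, domain-specific labels, the CRF can learn transitions between a handful of abstract label categories and expand them for any unseen label set. Below is a minimal sketch of that expansion step under a BIO scheme; the transition scores are illustrative placeholders, not the paper's learned values, and the helper names are our own.

```python
import itertools

# Collapsed (abstract) transition scores. In the paper these are learned
# on source domains; the values below are illustrative placeholders.
# "sB"/"sI" = a B/I tag with the SAME slot type as the previous tag,
# "dB"/"dI" = a B/I tag with a DIFFERENT slot type.
COLLAPSED = {
    ("O", "O"): 0.8,  ("O", "B"): 0.5,   ("O", "I"): -2.0,
    ("B", "O"): 0.2,  ("B", "sB"): -0.5, ("B", "dB"): 0.1,
    ("B", "sI"): 0.9, ("B", "dI"): -1.0,
    ("I", "O"): 0.3,  ("I", "sB"): -0.5, ("I", "dB"): 0.1,
    ("I", "sI"): 0.7, ("I", "dI"): -1.0,
}

def split_label(label):
    """'B-date' -> ('B', 'date'); 'O' -> ('O', None)."""
    if label == "O":
        return "O", None
    tag, slot = label.split("-", 1)
    return tag, slot

def abstract_key(prev, curr):
    """Map a concrete label pair to its collapsed transition key."""
    p_tag, p_slot = split_label(prev)
    c_tag, c_slot = split_label(curr)
    if c_tag == "O":
        return (p_tag, "O")
    if p_tag == "O":
        return ("O", c_tag)
    return (p_tag, ("s" if p_slot == c_slot else "d") + c_tag)

def expand(slot_types):
    """Build a full transition table for an unseen domain's label set."""
    labels = ["O"] + [f"{p}-{s}" for s in slot_types for p in ("B", "I")]
    return {(a, b): COLLAPSED[abstract_key(a, b)]
            for a, b in itertools.product(labels, repeat=2)}

# Unseen target domain with slot types never seen during training:
table = expand(["date", "city"])
print(table[("B-date", "I-date")])  # 0.9  (B -> same-slot I)
print(table[("B-date", "I-city")])  # -1.0 (B -> different-slot I)
```

The expanded scores can then serve as the transition potentials of a standard CRF/Viterbi decoder for the target domain, regardless of how its label set differs from the source domains.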

Cited by 139 publications (257 citation statements). References: 37 publications.

Citation statements:
“…Prior work (Fritzler et al., 2019; Hou et al., 2020) on few-shot NER followed the few-shot classification literature and adopted the episode evaluation methodology. Specifically, a NER system is evaluated with respect to multiple evaluation episodes.…”
Section: A Standard Evaluation Setup (mentioning)
Confidence: 99%
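The episode methodology this statement describes can be illustrated with a simplified sampling sketch: each episode pairs a k-shot support set with a query set, and the system's score is averaged over many episodes. The names (`data_by_label`, `evaluate`, `model`) are assumptions, and real slot-tagging episodes need more careful sampling, since one sentence may contain several labels.

```python
import random

def sample_episode(data_by_label, k_shot, n_query):
    """Sample one episode: k_shot support examples and n_query query
    examples per label, drawn from a single held-out domain
    (simplified; see the caveat in the lead-in)."""
    support, query = [], []
    for label, sentences in data_by_label.items():
        picked = random.sample(sentences, k_shot + n_query)
        support.extend(picked[:k_shot])
        query.extend(picked[k_shot:])
    return support, query

# Hypothetical usage: average the metric over many evaluation episodes.
# scores = [evaluate(model, *sample_episode(test_domain, k_shot=1, n_query=20))
#           for _ in range(100)]
# print(sum(scores) / len(scores))
```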
“…Most meta-learning approaches (Snell et al., 2017; Hou et al., 2020) simulate the test-time setup during training. Hence, these approaches sample multiple support sets and test sets from the training data and learn representations to minimize their corresponding few-shot loss on the source domain.…”
Section: Pre-trained NER Models as Token Embedders (mentioning)
Confidence: 99%
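The episodic training described here can be sketched as a loop that repeatedly samples a support/query episode from a source domain and minimizes the few-shot loss on the query set. This sketch reuses `sample_episode` from above; `model.few_shot_loss` and the data layout are assumptions for illustration, not an API from the cited papers.

```python
import random
import torch

def meta_train(model, source_domains, n_episodes, k_shot, n_query, lr=1e-4):
    """Simulate the test-time setup during training (episodic training).
    `source_domains` is a list of per-domain data_by_label dicts;
    `model.few_shot_loss` (hypothetical) returns the loss of predicting
    the query set conditioned on the support set."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(n_episodes):
        domain = random.choice(source_domains)      # pick a source domain
        support, query = sample_episode(domain, k_shot, n_query)
        loss = model.few_shot_loss(support, query)  # few-shot loss on query
        opt.zero_grad()
        loss.backward()
        opt.step()
    return model
```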