Nested Named Entity Recognition via Second-best Sequence Learning and Decoding

Shibuya, Takashi; Hovy, Eduard

doi:10.1162/tacl_a_00334

Cited by 90 publications

(82 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Although the second-best path searching algorithm is proposed as the main contribution of Shibuya and Hovy (2020), we claim that forcing the target path at the next level to be the second-best path at the current level is not optimal. As the innermostfirst encoding example above, the best path at level 3 is B-ROLE,I-ROLE,E-ROLE,O,O.…”

Section: Influence Of the Best Pathmentioning

confidence: 98%

“…Therefore the second-best path is more likely to be one of those paths that share as many as possible labels with the best path, e.g., B-ROLE,I-ROLE,E-ROLE,O,S-ORG, rather than the actual target label sequence at level 4, i.e., B-PER,I-PER,I-PER,I-PER,E-PER, which does not overlap with the best path at all. In addition, Shibuya and Hovy (2020) reuse the same potential function at all higher levels. This indicates that, for instance, at level 3 and time step 1, their model encourages the dot product of the hidden state and the label embedding h 1 v B-ROLE to be larger than h 1 v B-PER , while at level 4, the remaining influence of the best path reversely forces h 1 v B-PER to be larger than h 1 v B-ROLE .…”

Section: Influence Of the Best Pathmentioning

confidence: 99%

“…Among them, layered methods solve this task through multi-level sequential labeling, in which entities are divided into several levels, where the term level indicates the depth of entity nesting, and sequential labeling is performed repeatedly. As a special case of layered method, Shibuya and Hovy (2020) force the next level entities to locate on the second-best path of the current level search space. Hence, their algorithm can repeatedly detect inner entities through applying a conventional conditional random field (CRF) (Lafferty et al, 2001) and then exclude the obtained best paths from the search space.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path

Wang¹,

Shindo²,

Matsumoto³

et al. 2021

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer

View full text Add to dashboard Cite

This paper presents a novel method for nested named entity recognition.As a layered method, our method extends the prior secondbest path recognition method by explicitly excluding the influence of the best path. Our method maintains a set of hidden states at each time step and selectively leverages them to build a different potential function for recognition at each level. In addition, we demonstrate that recognizing innermost entities first results in better performance than the conventional outermost entities first scheme. We provide extensive experimental results on ACE2004, ACE2005, and GENIA datasets to show the effectiveness and efficiency of our proposed method.

show abstract

Section: Influence Of the Best Pathmentioning

confidence: 98%

Section: Influence Of the Best Pathmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path

Wang¹,

Shindo²,

Matsumoto³

et al. 2021

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer

View full text Add to dashboard Cite

show abstract

“…Fisher and Vlachos (2019) introduced a BERT-based model that first merges tokens and/or entities into entities, and then assigned labeled to these entities. Shibuya and Hovy (2019) provided inference model that extracts entities iteratively from outermost ones to inner ones. Straková et al (2019) viewed nested NER as a sequence-tosequence generation problem, in which the input sequence is a list of tokens and the target sequence is a list of labels.…”

Section: Nested Named Entity Recognitionmentioning

confidence: 99%

“…• DYGIE: Luan et al (2019) introduces a general framework that share span representations using dynamically constructed span graphs. (Wang and Lu, 2018) 76.8 72.3 74.5 ARN (Lin et al, 2019a) 76.2 73.6 74.9 Path-BERT (Shibuya and Hovy, 2019) 82.98 82.42 82.70 Merge-BERT (Fisher and Vlachos, 2019) micro-averaged precision, recall and F1 scores for evaluation.…”

Section: Baselinesmentioning

confidence: 99%