Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
DOI: 10.18653/v1/2020.emnlp-main.523

LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

Abstract: Entity representations are useful in natural language tasks involving entities. In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer (Vaswani et al., 2017). The proposed model treats words and entities in a given text as independent tokens, and outputs contextualized representations of them. Our model is trained using a new pretraining task based on the masked language model of BERT (Devlin et al., 2019). The task involves predicting…
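
As a rough illustration of how such contextualized word and entity representations can be obtained in practice, the sketch below uses the LUKE port in the Hugging Face transformers library; the model identifier, example sentence, and entity spans are illustrative choices, not taken from the paper itself.

import torch
from transformers import LukeTokenizer, LukeModel

tokenizer = LukeTokenizer.from_pretrained("studio-ousia/luke-base")
model = LukeModel.from_pretrained("studio-ousia/luke-base")

text = "Beyoncé lives in Los Angeles."
entity_spans = [(0, 7), (17, 28)]  # character spans of "Beyoncé" and "Los Angeles"

inputs = tokenizer(text, entity_spans=entity_spans, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

word_reprs = outputs.last_hidden_state            # contextualized representations of word tokens
entity_reprs = outputs.entity_last_hidden_state   # contextualized representations of entity tokens

Words and entities are handled as separate token sequences, so the entity vectors come out of a dedicated output alongside the usual word-level hidden states.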

Cited by 338 publications (292 citation statements)
References 28 publications

“…There are also limitations in the integration of gazetteer features. Existing studies often add extra features to a word-level model's Contextual Word Representations (CWRs), which typically contain no information about real-world entities or their spans (Yamada et al., 2020). This concatenation approach is sub-optimal, as it creates additional and often highly correlated features.…”
Section: Complex Entities (mentioning)
confidence: 99%
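
To make the concatenation approach criticized above concrete, here is a minimal, purely illustrative PyTorch sketch (all tensor sizes are hypothetical): a binary gazetteer-match flag is appended to each token's contextual word representation before a simple tagging layer.

import torch

batch, seq_len, hidden, num_tags = 2, 16, 768, 9      # hypothetical sizes
cwr = torch.randn(batch, seq_len, hidden)              # contextual word representations from a word-level model
gazetteer_flag = torch.randint(0, 2, (batch, seq_len, 1)).float()  # 1 if the token matches a gazetteer entry

# The extra feature is simply concatenated to the CWRs, where it is often highly correlated with them.
features = torch.cat([cwr, gazetteer_flag], dim=-1)
tag_logits = torch.nn.Linear(hidden + 1, num_tags)(features)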
“…While a subtagger may learn regularities in entity names, a key limitation is that it needs retraining and evaluation on gazetteer updates. Recent work has considered directly integrating knowledge into transformers, e.g., KnowBert adds knowledge to BERT layers (Peters et al., 2019), and LUKE is pretrained to predict masked entities (Yamada et al., 2020). The drawbacks of such methods are that they are specific to Transformers and that the model's knowledge cannot be updated without retraining.…”
Section: Related Work (mentioning)
confidence: 99%
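
The masked-entity pretraining objective mentioned for LUKE can be sketched schematically as follows; this is not the authors' code, only an illustrative PyTorch rendering of predicting the identities of masked entity tokens over an entity vocabulary (all sizes and ids below are made up).

import torch
import torch.nn as nn

entity_vocab_size, hidden = 500_000, 768              # hypothetical sizes
entity_head = nn.Linear(hidden, entity_vocab_size)    # scores every entry of the entity vocabulary

entity_reprs = torch.randn(4, hidden)                  # encoder outputs for 4 entity tokens (stand-in values)
gold_entity_ids = torch.tensor([31, 7, 1902, 42])      # their true entity-vocabulary ids
masked_positions = torch.tensor([1, 3])                # entity tokens that were replaced by [MASK]

# Predict the original entity only at the masked positions, as in masked language modeling.
logits = entity_head(entity_reprs[masked_positions])
loss = nn.functional.cross_entropy(logits, gold_entity_ids[masked_positions])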
“…In NER, we also conduct a comparison on the revised version of the German dataset from the CoNLL 2006 shared task (Buchholz and Marsi, 2006). Recent work such as Yu et al. (2020) and Yamada et al. (2020) utilizes document contexts in the datasets. We follow their work and extract document embeddings for the transformer-based embeddings.…”
Section: Comparison With State-of-the-art Approaches (mentioning)
confidence: 99%
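
As one way to illustrate the use of document context with transformer-based embeddings, the sketch below encodes a target sentence together with its neighbouring sentences and keeps only the target sentence's sub-word vectors; the model name and toy document are assumptions, not taken from the cited papers.

import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModel.from_pretrained("bert-base-cased")

document = ["EU rejects German call to boycott British lamb .", "Peter Blackburn .", "BRUSSELS 1996-08-22 ."]
target_idx = 0  # the sentence we actually want embeddings for

left = " ".join(document[:target_idx])
right = " ".join(document[target_idx + 1:])
text = (left + " " if left else "") + document[target_idx] + (" " + right if right else "")
target_start = len(left) + 1 if left else 0
target_end = target_start + len(document[target_idx])

enc = tokenizer(text, return_tensors="pt", return_offsets_mapping=True, truncation=True)
offsets = enc.pop("offset_mapping")[0]
with torch.no_grad():
    hidden = model(**enc).last_hidden_state[0]

# Keep only the sub-word vectors whose character span lies inside the target sentence.
keep = torch.tensor([s < e and s >= target_start and e <= target_end for s, e in offsets.tolist()])
target_vectors = hidden[keep]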
“…NEL typically involves two tasks: recognizing named entities in a given text and then disambiguating the entity mentions against the knowledge base (KB). Researchers have achieved great success in NER with the help of Convolutional Neural Networks (CNNs), Bidirectional Recurrent Neural Networks (Bi-RNNs), and attention mechanisms along with a CRF decoder (Chiu and Nichols, 2016; Akbik et al., 2018; Ghaddar and Langlais, 2018; Jiang et al., 2019; Baevski et al., 2019; Yamada et al., 2020). Deep neural networks (DNNs) are also dominant in entity resolution tasks.…”
Section: Neural Entity Linking (mentioning)
confidence: 99%
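
The two-stage picture of NEL described above (mention recognition followed by disambiguation against a KB) can be shown with a deliberately toy sketch; the string-matching "recognizer", the dictionary KB, and the entity ids are all hypothetical placeholders, not a real system.

def recognize_mentions(text, surface_forms):
    """Stage 1 (toy): find character spans of known surface forms in the text."""
    spans = []
    for name in surface_forms:
        start = text.find(name)
        if start != -1:
            spans.append((start, start + len(name)))
    return spans

def disambiguate(mention, kb):
    """Stage 2 (toy): map a mention string to a knowledge-base identifier."""
    return kb.get(mention, "NIL")

kb = {"Beyoncé": "E1", "Los Angeles": "E2"}  # toy KB: surface form -> entity id
text = "Beyoncé lives in Los Angeles."
links = [(text[s:e], disambiguate(text[s:e], kb)) for s, e in recognize_mentions(text, kb)]
print(links)  # [('Beyoncé', 'E1'), ('Los Angeles', 'E2')]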