Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)
DOI: 10.18653/v1/n19-1078
Pooled Contextualized Embeddings for Named Entity Recognition

Abstract: Contextual string embeddings are a recent type of contextualized word embedding that were shown to yield state-of-the-art results when utilized in a range of sequence labeling tasks. They are based on character-level language models which treat text as distributions over characters and are capable of generating embeddings for any string of characters within any textual context. However, such purely character-based approaches struggle to produce meaningful embeddings if a rare string is used in an underspecified…
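The core idea the abstract describes — pooling all contextual embeddings previously produced for a surface string and combining the pooled vector with the current one — can be sketched as follows. This is an illustrative toy, not the authors' implementation; the class name and mean-pooling choice are assumptions (the paper also considers other pooling operations).

```python
from collections import defaultdict

class PooledEmbedder:
    """Illustrative sketch (not the paper's code): keep a memory of all
    contextual embeddings seen for a surface string, then concatenate
    their mean with the current contextual embedding."""

    def __init__(self):
        # surface string -> list of contextual vectors seen so far
        self.memory = defaultdict(list)

    def embed(self, word, contextual_vec):
        self.memory[word].append(contextual_vec)
        seen = self.memory[word]
        # mean-pool every embedding produced for this string so far
        pooled = [sum(dims) / len(seen) for dims in zip(*seen)]
        # final embedding = current contextual vector + pooled memory
        return contextual_vec + pooled

emb = PooledEmbedder()
v1 = emb.embed("Fung", [1.0, 0.0, 0.0])  # first sighting: pooled == itself
v2 = emb.embed("Fung", [0.0, 1.0, 0.0])  # pooled is now the mean of both sightings
```

For a rare name like "Fung", an underspecified context yields a weak contextual vector on its own; the pooled memory lets earlier, clearer sightings inform later ones.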

Cited by 319 publications (337 citation statements); references 13 publications.
“…This includes statistical methods such as SVMs (Isozaki and Kazawa, 2002), HMMs (Bikel et al., 1997) and CRFs (Lafferty et al., 2001), which rely on hand-crafted feature engineering. A number of recent neural network approaches have also been applied to NER (Collobert et al., 2011; Huang et al., 2015; Lample et al., 2016; Ma and Hovy, 2016; Chiu and Nichols, 2016; Akbik et al., 2018; Jie et al., 2019; Akbik et al., 2019), as have graph-based methods (Bastings et al., 2017; Yao et al., 2019; Wang et al., 2018; Mishra et al., 2019; Cao et al., 2019; Zhang et al., 2019). Cetoli et al. (2017) use GCN to investigate the role of the dependency tree in English named entity recognition.…”
Section: Related Work
confidence: 99%
“…Neural architecture search has been proposed to automatically search for better architectures, showing competitive results on several tasks, e.g., image recognition and language modeling.

Model (best published) | F1
BiLSTM-CRF (Lample et al., 2016) | 90.94
BiLSTM-CRF+ELMo (Peters et al., 2018) | 92.22
BERT Base (Devlin et al., 2018) | 92.40
BERT Large (Devlin et al., 2018) | 92.80
BiLSTM-CRF+PCE (Akbik et al., 2019) | 93

A strand of NAS research focuses on reinforcement learning (Zoph and Le, 2016) and evolutionary algorithm-based (Xie and Yuille, 2017) methods. They are powerful but inefficient.…”
Section: Related Work
confidence: 99%
“…For our best performing model, we used two different token-level embeddings, a WANG2VEC-based embedding (Ling et al., 2015) and a FASTTEXT-based embedding (Bojanowski et al., 2017), a single byte-pair sub-word embedding (Heinzerling and Strube, 2018), and one context-sensitive character-level language model (Akbik et al., 2019b). Figure 1 gives a visual depiction of our best performing model.…”
Section: System Architecture
confidence: 99%
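The stacking described in this citation statement — concatenating vectors from several embedding sources per token — can be sketched generically. The helper name and the toy embedders below are hypothetical; in the flair library (which this line of work uses) a comparable facility is exposed as `StackedEmbeddings`.

```python
def stack_embeddings(token, embedders):
    """Hypothetical helper: concatenate the vectors produced by several
    embedding sources (word-level, sub-word, character-level, ...) into
    one combined representation for a single token."""
    vec = []
    for emb in embedders:
        vec.extend(emb(token))
    return vec

# Toy stand-ins for word2vec/fastText/byte-pair/char-LM embedders:
word_emb = lambda t: [float(len(t))]            # 1-dim "word" feature
char_emb = lambda t: [float(ord(t[0]) % 7), 1.0]  # 2-dim "character" feature

v = stack_embeddings("Berlin", [word_emb, char_emb])  # 3-dim stacked vector
```

Concatenation keeps each source's signal intact and lets the downstream sequence labeler weight them; the cost is a larger input dimension per added source.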