Proceedings of the Seventh Joint Conference on Lexical And Computational Semantics 2018
DOI: 10.18653/v1/s18-2021
|View full text |Cite
|
Sign up to set email alerts
|

Deep Affix Features Improve Neural Named Entity Recognizers

Abstract: We propose a practical model for named entity recognition (NER) that combines word and character-level information with a specific learned representation of the prefixes and suffixes of the word. We apply this approach to multilingual and multi-domain NER and show that it achieves state of the art results on the CoNLL 2002 Spanish and Dutch and CoNLL 2003 German NER datasets, consistently achieving 1.5-2.3 percent over the state of the art without relying on any dictionary features. Additionally, we show impro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
51
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 68 publications
(76 citation statements)
references
References 18 publications
0
51
0
Order By: Relevance
“…Text De-identification system architecture Our machine learning model implements the state-of-the-art in de-identification of medical notes [5,6] and named entity sequence tagging [30]. In our analyses, however, any sufficiently powerful model could be substituted.…”
Section: Data Sourcesmentioning
confidence: 99%
“…Text De-identification system architecture Our machine learning model implements the state-of-the-art in de-identification of medical notes [5,6] and named entity sequence tagging [30]. In our analyses, however, any sufficiently powerful model could be substituted.…”
Section: Data Sourcesmentioning
confidence: 99%
“…This model became the state-of-the-art system (85.81%) for Spanish NER until 2018. Finally, the most similar architecture towards ours proposed by Yadav et al in 2018 [25]. They combined affix-level features along with frequently explored word + character models.…”
Section: Related Workmentioning
confidence: 92%
“…Instead, we consider affixes at both ends of the word as additional features for NER. Our base model is similar to Reference [25], where we apply the combined representation of character-level, word-level and affix-level features over Bi-LSTM-CRF stack. Our model differs from their architecture in how the character-level and affix-level features are generated.…”
Section: Proposed Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…NER is a widely studied problem, where methods have been characterized by the use of CRFs with heavy feature engineering, gazetteers and external knowledge resources (Finkel et al, 2005;Florian et al, 2003;Kazama and Torisawa, 2007;Klein et al, 2003;Lin and Wu, 2009;Radford et al, 2015;Ratinov and Roth, 2009;Zhang and Johnson, 2003). Ratinov and Roth (2009) (Bharadwaj et al, 2016;Chiu and Nichols, 2016;Collobert et al, 2011;Gillick et al, 2016;Huang et al, 2015;Lample et al, 2016;Ma and Hovy, 2016;dos Santos and Guimarães, 2015;Yadav et al, 2018;Yang et al, 2016). Huang et al (2015) use a word-level bidirectional LSTM-CRF for several sequence tagging problems including POS tagging and NER.…”
Section: Related Workmentioning
confidence: 99%