2022
DOI: 10.48550/arxiv.2212.03692
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Transformer-Based Named Entity Recognition for French Using Adversarial Adaptation to Similar Domain Corpora

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…The low performance of NER tools compared to our proposed solution can be explained by two reasons. Firstly, the French language is known to have less publicly available labelled data that participate in the training of the NER tools [25]. Secondly, the social network texts are often poorly formulated, with grammatical and syntax errors which make the context difficult to understand by the tools and thus failing to recognise location entities.…”
Section: Discussionmentioning
confidence: 99%
“…The low performance of NER tools compared to our proposed solution can be explained by two reasons. Firstly, the French language is known to have less publicly available labelled data that participate in the training of the NER tools [25]. Secondly, the social network texts are often poorly formulated, with grammatical and syntax errors which make the context difficult to understand by the tools and thus failing to recognise location entities.…”
Section: Discussionmentioning
confidence: 99%
“…We chose spaCy because it is robust, well supported, and regularly updated, with a full pipeline. There are perhaps state-of-the-art resources for each of the components (e.g., for French NER [9]), but integration into a pipeline is at best complex and often impossible. In prior work, we had built a quote detection system for English, which is well supported by spaCy's existing English language models [10,11].…”
Section: Multilingual Nlp and The Hegemony Of Englishmentioning
confidence: 99%