2021
DOI: 10.3390/sym13050786

Low-Resource Named Entity Recognition via the Pre-Training Model

Abstract: Named entity recognition (NER) is an important task in natural language processing that requires determining entity boundaries and classifying entities into pre-defined categories. For low-resource languages, most state-of-the-art systems require tens of thousands of annotated sentences to reach high performance. However, minimal annotated data is available for the Uyghur and Hungarian (UH languages) NER tasks. There are also specificities in each task, such as differences in words and word order across languages …
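The two sub-problems the abstract names, finding entity boundaries and assigning categories, can be illustrated with a small sketch. The BIO tagging scheme used here is a common NER convention, assumed for illustration since the abstract does not name a specific scheme; the function and all names are illustrative, not the paper's method.

```python
def decode_bio(tokens, tags):
    """Extract (entity_text, category, start, end) spans from BIO tags.

    BIO scheme: "B-X" opens an entity of category X, "I-X" continues it,
    and "O" marks tokens outside any entity.
    """
    spans, start, cat = [], None, None
    for i, tag in enumerate(tags):
        if tag.startswith("B-"):
            if start is not None:  # close the previous entity first
                spans.append((" ".join(tokens[start:i]), cat, start, i))
            start, cat = i, tag[2:]
        elif tag.startswith("I-") and cat == tag[2:]:
            continue  # extend the currently open entity
        else:
            # "O", or an I- tag whose category does not match, closes
            # any open entity (a simplification for the sketch).
            if start is not None:
                spans.append((" ".join(tokens[start:i]), cat, start, i))
            start, cat = None, None
    if start is not None:  # flush an entity that runs to the end
        spans.append((" ".join(tokens[start:]), cat, start, len(tokens)))
    return spans
```

For example, `decode_bio(["Ankara", "is", "in", "Turkey"], ["B-LOC", "O", "O", "B-LOC"])` yields the two location spans with their token boundaries, which is exactly the boundary-plus-category output an NER system must produce.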

Cited by 18 publications (11 citation statements) | References 27 publications
“…Currently, due to the increasing influence of pre-training models such as the Bidirectional Encoder Representations from Transformers (BERT) in natural language processing (NLP) research, another study ( Chen et al, 2021 ) introduced pre-training models in NER research to enhance NER models through the powerful semantic representation ability of pre-training models for semantic understanding of text, thus achieving better entity recognition results.…”
Section: Related Work
confidence: 99%
“…The advent of cross-lingual models has aided many downstream tasks plagued by low-resourcedness, such as document classification 48 , POS tagging 20 , sentiment analysis 49 , and named entity recognition 50 . The primary goal of these models is to learn transferable, language-generic knowledge encoded in sound embedding spaces (of high-resourced languages) obtained from large-enough language representations, and to inject this mined "knowledge" into the embedding spaces of low-resourced languages for use in downstream tasks.…”
Section: Related Work
confidence: 99%
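The knowledge-injection idea quoted above can be sketched as learning a map from a low-resource language's embedding space into a high-resource one from a small seed dictionary. The least-squares linear alignment below is a deliberately simplified stand-in for the cited cross-lingual models, which learn far richer shared representations; every name and number here is illustrative.

```python
import numpy as np

def align_embeddings(src, tgt):
    """Learn a linear map W so that src @ W approximates tgt.

    src: (n, d) embeddings of seed words in the low-resource language.
    tgt: (n, d) embeddings of their translations in the high-resource
    language. W then projects unseen low-resource words into the
    high-resource space, where downstream models (e.g. an NER tagger)
    already work well.
    """
    W, *_ = np.linalg.lstsq(src, tgt, rcond=None)
    return W

# Toy seed dictionary: 3 word pairs in 2-d spaces related by a rotation.
theta = np.pi / 4
rot = np.array([[np.cos(theta), -np.sin(theta)],
                [np.sin(theta),  np.cos(theta)]])
src = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
tgt = src @ rot  # targets are exactly the rotated sources
W = align_embeddings(src, tgt)

# A word unseen during alignment now lands at its target-space position.
new_word = np.array([2.0, -1.0])
projected = new_word @ W
```

Because the toy targets are an exact linear image of the sources, the recovered map matches the rotation; with real embeddings the fit is only approximate, which is why a seed dictionary of reasonable size matters.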
“…These IE applications include sequence tagging tasks such as Named-Entity Recognition (NER) and Part-of-Speech (POS) tagging. NER is a task that processes natural language, classifying and grouping, for example, words into categories (also known as phrase types) [20]. With the advent of big data and large datasets, classifying natural language in these datasets has become increasingly important.…”
Section: Introduction
confidence: 99%
“…tions are able to apply NER in customer support, content classification, and search and recommendation engines [21]. Furthermore, NER findings may be transferred to other NLP tasks such as MT, automatic text summarization, and knowledge base construction [20]. Lack of data severely impedes performance on NER tasks with low-resourced languages [20].…”
Section: Introduction
confidence: 99%