Context-Aware Bidirectional Neural Model for Sindhi Named Entity Recognition

Ali, Wazir; Kumar, Jay; Xu, Zenglin; Kumar, Rajesh; Ren, Yazhou

doi:10.3390/app11199038

Cited by 5 publications

(7 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For this research, 15k-sentences from the Punjabi data corpus and many newspapers were used, and they were annotated utilizing the open-source annotation tool Doccano. Ali et al [83] discussed pretrained Sindhi Glove (SdGlove), Sindhi fastText (Sdfast Text), task-oriented and CRL-based word representations on the recently presented SiNER dataset, the baselines and CaBiLSTM model are compared which has been proposed. After comparison of these two models, it achieved a high F1-score of 91.25% by using CaBiLSTM on the SiNER dataset with CRL.…”

Section: (C) Deep Learning Approachmentioning

confidence: 99%

“…The authors applied Bi-LSTM [85] based on DL which contains POS and CNN embedding character and overall accuracy has been calculated by using F1-score, recall and precision. Ali et al [2], [94] introduced a NER dataset for the limited resources Sindhi language with quality benchmarks with SiNER. It has 1,338 updates along with more than 1.35 million tokens that were gathered with the begin-inside-outside (BIO) tagging method used by Kawish and Awami Awaz Sindhi newspapers as the suggested dataset has a good potential of being a useful tool for statistical Sindhi language processing.…”

Section: (C) Deep Learning Approachmentioning

confidence: 99%

“…The system produced results utilizing hybrid unigrams and bigrams on the IJCNLP NE corpus and the CRL NE corpus of 92.65 and 87.6% and 92.47 and 86.83%, respectively. Ali et al [21] described the present challenges in developing the SNER system while taking the need for language processing tools into consideration as the purpose of this essay is to explore challenges and potential areas for further study in the subject of NER in the Sindhi language. Few existing works are discussed in table 6 related to hybrid approach (i.e., combination of any other two methods like ML, DL and rule-based approach).…”

Section: (D) Hybrid Approachmentioning

confidence: 99%

“…Litake et al [10] Hindi, Marathi IJCNLP 80% Mehta et al [102] Hindi, Indo-aryan Large Corpus 90.7% Saha et al [103] Hindi Test Corpus 73.87% Pant et al [47] Kumauni Annotated data Corpus 85% Krishnan et al [104] Malayalam Wikipedia, Google Graph F1=78% Liu et al [105] Urdu MULTIPLE, IDENTIFIER 80% Srivastava et al [106] Hindi IJCNLP-08 P=96.05, F=91.25%, R=86.90% Naz et al [107] Urdu IJCNLP and ACL corpus F=92.47%, R=95.57%, P=89.57% Ali et al [21] Sindhi IJCNLP-2008 82%…”

Section: Authors and References Languages Dataset Accuracymentioning

confidence: 99%

“…NER for Sindhi language would involve building a system that can accurately identify and classify named entities within Sindhi text. "It's worth mentioning that NER for low-resource languages like Sindhi can be challenging due to the limited availability of annotated data and linguistic resources [2], [9], [16], [21], [22]. Additionally, the accuracy of an NER system depends on the quality and coverage of the training data, as well as the complexity of the language and the types of entities we want to recognize".…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

State-of-art approach for Indian Language based on NER: Comprehensive Review

Pandey,

Nathani

2024

Preprint

View full text Add to dashboard Cite

Named Entity Recognition (NER) is a fundamental task of natural language processing (NLP) that focuses on the identification and classification of named entities such as name of individual persons, location, organization and dates within the text. NER plays a pivotal role in various NLP applications, including information extraction, question answering, text summarization and sentiment analysis. Natural language processing's (NLPs) fundamental issue is named entity recognition (NER). While extensive research has been conducted on NER for English and Hindi, the complexities of Indian languages present unique challenges that require customized solutions. Working with NER for Indian languages is a difficult endeavor with limited resources available. This article provides a comprehensive review of NER approaches tailored for Indian languages. Indian languages pose unique challenges to NER due to their rich morphological and syntactic variations, script diversity and limited annotated data availability. This paper reviews the various techniques and methodologies employed in NER for Indian languages, including rule-based, machine learning and deep learning approaches. It analyzes the strengths and limitations of each approach. Additionally, this article examines the recent advancements in transfer learning and multilingual models, showcasing their potential in improving NER performance across Indian languages. This paper aims to guide researchers and practitioners in the development of NER systems for Indian languages and foster further advancements in this field. This article also provides a comprehensive review of the diverse approaches employed for NER in Indian languages, highlighting the strength and limitations as well.

show abstract

Section: (C) Deep Learning Approachmentioning

confidence: 99%