2021
DOI: 10.3390/app11199038
|View full text |Cite
|
Sign up to set email alerts
|

Context-Aware Bidirectional Neural Model for Sindhi Named Entity Recognition

Abstract: Named entity recognition (NER) is a fundamental task in many natural language processing (NLP) applications, such as text summarization and semantic information retrieval. Recently, deep neural networks (NNs) with the attention mechanism yield excellent performance in NER by taking advantage of character-level and word-level representation learning. In this paper, we propose a deep context-aware bidirectional long short-term memory (CaBiLSTM) model for the Sindhi NER task. The model relies upon contextual repr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 37 publications
0
7
0
Order By: Relevance
“…For this research, 15k-sentences from the Punjabi data corpus and many newspapers were used, and they were annotated utilizing the open-source annotation tool Doccano. Ali et al [83] discussed pretrained Sindhi Glove (SdGlove), Sindhi fastText (Sdfast Text), task-oriented and CRL-based word representations on the recently presented SiNER dataset, the baselines and CaBiLSTM model are compared which has been proposed. After comparison of these two models, it achieved a high F1-score of 91.25% by using CaBiLSTM on the SiNER dataset with CRL.…”
Section: (C) Deep Learning Approachmentioning
confidence: 99%
See 4 more Smart Citations
“…For this research, 15k-sentences from the Punjabi data corpus and many newspapers were used, and they were annotated utilizing the open-source annotation tool Doccano. Ali et al [83] discussed pretrained Sindhi Glove (SdGlove), Sindhi fastText (Sdfast Text), task-oriented and CRL-based word representations on the recently presented SiNER dataset, the baselines and CaBiLSTM model are compared which has been proposed. After comparison of these two models, it achieved a high F1-score of 91.25% by using CaBiLSTM on the SiNER dataset with CRL.…”
Section: (C) Deep Learning Approachmentioning
confidence: 99%
“…The authors applied Bi-LSTM [85] based on DL which contains POS and CNN embedding character and overall accuracy has been calculated by using F1-score, recall and precision. Ali et al [2], [94] introduced a NER dataset for the limited resources Sindhi language with quality benchmarks with SiNER. It has 1,338 updates along with more than 1.35 million tokens that were gathered with the begin-inside-outside (BIO) tagging method used by Kawish and Awami Awaz Sindhi newspapers as the suggested dataset has a good potential of being a useful tool for statistical Sindhi language processing.…”
Section: (C) Deep Learning Approachmentioning
confidence: 99%
See 3 more Smart Citations