2020
DOI: 10.3390/info11010045

CWPC_BiAtt: Character–Word–Position Combined BiLSTM-Attention for Chinese Named Entity Recognition

Abstract: Named Entity Recognition (NER), which usually takes the output of Part-Of-Speech (POS) tagging as a linguistic feature, is a major task in Natural Language Processing (NLP). In this paper, we put forward a new comprehensive-embedding that considers three aspects, namely character-embedding, word-embedding, and pos-embedding, stitched together in the order given so that their dependencies are captured; based on this, we propose a new Character–Word–Position Combined BiLSTM-Attention (CWPC_BiAtt) model for the Chinese NER task. Comprehensive-embedding v…
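The architecture the abstract describes can be pictured concretely. Below is a minimal PyTorch sketch of the stitched-embedding idea: character-, word-, and POS-embeddings concatenated in that order and passed through a BiLSTM with an attention layer. All dimensions, vocabulary sizes, the attention form, and the linear tagging head are illustrative assumptions, not the authors' published configuration.

# Minimal sketch of the comprehensive-embedding idea from the abstract.
# Sizes, the attention form, and the tagging head are assumptions for
# illustration only, not the paper's actual configuration.
import torch
import torch.nn as nn

class CWPCBiAttSketch(nn.Module):
    def __init__(self, n_chars=5000, n_words=50000, n_pos=40,
                 d_char=64, d_word=128, d_pos=16, d_hidden=128, n_tags=9):
        super().__init__()
        self.char_emb = nn.Embedding(n_chars, d_char)
        self.word_emb = nn.Embedding(n_words, d_word)
        self.pos_emb = nn.Embedding(n_pos, d_pos)
        # BiLSTM over the stitched (concatenated) comprehensive-embedding.
        self.bilstm = nn.LSTM(d_char + d_word + d_pos, d_hidden,
                              bidirectional=True, batch_first=True)
        # Simple position-wise attention over BiLSTM states (assumed form).
        self.att_score = nn.Linear(2 * d_hidden, 1)
        self.tagger = nn.Linear(2 * d_hidden, n_tags)  # placeholder output layer

    def forward(self, chars, words, pos):
        # Stitch the three embeddings in the stated order: char, word, POS.
        x = torch.cat([self.char_emb(chars),
                       self.word_emb(words),
                       self.pos_emb(pos)], dim=-1)
        h, _ = self.bilstm(x)                          # (batch, seq, 2*d_hidden)
        weights = torch.softmax(self.att_score(h), dim=1)
        return self.tagger(h * weights)                # per-token tag logits

model = CWPCBiAttSketch()
toks = torch.zeros(2, 10, dtype=torch.long)            # dummy batch, length 10
logits = model(toks, toks, toks)                       # shape (2, 10, n_tags)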

Cited by 17 publications (5 citation statements)
References: 29 publications
“…In addition, Table 7 reports the entity-recognition F1 scores of the method proposed in this paper and of other deep-learning methods on public datasets from three different domains. These models are Lattice LSTM [30], an LSTM model that fully considers both word and character information; CAN_NER [31], a character-based model combining a convolutional neural network (CNN) with a local attention layer and a gated recurrent unit (GRU) with a global self-attention layer; CWPC_BiAtt [32], an attention-based BiLSTM model combining character, word, and position information; and ACNN [33]. As shown in Table 7, Albert-BiLSTM-MHA-CRF improves entity recognition on the Weibo and Resume datasets; compared with Bert-BiLSTM-CRF, its F1 score is higher by 5.51% and 0.41%, respectively.…”
Section: Evaluation Indexes and Experimental Results (mentioning)
confidence: 99%
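The comparisons in this statement are in terms of F1. As a reminder of what the reported deltas mean, here is the standard F1 formula in a few lines of Python; the numbers are toy values, not figures from Table 7:

# F1 is the harmonic mean of precision and recall (toy values below).
def f1(precision: float, recall: float) -> float:
    return 2 * precision * recall / (precision + recall)

print(f1(0.70, 0.65))  # ~0.674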
“…English provides the largest language datasets available for NER research because English is the most widely spoken language globally. NER research in other languages includes Arabic [64,117,125,132,255], Chinese [51,52,105,156,159,180,256], German [269], Korean [115,130,196,258], and Indian languages [35,112,119,141]. Among the NER studies in Bahasa Indonesia are those conducted by Wintaka et al. [284].…”
Section: Discussion (mentioning)
confidence: 99%
“…Entities in the farming domain include pests and diseases [99]; other examples are geology (e.g., rock, stratum, toponym) [48], sports science (e.g., competition name, level of competition, match time) [103], and the military domain (e.g., weapons, mission, location, organization) [104]. However, some studies cover several fields at once, such as those conducted by Zhong et al. [87], Johnson et al. [105], Jin et al. [106], Ekbal and Saha [107], Xu et al. [108], and Song et al. [109].…”
Section: NER Research Application Domain (mentioning)
confidence: 99%
“…The LSTM (Long Short-Term Memory) network is a special form of RNN (Recurrent Neural Network) that can better handle sequence data such as text [39]. Compared with the traditional RNN model, the LSTM mitigates the vanishing-gradient problem on long texts [40].…”
Section: BiLSTM Layer in Head Entity Identification (mentioning)
confidence: 99%
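The contrast drawn in this statement, a plain RNN versus the (bidirectional) LSTM used in the cited layer, can be made concrete in a few lines of PyTorch. Sizes below are arbitrary assumptions for demonstration; note that the BiLSTM output is twice the hidden size because forward and backward states are concatenated.

# Plain RNN vs. BiLSTM over the same dummy sequence (assumed sizes).
import torch
import torch.nn as nn

seq = torch.randn(1, 30, 50)                    # (batch, time, features)
rnn = nn.RNN(50, 64, batch_first=True)          # plain RNN: gradients vanish on long texts
bilstm = nn.LSTM(50, 64, bidirectional=True, batch_first=True)

rnn_out, _ = rnn(seq)                           # (1, 30, 64)
lstm_out, _ = bilstm(seq)                       # (1, 30, 128): forward + backward states
print(rnn_out.shape, lstm_out.shape)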