Proceedings of the 12th International Workshop on Semantic Evaluation 2018
DOI: 10.18653/v1/s18-1114

DM_NLP at SemEval-2018 Task 8: neural sequence labeling with linguistic features

Abstract: This paper describes our submissions for SemEval-2018 Task 8: Semantic Extraction from CybersecUrity REports using NLP. DM_NLP participated in two subtasks: SubTask 1 classifies whether a sentence is useful for inferring malware actions and capabilities, and SubTask 2 predicts token labels ("Action", "Entity", "Modifier" and "Others") for a given malware-related sentence. Since we leverage the results of SubTask 2 directly to infer the result of SubTask 1, the paper focuses on the system solving SubTask 2. By taking …

Cited by 7 publications (6 citation statements)
References 12 publications
“…To benefit from the information of both sides of a sentence, BiLSTM was introduced by Graves et al [11], enabling models to capture more information during training which achieved better results in the chunking task. Ma et al [23] and Huang et al [15] used BiLSTM to obtain word representations with respect to both right and left context and a subsequent CRF layer to consider sentence level tag information. Huang et al [15] also used SENNA [6] pre-trained embeddings.…”
Section: Related Work
confidence: 99%
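The BiLSTM-CRF design quoted above scores each token with per-tag emissions and then lets a CRF layer decode the whole tag sequence jointly, so that sentence-level tag dependencies are respected. As a minimal pure-Python sketch (the tag set, scores, and function name here are illustrative, not from the paper), the Viterbi decoding step a CRF adds on top of BiLSTM emissions looks like:

```python
# Minimal Viterbi decoder: the piece a CRF layer adds on top of
# per-token emission scores. All scores below are illustrative.

def viterbi_decode(emissions, transitions):
    """emissions: list of {tag: score} dicts, one per token;
    transitions: {(prev_tag, tag): score}. Returns the best tag path."""
    tags = list(emissions[0])
    # best path score ending in each tag at position 0
    score = {t: emissions[0][t] for t in tags}
    backptr = []
    for emit in emissions[1:]:
        new_score, ptr = {}, {}
        for t in tags:
            # best previous tag for reaching tag t at this position
            prev = max(tags, key=lambda p: score[p] + transitions[(p, t)])
            new_score[t] = score[prev] + transitions[(prev, t)] + emit[t]
            ptr[t] = prev
        score, backptr = new_score, backptr + [ptr]
    # trace back from the best final tag
    best = max(tags, key=score.get)
    path = [best]
    for ptr in reversed(backptr):
        path.append(ptr[path[-1]])
    return list(reversed(path))
```

The transition scores are what let the decoder prefer, say, an "Entity" tag after an "Action" tag even when the local emission score alone would pick otherwise.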
“…Huang et al [15] also used SENNA [6] pre-trained embeddings. Moreover, in the proposed model of Ma et al [23], a max-pooling and a convolutional layer were used to obtain character embeddings for each word. They also used the concatenation of character representations, linguistic features like POS and NER labels, and word embeddings to create a general embedding before feeding to BiLSTM.…”
Section: Related Work
confidence: 99%
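The character-level pipeline quoted above — a convolution over character vectors, max-pooling per filter, then concatenation with word embeddings and linguistic features like POS and NER labels — can be sketched in a few lines of plain Python. Vector sizes, the dot-product filters, and both function names here are illustrative assumptions, not the paper's actual parameters:

```python
# Sketch of a character-level CNN: slide each filter over windows of
# per-character vectors, max-pool each filter over positions, then
# concatenate the pooled values with word/POS/NER embeddings.

def conv_max_pool(char_vecs, filters, width=3):
    """char_vecs: list of equal-length float lists (one per character);
    filters: list of flat weight lists of length width * char_dim.
    Returns one max-pooled activation per filter."""
    pooled = []
    for w in filters:
        best = float("-inf")
        for i in range(len(char_vecs) - width + 1):
            # flatten the window of `width` character vectors
            window = [x for v in char_vecs[i:i + width] for x in v]
            best = max(best, sum(a * b for a, b in zip(w, window)))
        pooled.append(best)
    return pooled

def word_representation(char_vecs, word_emb, pos_emb, ner_emb, filters):
    # concatenate char-CNN output with word and linguistic-feature embeddings
    return conv_max_pool(char_vecs, filters) + word_emb + pos_emb + ner_emb
```

The concatenated vector is what would then be fed, token by token, into the BiLSTM.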
“…DM-NLP [10] used the predicted output labels from SubTask 2 to get the predictions for SubTask 1. They model this task as a sequence labelling task and used a hybrid approach with BiLSTM-CNN-CRF as mentioned in [11].…”
Section: Related Work
confidence: 99%
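One plausible reading of how SubTask 2 labels yield a SubTask 1 prediction (the exact criterion is an assumption here, not stated in the quoted text) is that a sentence counts as useful when at least one of its tokens received a label other than "Others":

```python
# Illustrative rule (an assumption): a sentence is relevant for
# SubTask 1 when SubTask 2 tagged any token as something other
# than "Others".

def sentence_is_relevant(token_labels):
    return any(label != "Others" for label in token_labels)
```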
“…Finally, a voting method is utilized to benefit from multiple models. Input Information: Based upon our previous work (Ma et al, 2018) on sequence labeling, our system incorporates four types of linguistic information: Part-of-Speech (POS) tags, NER labels, Chunking labels and ELMo (Peters et al, 2018). The former three are generated by open source tools.…”
Section: Detection In Main Body
confidence: 99%
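The voting step mentioned in this quote can be sketched as a per-token majority vote over the label sequences predicted by several models. Tie-breaking toward the label seen first is an assumption for this sketch; the paper's exact tie-breaking rule is not given in the quoted text:

```python
from collections import Counter

# Per-token majority vote over label sequences from several models.
# Ties break toward the earliest-seen label (an assumption;
# Counter.most_common keeps insertion order among equal counts).

def vote(sequences):
    voted = []
    for token_labels in zip(*sequences):
        voted.append(Counter(token_labels).most_common(1)[0][0])
    return voted
```

For example, three models voting on a two-token sentence each contribute one label per token, and the most frequent label per position wins.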