Yuxian Meng scite author profile

Many NLP tasks such as tagging and machine reading comprehension (MRC) are faced with the severe data imbalance issue: negative examples significantly outnumber positive ones, and the huge number of easy-negative examples overwhelms training. The most commonly used cross entropy criteria is actually accuracy-oriented, which creates a discrepancy between training and test. At training time, each training instance contributes equally to the objective function, while at test time F1 score concerns more about positive examples.

show abstract

Dice Loss for Data-imbalanced NLP Tasks

Li¹,

Sun²,

Meng³

et al. 2019

Preprint

View full text Add to dashboard Cite

A Unified MRC Framework for Named Entity Recognition

Li¹,

Feng²,

Meng³

et al. 2019

Preprint

View full text Add to dashboard Cite

The task of named entity recognition (NER) is normally divided into nested NER and flat NER depending on whether named entities are nested or not. Models are usually separately developed for the two tasks, since sequence labeling models, the most widely used backbone for flat NER, are only able to assign a single label to a particular token, which is unsuitable for nested NER where a token may be assigned several labels. 1 This paper includes material from the unpublished manuscript "Query-Based Named Entity Recognition".2 Xiaoya and Jingrong contribute equally to this paper. 3 Code is coming soon.

show abstract

BertGCN: Transductive Text Classification by Combining GNN and BERT

Lin

Meng²,

Sun

et al. 2021

114

View full text Add to dashboard Cite

In this work, we propose BertGCN, a model that combines large scale pretraining and transductive learning for text classification. Bert-GCN constructs a heterogeneous graph over the dataset and represents documents as nodes using BERT representations. By jointly training the BERT and GCN modules within Bert-GCN, the proposed model is able to leverage the advantages of both worlds: large-scale pretraining which takes the advantage of the massive amount of raw data and transductive learning which jointly learns representations for both training data and unlabeled test data by propagating label influence through graph convolution. Experiments show that BertGCN achieves SOTA performances on a wide range of text classification datasets. 1

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yuxian Meng

A Unified MRC Framework for Named Entity Recognition

Dice Loss for Data-imbalanced NLP Tasks

Dice Loss for Data-imbalanced NLP Tasks

A Unified MRC Framework for Named Entity Recognition

BertGCN: Transductive Text Classification by Combining GNN and BERT

Contact Info

Product

Resources

About