This paper presents ProphetNet, a new sequence-to-sequence pre-training model that introduces a novel self-supervised objective named future n-gram prediction together with a proposed n-stream self-attention mechanism. Instead of optimizing one-step-ahead prediction as in the traditional sequence-to-sequence model, ProphetNet is optimized by n-step-ahead prediction, which predicts the next n tokens simultaneously based on previous context tokens at each time step. The future n-gram prediction objective explicitly encourages the model to plan for future tokens and prevents overfitting on strong local correlations. We pre-train ProphetNet using a base-scale dataset (16GB) and a large-scale dataset (160GB), respectively. We then conduct experiments on the CNN/DailyMail, Gigaword, and SQuAD 1.1 benchmarks for abstractive summarization and question generation tasks. Experimental results show that ProphetNet achieves new state-of-the-art results on all these datasets compared to models using the same scale of pre-training corpus.
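The abstract gives no formula for the objective, but the idea of n-step-ahead prediction can be sketched as a weighted sum of cross-entropy losses, one per predicting stream. The following is a minimal NumPy sketch, not the paper's implementation; the function name, the stream-indexing convention (stream k at position t predicts the token at t + k + 1), and the per-stream weights `alphas` are illustrative assumptions.

```python
import numpy as np

def future_ngram_loss(stream_logits, tokens, alphas):
    """Sketch of a future n-gram prediction loss.

    stream_logits: list of n arrays, each of shape (T, V); by assumption,
        stream k's logits at position t score the token at position t+k+1.
    tokens: int array of length T + n holding the full target sequence.
    alphas: per-stream mixing weights for the n prediction streams.
    """
    total = 0.0
    for k, (logits, alpha) in enumerate(zip(stream_logits, alphas)):
        # Numerically stable log-softmax over the vocabulary axis.
        shifted = logits - logits.max(axis=-1, keepdims=True)
        log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
        # Stream k is trained to predict the token k+1 steps ahead.
        targets = tokens[k + 1 : k + 1 + logits.shape[0]]
        total += alpha * -log_probs[np.arange(len(targets)), targets].mean()
    return total
```

Setting n = 1 with a single unit weight recovers the ordinary next-token language-modeling loss, which is the baseline the abstract contrasts against.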
Keyphrases can provide highly condensed and valuable information that allows users to quickly acquire the main ideas. The task of automatically extracting them has received considerable attention in recent decades. Different from previous studies, which usually focus on automatically extracting keyphrases from documents or articles, in this study we considered the problem of automatically extracting keyphrases from tweets. Because of the length limitation of Twitter-like sites, the performance of existing methods usually drops sharply. We proposed a novel deep recurrent neural network (RNN) model that combines keywords and context information to address this problem. To evaluate the proposed method, we also constructed a large-scale dataset collected from Twitter. The experimental results showed that the proposed method performs significantly better than previous methods.
In this paper, we introduce XGLUE, a new benchmark dataset that can be used to train large-scale cross-lingual pre-trained models using multilingual and bilingual corpora and to evaluate their performance across a diverse set of cross-lingual tasks. Compared to GLUE (Wang et al., 2019), which is labeled in English for natural language understanding tasks only, XGLUE has two main advantages: (1) it provides 11 diversified tasks that cover both natural language understanding and generation scenarios; (2) for each task, it provides labeled data in multiple languages. We extend Unicoder, a recent cross-lingual pre-trained model, to cover both understanding and generation tasks, and evaluate it on XGLUE as a strong baseline. We also evaluate the base versions (12-layer) of Multilingual BERT, XLM, and XLM-R for comparison.
In microblogging services, users usually use hashtags to mark keywords or topics. With the rapid growth of social networks, the task of automatically recommending hashtags has received considerable attention in recent years. Previous works focused only on the use of textual information. However, many microblog posts contain not only text but also corresponding images. These images can provide additional information that is not included in the text, which could help improve the accuracy of hashtag recommendation. Motivated by the successful use of the attention mechanism, we propose a co-attention network that incorporates textual and visual information to recommend hashtags for multimodal tweets. Experimental results on data collected from Twitter demonstrate that the proposed method achieves better performance than state-of-the-art methods that use textual information only.
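The abstract does not specify the co-attention formulation, but the general pattern (text tokens attending over image regions and vice versa through a shared affinity matrix) can be sketched as follows. This is a minimal NumPy illustration under assumed shapes, not the paper's architecture; the function names and the bilinear affinity parameter `W` are invented for the sketch.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def co_attention(text_feats, img_feats, W):
    """Sketch of co-attention between modalities.

    text_feats: (Lt, d) features for Lt text tokens.
    img_feats:  (Li, d) features for Li image regions.
    W:          (d, d) assumed bilinear affinity parameter.
    """
    affinity = text_feats @ W @ img_feats.T      # (Lt, Li) token-region scores
    attn_img = softmax(affinity, axis=1)         # each token attends over regions
    attn_txt = softmax(affinity, axis=0).T       # each region attends over tokens
    img_ctx = attn_img @ img_feats               # (Lt, d) image context per token
    txt_ctx = attn_txt @ text_feats              # (Li, d) text context per region
    return img_ctx, txt_ctx
```

The two context matrices would then feed a downstream classifier over the hashtag vocabulary; that stage is omitted here.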
We develop a new paradigm for the task of joint entity relation extraction. It first identifies entity spans and then performs joint inference on entity types and relation types. To tackle the joint type inference task, we propose a novel graph convolutional network (GCN) running on an entity-relation bipartite graph. By introducing a binary relation classification task, we are able to utilize the structure of the entity-relation bipartite graph in a more efficient and interpretable way. Experiments on ACE05 show that our model outperforms existing joint models in entity performance and is competitive with the state of the art in relation performance.
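The abstract leaves the GCN details unspecified, but one layer of message passing on an entity-relation bipartite graph can be sketched as mean aggregation across the two node types. The following NumPy sketch is an illustrative assumption, not the paper's model; the function name, the degree-normalized update, and the tanh nonlinearity are all choices made for the example.

```python
import numpy as np

def bipartite_gcn_layer(ent_h, rel_h, adj, W_e, W_r):
    """Sketch of one GCN layer on an entity-relation bipartite graph.

    ent_h: (Ne, d) entity-node features.
    rel_h: (Nr, d) relation-candidate-node features.
    adj:   (Ne, Nr) incidence matrix; adj[i, j] = 1 if entity i
           participates in relation candidate j.
    W_e, W_r: (d, d) assumed per-direction weight matrices.
    """
    deg_e = np.maximum(adj.sum(axis=1, keepdims=True), 1)    # entity degrees
    deg_r = np.maximum(adj.sum(axis=0, keepdims=True), 1).T  # relation degrees
    # Entities aggregate from incident relation nodes, and vice versa.
    ent_new = np.tanh((adj @ rel_h) / deg_e @ W_e)
    rel_new = np.tanh((adj.T @ ent_h) / deg_r @ W_r)
    return ent_new, rel_new
```

The binary relation classification task mentioned in the abstract would correspond to pruning columns of `adj` (relation candidates scored as non-relations) before running such layers, which is what makes the bipartite structure sparse and interpretable.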