Wei Zhao scite author profile

Neural machine translation systems have become state-of-the-art approaches for Grammatical Error Correction (GEC) task. In this paper, we propose a copy-augmented architecture for the GEC task by copying the unchanged words from the source sentence to the target sentence. Since the GEC suffers from not having enough labeled training data to achieve high accuracy. We pre-train the copy-augmented architecture with a denoising auto-encoder using the unlabeled One Billion Benchmark and make comparisons between the fully pre-trained model and a partially pretrained model. It is the first time copying words from the source context and fully pretraining a sequence to sequence model are experimented on the GEC task. Moreover, We add token-level and sentence-level multi-task learning for the GEC task. The evaluation results on the CoNLL-2014 test set show that our approach outperforms all recently published state-of-the-art results by a large margin. The code and pre-trained models are released at https://github.com/zhawe01/fairseq-gec.

show abstract

Evaluation of the economic feasibility for the recycling of construction and demolition waste in China—The case of Chongqing

Zhao

Leeftink²,

Rotter

2010

Resources, Conservation and Recycling

257

View full text Add to dashboard Cite

Denoising based Sequence-to-Sequence Pre-training for Text Generation

Wang

Zhao²,

Jia³

et al. 2019

View full text Add to dashboard Cite

This paper presents a new sequence-tosequence (seq2seq) pre-training method PoDA (Pre-training of Denoising Autoencoders), which learns representations suitable for text generation tasks. Unlike encoder-only (e.g., BERT) or decoder-only (e.g., OpenAI GPT) pre-training approaches, PoDA jointly pretrains both the encoder and decoder by denoising the noise-corrupted text, and it also has the advantage of keeping the network architecture unchanged in the subsequent fine-tuning stage. Meanwhile, we design a hybrid model of Transformer and pointer-generator networks as the backbone architecture for PoDA. We conduct experiments on two text generation tasks: abstractive summarization, and grammatical error correction. Results on four datasets show that PoDA can improve model performance over strong baselines without using any task-specific techniques and significantly speed up convergence. 1

show abstract

A system dynamics model for evaluating the alternative of type in construction and demolition waste recycling center – The case of Chongqing, China

Zhao

Hong

Rotter

2011

Resources, Conservation and Recycling

143

View full text Add to dashboard Cite

Yuanfudao at SemEval-2018 Task 11: Three-way Attention and Relational Knowledge for Commonsense Machine Comprehension

Wang¹,

Sun²,

Zhao³

et al. 2018

View full text Add to dashboard Cite

This paper describes our system for SemEval-2018 Task 11: Machine Comprehension using Commonsense Knowledge (Ostermann et al., 2018b).We use Threeway Attentive Networks (TriAN) to model interactions between the passage, question and answers. To incorporate commonsense knowledge, we augment the input with relation embedding from the graph of general knowledge ConceptNet (Speer et al., 2017). As a result, our system achieves state-of-the-art performance with 83.95% accuracy on the official test data. Code is publicly available at https://github.com/ intfloat/commonsense-rc.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Wei Zhao

Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data

Evaluation of the economic feasibility for the recycling of construction and demolition waste in China—The case of Chongqing

Denoising based Sequence-to-Sequence Pre-training for Text Generation

A system dynamics model for evaluating the alternative of type in construction and demolition waste recycling center – The case of Chongqing, China

Yuanfudao at SemEval-2018 Task 11: Three-way Attention and Relational Knowledge for Commonsense Machine Comprehension

Contact Info

Product

Resources

About