Dawei Zhu scite author profile

Multilingual transformer models like mBERT and XLM-RoBERTa have obtained great improvements for many NLP tasks on a variety of languages. However, recent works also showed that results from high-resource languages could not be easily transferred to realistic, low-resource scenarios. In this work, we study trends in performance for different amounts of available resources for the three African languages Hausa, isiXhosa and Yorùbá on both NER and topic classification. We show that in combination with transfer learning or distant supervision, these models can achieve with as little as 10 or 100 labeled sentences the same performance as baselines with much more supervised training data. However, we also find settings where this does not hold. Our discussions and additional experiments on assumptions such as time and hardware restrictions highlight challenges and opportunities in low-resource learning.

show abstract

Concurrent Practical Byzantine Fault Tolerance for Integration of Blockchain and Supply Chain

Zhu

Yang

et al. 2021

ACM Trans. Internet Technol.

View full text Add to dashboard Cite

Currently, the integration of the supply chain and blockchain is promising, as blockchain successfully eliminates the bullwhip effect in the supply chain. Generally, concurrent Practical Byzantine Fault Tolerance (PBFT) consensus method, named C-PBFT, is powerful to deal with the consensus inefficiencies, caused by the fast node expansion in the supply chain. However, due to the tremendous complicated transactions in the supply chain, it remains challenging to select the credible primary peers in the concurrent clusters. To address this challenge, the peers in the supply chain are classified into several clusters by analyzing the historic transactions in the ledger. Then, the primary peer for each cluster is identified by reputation assessment. Finally, the performance of C-PBFT is evaluated by conducting experiments in Fabric.

show abstract

An End-to-End Dialogue State Tracking System with Machine Reading Comprehension and Wide & Deep Classification

Ma¹,

Zeng²,

Zhu³

et al. 2019

Preprint

View full text Add to dashboard Cite

Neural Data-to-Text Generation with LM-based Text Augmentation

Chang¹,

Shen

Zhu³

et al. 2021

View full text Add to dashboard Cite

For many new application domains for datato-text generation, the main obstacle in training neural models consists of a lack of training data. While usually large numbers of instances are available on the data side, often only very few text samples are available. To address this problem, we here propose a novel fewshot approach for this setting. Our approach automatically augments the data available for training by (i) generating new text samples based on replacing specific values by alternative ones from the same category, (ii) generating new text samples based on GPT-2, and (iii) proposing an automatic method for pairing the new text samples with data samples. As the text augmentation can introduce noise to the training data, we use cycle consistency as an objective, in order to make sure that a given data sample can be correctly reconstructed after having been formulated as text (and that text samples can be reconstructed from data).On both the E2E and WebNLG benchmarks, we show that this weakly supervised training paradigm is able to outperform fully supervised seq2seq models with less than 10% annotations. By utilizing all annotated data, our model can boost the performance of a standard seq2seq model by over 5 BLEU points, establishing a new state-of-the-art on both datasets.

show abstract

Image manipulation with natural language using Two-sided Attentive Conditional Generative Adversarial Network

2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Dawei Zhu

Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages

Concurrent Practical Byzantine Fault Tolerance for Integration of Blockchain and Supply Chain

An End-to-End Dialogue State Tracking System with Machine Reading Comprehension and Wide & Deep Classification

Neural Data-to-Text Generation with LM-based Text Augmentation

Image manipulation with natural language using Two-sided Attentive Conditional Generative Adversarial Network

Contact Info

Product

Resources

About