Henghui Zhu scite author profile

A key challenge for abstractive summarization is ensuring factual consistency of the generated summary with respect to the original document. For example, state-ofthe-art models trained on existing datasets exhibit entity hallucination, generating names of entities that are not present in the source document. We propose a set of new metrics to quantify the entity-level factual consistency of generated summaries and we show that the entity hallucination problem can be alleviated by simply filtering the training data.In addition, we propose a summary-worthy entity classification task to the training process as well as a joint entity and summary generation approach, which yield further improvements in entity level metrics.

show abstract

End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems

Shakeri¹,

Santos²,

Zhu³

et al. 2020

View full text Add to dashboard Cite

We propose an end-to-end approach for synthetic QA data generation. Our model comprises a single transformer-based encoderdecoder network that is trained end-to-end to generate both answers and questions. In a nutshell, we feed a passage to the encoder and ask the decoder to generate a question and an answer token-by-token. The likelihood produced in the generation process is used as a filtering score, which avoids the need for a separate filtering model. Our generator is trained by finetuning a pretrained LM using maximum likelihood estimation. The experimental results indicate significant improvements in the domain adaptation of QA models outperforming current state-of-the-art methods. * *equal contribution. † Siamak Shakeri is currently with Google. The work was done when he was at AWS AI.

show abstract

Improving Factual Consistency of Abstractive Summarization via Question Answering

Feng¹,

Santos²,

Zhu³

et al. 2021

View full text Add to dashboard Cite

A commonly observed problem with the stateof-the art abstractive summarization models is that the generated summaries can be factually inconsistent with the input documents. The fact that automatic summarization may produce plausible-sounding yet inaccurate summaries is a major concern that limits its wide application. In this paper we present an approach to address factual consistency in summarization. We first propose an efficient automatic evaluation metric to measure factual consistency; next, we propose a novel learning algorithm that maximizes the proposed metric during model training. Through extensive experiments, we confirm that our method is effective in improving factual consistency and even overall quality of the summaries, as judged by both automatic metrics and human evaluation.

show abstract

Supporting Clustering with Contrastive Learning

Zhang¹,

Feng²,

Wei³

et al. 2021

View full text Add to dashboard Cite

Unsupervised clustering aims at discovering the semantic categories of data according to some distance measured in the representation space. However, different categories often overlap with each other in the representation space at the beginning of the learning process, which poses a significant challenge for distance-based clustering in achieving good separation between different categories. To this end, we propose Supporting Clustering with Contrastive Learning (SCCL) -a novel framework to leverage contrastive learning to promote better separation. We assess the performance of SCCL on short text clustering and show that SCCL significantly advances the state-of-the-art results on most benchmark datasets with 3%−11% improvement on Accuracy and 4% − 15% improvement on Normalized Mutual Information. Furthermore, our quantitative analysis demonstrates the effectiveness of SCCL in leveraging the strengths of both bottom-up instance discrimination and top-down clustering to achieve better intracluster and inter-cluster distances when evaluated with the ground truth cluster labels 1 .

show abstract

Finite-time adaptive consensus of a class of multi-agent systems

Liu

Lü

et al. 2015

Sci. China Technol. Sci.

View full text Add to dashboard Cite

Enhancing Clinical BERT Embedding using a Biomedical Knowledge Base

Hao¹,

Zhu²,

Paschalidis³

2020

View full text Add to dashboard Cite

Domain knowledge is important for building Natural Language Processing (NLP) systems for low-resource settings, such as in the clinical domain. In this paper, a novel joint training method is introduced for adding knowledge base information from the Unified Medical Language System (UMLS) into language model pre-training for some clinical domain corpus. We show that in three different downstream clinical NLP tasks, our pre-trained language model outperforms the corresponding model with no knowledge base information and other state-of-the-art models. Specifically, in a natural language inference task applied to clinical texts, our knowledge base pre-training approach improves accuracy by up to 1.7%, whereas in clinical name entity recognition tasks, the F1-score improves by up to 1.0%. The pre-trained models are available at https://github.com/noc-lab/clinical-kb-bert.

show abstract

Synchronizing nonlinear complex networks via switching disconnected topology

Chen

Tan

et al. 2016

Automatica

View full text Add to dashboard Cite

Supporting Clustering with Contrastive Learning

Zhang

Feng

Wei

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Henghui Zhu

Entity-level Factual Consistency of Abstractive Text Summarization

End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems

Improving Factual Consistency of Abstractive Summarization via Question Answering

Supporting Clustering with Contrastive Learning

Finite-time adaptive consensus of a class of multi-agent systems

Enhancing Clinical BERT Embedding using a Biomedical Knowledge Base

Synchronizing nonlinear complex networks via switching disconnected topology

Supporting Clustering with Contrastive Learning

Contact Info

Product

Resources

About