A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond

Xiao, Yisheng; Wu, Lei; Guo, Junliang; Li, Juntao; Zhang, Min; Qin, Tao; Liu, Tie-Yan

doi:10.48550/arxiv.2204.09269

Cited by 3 publications

(3 citation statements)

References 116 publications

(228 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, due to the autoregressive nature, searching for the word at current position only considers the information from the left side, not the full context. Besides, the autoregressive nature tends to bring mode collapse problem [74], resulting in captions with less diversity. Moreover, the timecost of iterative gradient-update is high, especially for long captions.…”

Section: Bert Encodermentioning

confidence: 99%

See 1 more Smart Citation

ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing

Zequn¹,

Zhang²,

Wang³

et al. 2023

Preprint

View full text Add to dashboard Cite

Zero-shot capability has been considered as a new revolution of deep learning, letting machines work on tasks without curated training data. As a good start and the only existing outcome of zero-shot image captioning (IC), ZeroCap abandons supervised training and sequentially searches every word in the caption using the knowledge of large-scale pre-trained models. Though effective, its autoregressive generation and gradient-directed searching mechanism limit the diversity of captions and inference speed, respectively. Moreover, ZeroCap does not consider the controllability issue of zero-shot IC. To move forward, we propose a framework for Controllable Zero-shot IC, named ConZIC. The core of ConZIC is a novel samplingbased non-autoregressive language model named Gibbs-BERT, which can generate and continuously polish every word. Extensive quantitative and qualitative results demonstrate the superior performance of our proposed ConZIC for both zero-shot IC and controllable zero-shot IC. Especially, ConZIC achieves about 5× faster generation speed than ZeroCap, and about 1.5× higher diversity scores, with accurate generation given different control signals. Our code is available at https://github.com/joeyz0z/ConZIC.

show abstract

Section: Bert Encodermentioning

confidence: 99%

“…However, such autoregressive generation often results in issues such as sequential error accumulation and lack of diversity [13,74]. Further, for zero-shot IC, the sequential searching-order is lack of flexible.…”

Section: Sampling-based Language Model For P(x <1n> )mentioning

confidence: 99%

ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing

Zequn¹,

Zhang²,

Wang³

et al. 2023

Preprint

View full text Add to dashboard Cite

show abstract

“…Non-autoregressive (NAR) generation refers to a method of generating sequences where each element is generated independently, without relying on previously generated elements, allowing for faster parallel generation but potentially sacrificing the generation accuracy (Xiao et al, 2022). Recently, diffusion models have demonstrated powerful generative capabilities in image generation tasks, gradually becoming a new paradigm in generative models.…”

Section: Introductionmentioning

confidence: 99%

InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation

Wang,

Li,

2023

Findings of the Association for Computational Linguistics: EMNLP 2023

View full text Add to dashboard Cite

Diffusion models have garnered considerable interest in the field of text generation. Several studies have explored text diffusion models with different structures and applied them to various tasks, including named entity recognition and summarization. However, there exists a notable disparity between the "easy-first" text generation process of current diffusion models and the "keyword-first" natural text generation process of humans, which has received limited attention. To bridge this gap, we propose In-foDiffusion, a non-autoregressive text diffusion model. Our approach introduces a "keyinfofirst" generation strategy and incorporates a noise schedule based on the amount of text information. In addition, InfoDiffusion combines self-conditioning with a newly proposed partially noising model structure. Experimental results show that InfoDiffusion outperforms the baseline model in terms of generation quality and diversity, as well as exhibiting higher sampling efficiency. 1

show abstract

AutoPoster: A Highly Automatic and Content-aware Design System for Advertising Poster Generation

Lin,

Zhou,

et al. 2023

Proceedings of the 31st ACM International Conference on Multimedia

View full text Add to dashboard Cite

A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond

Cited by 3 publications

References 116 publications

ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing

ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing

InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation

AutoPoster: A Highly Automatic and Content-aware Design System for Advertising Poster Generation

Contact Info

Product

Resources

About