2021
DOI: 10.48550/arxiv.2112.13528
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction

Abstract: Vision transformer networks have shown superiority in many computer vision tasks. In this paper, we take a step further by proposing a novel generative vision transformer with latent variables following an informative energy-based prior for salient object detection. Both the vision transformer network and the energy-based prior model are jointly trained via Markov chain Monte Carlo-based maximum likelihood estimation, in which the sampling from the intractable posterior and prior distributions of the latent va… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 49 publications
(75 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?