2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.01367
|View full text |Cite
|
Sign up to set email alerts
|

Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving

Abstract: Image-to-image translation aims at translating a particular style of an image to another. The synthesized images can be more photo-realistic and identity-preserving by decomposing the image into content and style in a disentangled manner. While existing models focus on designing specialized network architecture to separate the two components, this paper investigates how to explicitly constrain the content and style statistics of images. We achieve this goal by transforming the input image into high frequency a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 47 publications
(33 citation statements)
references
References 50 publications
0
33
0
Order By: Relevance
“…More recently, Gal et al [9] propose a wavelet based image generation method. Jiang et al [15] introduce the focal frequency loss which focuses on hard frequencies.Meanwhile, there are some other work [4,17] on image restoration in the frequency domain.…”
Section: Loss Functionsmentioning
confidence: 99%
See 1 more Smart Citation
“…More recently, Gal et al [9] propose a wavelet based image generation method. Jiang et al [15] introduce the focal frequency loss which focuses on hard frequencies.Meanwhile, there are some other work [4,17] on image restoration in the frequency domain.…”
Section: Loss Functionsmentioning
confidence: 99%
“…After a lot of experiments, we find that the policy of mask generation noticeably influences the performance of the inpainting model as shown in Tab. 4.…”
Section: Training With General Maskmentioning
confidence: 99%
“…First, to obtain the latent code 𝑤 1 ∈ W + of the first frame, we make use of the existing GAN inversion model called FDIT [28], which aims to invert an image back to the latent space of a pre-trained generator. Then 𝑤 1 is used for initial cell 𝐿𝑆𝑇 𝑀 𝑖 to get the initialization state of motion prediction:…”
Section: B Motion Inferencementioning
confidence: 99%
“…The image-to-image translation task aims to match the style of images from a source domain into a target domain, while retaining the original structure of the source images [3,23,37]. For current unpaired image-to-image translation datasets, the generative adversarial nets (GAN) [10,31,43,41,24] are able to generate images that match the style, but the adversarial loss suffers from the collapse of the structure.…”
Section: Introductionmentioning
confidence: 99%
“…To alleviate this problem, a family of methods based on cycle-consistency learning [44,3,12,9,22] have been proposed. Cycle-consistency assumes that there exists a reversible relationship between the source image and the target image.…”
Section: Introductionmentioning
confidence: 99%