2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01044

ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation

Abstract: Existing text-guided image manipulation methods aim to modify the appearance of the image or to edit a few objects in a virtual or simple scenario, which is far from practical applications. In this work, we study a novel task on text-guided image manipulation on the entity level in the real world (eL-TGIM). The task imposes three basic requirements, (1) to edit the entity consistent with the text descriptions, (2) to preserve the entity-irrelevant regions, and (3) to merge the manipulated entity into the image…

Cited by 7 publications (7 citation statements)
References 57 publications

“…Following previous works (Wang et al 2022;, we employ the Inception Score (IS) (Salimans et al 2016), CLIP-sim, L2-error, and Manipulation Precision (MP) to evaluate the performance. Higher IS means higher fidelity of the edited images.…”
Section: Methods
Citation type: mentioning
confidence: 99%
“…But there is no ground truth (human-edited images) in this task, and calculating with source images makes the FID prefer lazy models which do not edit input images. So we adopt the IS to evaluate the image fidelity as previous works (Wang et al 2022;).…”
Section: Methods
Citation type: mentioning
confidence: 99%
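
For readers unfamiliar with the Inception Score (IS) named in the statements above, the following is a minimal sketch of its standard definition, IS = exp(E_x[KL(p(y|x) || p(y))]), computed from per-image class probabilities. It assumes the probabilities have already been produced by a classifier (in practice an Inception network, which is not included here). The function name `inception_score` and the toy inputs are illustrative only and are not taken from the cited papers.

```python
import math

def inception_score(probs):
    """Compute exp(mean_x KL(p(y|x) || p(y))) from per-image class probabilities.

    probs: list of rows, each row a probability distribution over classes.
    Hypothetical input: a real pipeline would obtain these rows from an
    Inception classifier applied to the edited images.
    """
    n = len(probs)
    num_classes = len(probs[0])
    # Marginal class distribution p(y), averaged over all images.
    marginal = [sum(row[k] for row in probs) / n for k in range(num_classes)]
    total_kl = 0.0
    for row in probs:
        # KL(p(y|x) || p(y)); terms with p(y|x) = 0 contribute nothing.
        total_kl += sum(p * math.log(p / m) for p, m in zip(row, marginal) if p > 0)
    return math.exp(total_kl / n)

# Toy inputs, made up for illustration.
confident_diverse = [[1.0, 0.0], [0.0, 1.0]]  # sharp predictions, varied classes
uninformative = [[0.5, 0.5], [0.5, 0.5]]      # uniform predictions
score_high = inception_score(confident_diverse)
score_low = inception_score(uninformative)
```

With confident and diverse predictions the score approaches the number of classes; with uniform predictions it equals 1. The computation uses only the generated images' predicted class distributions, so it needs no reference images, which fits the setting described above where no ground-truth edits are available.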
“…and image manipulation [1,2,36,39,47]. Computational approaches [5,9,30] for modifying the style and appearance of objects in natural photographs have made remarkable progress, allowing beginner users to accomplish a wide range of editing effects.…”
Section: Introduction
Citation type: mentioning
confidence: 99%