Proceedings of the 30th ACM International Conference on Multimedia 2022
DOI: 10.1145/3503161.3548086
|View full text |Cite
|
Sign up to set email alerts
|

PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(17 citation statements)
references
References 56 publications
0
17
0
Order By: Relevance
“…Specifically, the PNG task seeks to segment objects and regions in an image corresponding to nouns in its long text description. Numerous studies have been conducted on this task [10,13,53]. González et al [13] first introduced this new task, establishing a benchmark that includes new standard data and evaluation methods, and proposed a robust baseline method as the foundation for future work.…”
Section: Related Work 21 Panoptic Narrative Groundingmentioning
confidence: 99%
See 4 more Smart Citations
“…Specifically, the PNG task seeks to segment objects and regions in an image corresponding to nouns in its long text description. Numerous studies have been conducted on this task [10,13,53]. González et al [13] first introduced this new task, establishing a benchmark that includes new standard data and evaluation methods, and proposed a robust baseline method as the foundation for future work.…”
Section: Related Work 21 Panoptic Narrative Groundingmentioning
confidence: 99%
“…González et al [13] first introduced this new task, establishing a benchmark that includes new standard data and evaluation methods, and proposed a robust baseline method as the foundation for future work. To address the limitations of the previous twostage approach, such as low-quality proposals and spatial details loss, Ding et al [10] proposed a one-stage Pixel-Phrase Matching Network that directly matches each phrase to its corresponding pixels and outputs panoptic segmentation. Concurrently, Wang et al [53] proposed a similar one-stage network for real-time PNG, but with a greater focus on the real-time performance of the model.…”
Section: Related Work 21 Panoptic Narrative Groundingmentioning
confidence: 99%
See 3 more Smart Citations