2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00471
|View full text |Cite
|
Sign up to set email alerts
|

Counterfactual Critic Multi-Agent Training for Scene Graph Generation

Abstract: Scene graphs -objects as nodes and visual relationships as edges -describe the whereabouts and interactions of objects in an image for comprehensive scene understanding. To generate coherent scene graphs, almost all existing methods exploit the fruitful visual context by modeling message passing among objects. For example, "person" on "bike" can help to determine the relationship "ride", which in turn contributes to the confidence of the two objects. However, we argue that the visual context is not properly le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
105
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
4
1

Relationship

2
8

Authors

Journals

citations
Cited by 150 publications
(111 citation statements)
references
References 60 publications
0
105
0
Order By: Relevance
“…This task is to produce graph representations of images in terms of objects and their relationships. Scene graphs have been shown effective in boosting several vision-language tasks [14,25,28,5]. To the best of our knowledge, we are the first to design neural module networks that can reason over scene graphs.…”
Section: Related Workmentioning
confidence: 99%
“…This task is to produce graph representations of images in terms of objects and their relationships. Scene graphs have been shown effective in boosting several vision-language tasks [14,25,28,5]. To the best of our knowledge, we are the first to design neural module networks that can reason over scene graphs.…”
Section: Related Workmentioning
confidence: 99%
“…To the best of our knowledge, there are only two exceptions among all NLVL models: RWM and SM-RL (Wang et al, 2019), which are not under either top-down or bottomup frameworks. They both formulate NLVL as a sequential decision making problem, solved by reinforcement learning, e.g., actor critic (Chen et al, 2019b). The action space for each step is a set of handcraft-designed temporal box transformations.…”
Section: Natural Language Video Localizationmentioning
confidence: 99%
“…However, DL models which have no interaction with the market has a natural disadvantage in decision making problem like PM. Reinforcement learning algorithms have been proved effective in decision making problems in recent years and deep reinforcement learning (DRL) (Chen et al 2019), the integration of DL and RL, is widely used in the financial field. For instance, (Almahdi and Yang 2017) proposed a recurrent reinforcement learning (RRL) method, with a coherent riskadjusted performance objective function named the Calmar ratio, to obtain both buy and sell signals and asset allocation weights.…”
Section: Related Workmentioning
confidence: 99%