2020
DOI: 10.1007/978-3-030-58539-6_19
|View full text |Cite
|
Sign up to set email alerts
|

Semantic Curiosity for Active Visual Learning

Abstract: In this paper, we study the task of embodied interactive learning for object detection. Given a set of environments (and some labeling budget), our goal is to learn an object detector by having an agent select what data to obtain labels for. How should an exploration policy decide which trajectory should be labeled? One possibility is to use a trained object detector's failure cases as an external reward. However, this will require labeling millions of frames required for training RL policies, which is infeasi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
40
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 40 publications
(43 citation statements)
references
References 44 publications
(49 reference statements)
1
40
0
Order By: Relevance
“…These methods often assume access to data from the test environment to start with a viable graph, which may not be available in a new environment. Some works have studied this unseen setting by predicting explorable areas for semantically rich parts of the environment to accelerate visual exploration [31,32]. While these methods can yield promising results in a variety of domains, they come at the cost of high sample complexity (over 10M samples) [23], making them difficult to use in the real world-the most performant algorithms take 10-20 minutes to find goals up to 50m away [33].…”
Section: Roadmap Hintmentioning
confidence: 99%
“…These methods often assume access to data from the test environment to start with a viable graph, which may not be available in a new environment. Some works have studied this unseen setting by predicting explorable areas for semantically rich parts of the environment to accelerate visual exploration [31,32]. While these methods can yield promising results in a variety of domains, they come at the cost of high sample complexity (over 10M samples) [23], making them difficult to use in the real world-the most performant algorithms take 10-20 minutes to find goals up to 50m away [33].…”
Section: Roadmap Hintmentioning
confidence: 99%
“…Maximizing epistemic uncertainty is used as a proxy for maximizing information gain [47,41]. We use this uncertainty objective to actively fine-tune our models with a training procedure similar to the active learning approaches leveraged during training presented by [7,13,46]. In particular, [13] uses this active training method to improve performance on a semantic segmentation model, leading us to initially hypothesize analogous results would be possible for our semantic hallucination task.…”
Section: Related Workmentioning
confidence: 99%
“…We use this uncertainty objective to actively fine-tune our models with a training procedure similar to the active learning approaches leveraged during training presented by [7,13,46]. In particular, [13] uses this active training method to improve performance on a semantic segmentation model, leading us to initially hypothesize analogous results would be possible for our semantic hallucination task. We also use this epistemic uncertainty estimate to construct confidence bounds for our estimated probability distribution which we use to select goals for target-driven navigation at test time.…”
Section: Related Workmentioning
confidence: 99%
“…Practical considerations for curiosity have been listed in recent work [63], such as using Proximal Policy Optimization (PPO) for policy optimisation. Curiosity has been used to generate more advanced maps like semantic maps in recent work [66]. Stochasticity poses a serious challenge in the curiosity approach, since the forward-dynamics model can exploit stochasticity [63] for high prediction errors (i.e.…”
Section: Approachesmentioning
confidence: 99%