2018
DOI: 10.1038/s41467-018-06217-x

Finding any Waldo with zero-shot invariant and efficient visual search

Abstract: Searching for a target object in a cluttered scene constitutes a fundamental challenge in daily vision. Visual search must be selective enough to discriminate the target from distractors, invariant to changes in the appearance of the target, efficient to avoid exhaustive exploration of the image, and must generalize to locate novel target objects with zero-shot training. Previous work on visual search has focused on searching for perfect matches of a target after extensive category-specific training. Here, we …

Cited by 39 publications (82 citation statements)
References 50 publications (117 reference statements)
“…Related recent work by Adeli and Zelinsky provided a biologically inspired implementation of biased competition theory, whereby the multiple objects in a display compete with each other for attention and a top-down signal is used to disambiguate and bias this competition in favor of the sought target. 125 Such feature-based modulation is more efficient when applied at later stages of the visual hierarchy, 124,126 which is consistent with physiological observations showing that both spatial and feature-based attention is considerably weaker in early visual cortical areas compared with higher visual cortical areas.…”
Section: Attention and Search; citation type: supporting
confidence: 83%
“…In stark contrast, Zhang et al. show that their model can rapidly find target objects after a single exposure to them 124 …”
Section: The Role Of Recurrence Beyond Recognition; citation type: mentioning
confidence: 99%
“…Some eye movement characteristics, such as exploratory eye movements, are known to change with development and can be changed with reinforcement learning. Perhaps by combining findings from recent mathematical models of visual search and exploration, development of “eye movement training programs” for schizophrenia aimed at improving visual cognition or social functioning may become possible in the future. The development of biomarkers that can be used for clinical and personal recovery would be of great benefit for both the individual with mental illness and their supporters.…”
Section: Future Clinical Implementations; citation type: mentioning
confidence: 99%