2014
DOI: 10.21236/ada611903
|View full text |Cite
|
Sign up to set email alerts
|

Visual Information Theory and Visual Representation for Achieving Provable Bounds in Vision-Based Control and Decision

Abstract: This project pursued the development of representations of visual data suitable for control and decision tasks. The fundamental premise is that traditional notions of information developed in support of communication engineering-where the task is reproduction of the source data, and nuisance factors can be easily characterized statisticallyare unsuited to visual inference, where the task is decision or control, and the data formation process include scaling (that makes the continuous limit relevant) and occlus… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…Note that such objects often exhibit complex geometry, topology, and photometry, thus precluding the use of off-the-shelf laser scanners (due to specular reflections); volume displacement techniques, e.g., submerging objects in water, cannot be easily employed as many objects either float (e.g., apples), absorb water (e.g., cardboard packaging for foodstuffs, stuffed animals), or are permanently damaged by water (e.g., hand-held consumer electronics). Further, we wished to measure volume in a manner as analogous as possible to the way in which humans do so without access to haptic information, i.e., on the basis of visual information alone [ 24 ]. For example, visual inspection prior to lifting would provide no information about internal cavities (as in hollow or porous objects).…”
Section: Methodsmentioning
confidence: 99%
“…Note that such objects often exhibit complex geometry, topology, and photometry, thus precluding the use of off-the-shelf laser scanners (due to specular reflections); volume displacement techniques, e.g., submerging objects in water, cannot be easily employed as many objects either float (e.g., apples), absorb water (e.g., cardboard packaging for foodstuffs, stuffed animals), or are permanently damaged by water (e.g., hand-held consumer electronics). Further, we wished to measure volume in a manner as analogous as possible to the way in which humans do so without access to haptic information, i.e., on the basis of visual information alone [ 24 ]. For example, visual inspection prior to lifting would provide no information about internal cavities (as in hollow or porous objects).…”
Section: Methodsmentioning
confidence: 99%
“…Much of the research on perception and representation learning, both in ML and neuroscience, has focused on object recognition. In ML, this line of research has historically emphasized the importance of learning representations that are invariant to transformations like pose or illumination (Lowe, 1999 ; Dalal and Triggs, 2005 ; Sundaramoorthi et al, 2009 ; Soatto, 2010 ; Krizhevsky et al, 2012 ). In this framework, transformations are considered nuisance variables to be thrown away ( Figures 1A , 2B ).…”
Section: What Are Symmetries?mentioning
confidence: 99%
“…b) Task: From a given sample of data z, "accomplishing a task" [30] refers to the ability to infer the corresponding label y ∈ Y. While in other works [27]- [29] the Markov chain x → z → y reads data → representation → task, in our multi-sensor setting it refers to scene → data → task.…”
Section: Problem Formalizationmentioning
confidence: 99%