Actionable Information in Vision

Soatto, Stefano

doi:10.1007/978-3-642-28661-2_2

Cited by 41 publications

(38 citation statements)

References 70 publications

“…Already the image, in the local frame, is by construction invariant to similarity transformations. To achieve invariance to contrast, we replace the image with the gradient direction at each point, since the gradient direction is dual to the geometry of the level lines which is a maximal contrast invariant statistic (Soatto 2009). However, instead of coarsely binning the descriptor to achieve some kind of insensitivity to viewpoint changes beyond similarities, as in SIFT and HOG (Dalal and Triggs 2005), we have the luxury of tracking, which gives us samples of the image in the local frame Fig.…”

Section: Feature Representation (mentioning)

confidence: 99%
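
The contrast-invariance claim in the statement above can be checked numerically. The following is a minimal sketch, not the citing paper's implementation: the synthetic image, the helper name `gradient_direction`, and the two contrast changes are all assumptions made for illustration. It relies on the fact that for a monotonically increasing contrast change h, the gradient of h(I) is h'(I) times the gradient of I, so its direction is unchanged. With finite differences this holds exactly (up to rounding) for affine changes and only approximately for nonlinear ones.

```python
import numpy as np

def gradient_direction(image, eps=1e-8):
    """Unit gradient vectors, shape (H, W, 2); zero where the gradient vanishes."""
    gy, gx = np.gradient(image.astype(float))   # finite-difference derivatives
    mag = np.hypot(gx, gy)
    valid = mag > eps
    safe = np.where(valid, mag, 1.0)             # avoid division by zero
    unit = np.stack([gx, gy], axis=-1) / safe[..., None]
    return np.where(valid[..., None], unit, 0.0)

# Hypothetical smooth test image (an assumption, not data from the source).
ys, xs = np.mgrid[0:32, 0:32] / 31.0
image = np.sin(3.0 * xs) + 0.5 * np.cos(2.0 * ys)

d_ref = gradient_direction(image)

# Affine contrast change with positive gain: direction is preserved exactly.
d_affine = gradient_direction(2.5 * image + 0.3)
max_affine_diff = float(np.abs(d_ref - d_affine).max())

# Nonlinear monotone contrast change: direction is preserved approximately.
d_exp = gradient_direction(np.exp(image))
min_cos_nonlinear = float((d_ref * d_exp).sum(axis=-1).min())

print("max direction difference (affine):", max_affine_diff)
print("min cosine similarity (exp):", min_cos_nonlinear)
```

The gradient magnitude, by contrast, scales with the gain of the contrast change, which is why the descriptor described in the statement keeps only the direction.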

Visual-inertial navigation, mapping and localization: A scalable real-time causal approach

Jones

Soatto

2011

The International Journal of Robotics Research

Self Cite

We present a model to estimate motion from monocular visual and inertial measurements. We analyze the model and characterize the conditions under which its state is observable and its parameters are identifiable. These include the unknown gravity vector and the unknown transformation between the camera coordinate frame and the inertial unit. We show that it is possible to estimate both state and parameters as part of an on-line procedure, but only provided that the motion sequence is "rich enough," a condition that we characterize explicitly. We then describe an efficient implementation of a filter to estimate the state and parameters of this model, including gravity and camera-to-inertial calibration. It runs in real time on an embedded platform, and its performance has been tested extensively. We report experiments of continuous operation, without failures, re-initialization, or re-calibration, on paths of length up to 30 km. We also describe an integrated approach to "loop closure," that is, the recognition of previously seen locations and the topological re-adjustment of the traveled path. It represents visual features relative to the global orientation reference provided by the gravity vector estimated by the filter, and relative to the scale provided by their known position within the map; these features are organized into "locations" defined by visibility constraints, represented in a topological graph, where loop closure can be performed without the need to re-compute past trajectories or perform bundle adjustment. The software infrastructure as well as the embedded platform is described in detail in a technical report (Jones and Soatto 2009).

“…Optimal Rapidly-exploring Random Trees (RRT*s) [10] have been widely used in path planning problems and their extension to Rapidly-exploring Random Belief Trees (RRBTs) [7] takes pose uncertainty into account and avoids collisions. Selecting sequences of viewpoints that optimize for a certain task (e.g., pose estimation or map uncertainty minimization) is referred to as active perception [11,12]. While previous papers on active perception relied on using range sensors (e.g., [8]), Davison and Murray [13] were among the first to use vision sensors (a stereo camera setup) to select where the camera should look to reduce the pose drift during visual SLAM.…”

Section: Related Work (mentioning)

confidence: 99%

“…The fundamental steps of the perception-aware RRT* are summarized in Algorithm 1. At each iteration, it samples a new state from the state space and connects it to the nearest vertex (lines 3-19). Next, the function Near() checks on the vertices within a ball, centered at the sampled state (see [10]), and propagate the pose covariance from these vertices to the newly sampled one.…”

Section: Dense Image-to-model Alignment (mentioning)

confidence: 99%

Exploiting Photometric Information for Planning Under Uncertainty

Delmerico

Werlberger

Valigi

et al. 2017

Springer Proceedings in Advanced Robotics

Vision-based localization systems rely on highly-textured areas for achieving an accurate pose estimation. However, most previous path planning strategies propose to select trajectories with minimum pose uncertainty by leveraging only the geometric structure of the scene, neglecting the photometric information (i.e., texture). Our planner exploits the scene's visual appearance (i.e., the photometric information) in combination with its 3D geometry. Furthermore, we assume that we have no prior knowledge about the environment given, meaning that there is no pre-computed map or 3D geometry available. We introduce a novel approach to update the optimal plan on-the-fly, as new visual information is gathered. We demonstrate our approach with real and simulated Micro Aerial Vehicles (MAVs) that perform perception-aware path planning in real-time during exploration. We show significantly reduced pose uncertainty over trajectories planned without considering the perception of the robot.

Supplementary Material: A video showing the results of our monocular depth estimation approach is available at https://www.youtube.com/watch?v=5UmEw8LDJCI.

“…Thus this "control-authority/actionable information" tradeoff extends "rate/distortion" theory when the underlying task is not the storage or transmission of data, but its use in decision and control tasks. This construction is described in [22].…”

Section: Invariance In Representation (mentioning)

confidence: 99%

Dynamic Vision for Control

Soatto

2012

Self Cite
