Robotics: Science and Systems III 2007
DOI: 10.15607/rss.2007.iii.041
Active Policy Learning for Robot Planning and Exploration under Uncertainty

Abstract: This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partially observed sequential decision processes. The algorithm is tested in the domain of robot navigation and exploration under uncertainty, where the expected cost is a function of the belief state (filtering distribution). This filtering distribution is in turn nonlinear and subject to discontinuities, which arise because of constraints in the robot motion and control models. As a result, the expected cost is …
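The abstract's core setup, an expected cost that depends on the belief (filtering) state and is estimated by simulation, can be sketched as below. Everything concrete here (the 1-D toy motion and observation models, the linear feedback policy, the variance-of-belief cost) is an illustrative assumption, not the paper's actual models.

```python
import numpy as np

def rollout_cost(policy_params, horizon=10, n_particles=100, seed=0):
    """One simulated rollout; returns the average belief uncertainty (toy cost)."""
    rng = np.random.default_rng(seed)
    x = 0.0                                            # true (hidden) state, 1-D toy
    particles = rng.normal(0.0, 1.0, n_particles)      # particle belief over x
    total = 0.0
    for _ in range(horizon):
        u = policy_params[0] * np.mean(particles)      # assumed linear feedback policy
        x = x + u + rng.normal(0.0, 0.1)               # toy motion model with noise
        particles = particles + u + rng.normal(0.0, 0.1, n_particles)
        z = x + rng.normal(0.0, 0.5)                   # noisy observation
        w = np.exp(-0.5 * ((z - particles) / 0.5) ** 2) + 1e-12
        w /= w.sum()
        particles = rng.choice(particles, n_particles, p=w)  # resample (filtering step)
        total += np.var(particles)                     # cost = belief uncertainty
    return total / horizon

def expected_cost(policy_params, n_rollouts=20):
    """Monte Carlo estimate of the expected cost of a policy, as in the abstract."""
    return float(np.mean([rollout_cost(policy_params, seed=s)
                          for s in range(n_rollouts)]))
```

The resampling step is what makes the cost a non-smooth function of the policy parameters, which is the difficulty the abstract alludes to.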

Cited by 113 publications (107 citation statements) · References 28 publications (32 reference statements)
“…This idea has been pursued in some works [67, 94-96, 106]. Huang et al. [67] introduced a discussion of the problem of multi-step look-ahead exploration in the context of SLAM, arguing that multi-step active SLAM is possible when the current estimation error is small, the probability of observing new features is low, and the computational capability is high.…”
Section: Action Selection (mentioning)
confidence: 99%
“…In the work presented in [105, 106], Martinez-Cantin et al. proposed a reinforcement learning approach to solve the problem of exploration for SLAM; their technique is based on the work presented in [114]. They employ a direct policy search approach [122], where the value function is approximated using Gaussian processes (GP).…”
Section: Action Selection (mentioning)
confidence: 99%
“…Active learning differs, however, in that the aim is only to poll the user when the information returned is useful (above a threshold or according to some constrained budget). To the best of our knowledge it is novel in the area of thermal comfort modelling, but it has a long history and has been applied in many fields such as robot control [24], fault detection [25], and as a general optimisation approach [26], amongst others [27][28][29].…”
Section: Related Work (mentioning)
confidence: 99%
“…Note that it is through ζ x that C1 and C2 may be considered in the model. The maximum a-posteriori estimates for the process parameters are [24]:…”
Section: Gaussian Process Models (mentioning)
confidence: 99%
“…In our case, the expected number of landmarks to see and a very rough uniform disposition of them in the environment are our initial conditions. Several authors make such an assumption, either with an a priori grid-based discretization of the environment [14], [20] or by adding uniformly distributed unvisited landmarks as vague priors [21], [22].…”
Section: Action Selection (mentioning)
confidence: 99%