2019
DOI: 10.1109/lra.2019.2903259
Bi-Directional Value Learning for Risk-Aware Planning Under Uncertainty

Abstract: Decision-making under uncertainty is a crucial ability for autonomous systems. In its most general form, this problem can be formulated as a Partially Observable Markov Decision Process (POMDP). The solution policy of a POMDP can be implicitly encoded as a value function. In partially observable settings, the value function is typically learned via forward simulation of the system evolution. Focusing on accurate and long-range risk assessment, we propose a novel method, where the value function is learned in d…
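To make the abstract's notion of "learning the value function via forward simulation" concrete, the sketch below estimates the value of a belief by Monte Carlo rollouts in a tiny randomly generated discrete model. This is only an illustrative toy example, not the paper's bi-directional algorithm; the model arrays T and R, the discount factor, and the random rollout policy are hypothetical placeholders.

# Illustrative sketch only (not the paper's bi-directional method): estimate the
# value of a belief by forward simulation (Monte Carlo rollouts) in a tiny,
# randomly generated discrete model. All model arrays and the policy are placeholders.
import numpy as np

rng = np.random.default_rng(0)

n_states, n_actions = 3, 2
T = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))  # T[s, a, s']
R = rng.normal(size=(n_states, n_actions))                        # R[s, a]
gamma = 0.95

def rollout_value(belief, policy, horizon=50, n_rollouts=500):
    """Average discounted return of trajectories forward-simulated from `belief`."""
    total = 0.0
    for _ in range(n_rollouts):
        s = rng.choice(n_states, p=belief)      # sample an initial state from the belief
        ret, discount = 0.0, 1.0
        for _ in range(horizon):
            a = policy(s)                       # placeholder policy (state-based for simplicity)
            ret += discount * R[s, a]
            s = rng.choice(n_states, p=T[s, a])
            discount *= gamma
        total += ret
    return total / n_rollouts

# Example: value of the uniform belief under a uniformly random policy.
uniform_belief = np.ones(n_states) / n_states
print(rollout_value(uniform_belief, policy=lambda s: rng.integers(n_actions)))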

Cited by 31 publications (24 citation statements)
References 21 publications
“…, y_t). The belief state b_t can be recursively updated with the following transition function τ (Kim et al., 2019)…”
Section: Preliminaries (mentioning)
confidence: 99%
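As a concrete illustration of the recursive belief update referenced in the excerpt above, here is a minimal Bayes-filter sketch for a discrete POMDP. The array layout (T[s, a, s'] for transitions, Z[s', a, o] for observation likelihoods) is an assumption made for this example, not notation taken from the cited paper.

import numpy as np

def belief_update(b, a, o, T, Z):
    """Recursive update tau: b'(s') ∝ Z[s', a, o] * sum_s T[s, a, s'] * b(s)."""
    predicted = T[:, a, :].T @ b        # prediction step: marginalize the previous state
    updated = Z[:, a, o] * predicted    # correction step: weight by observation likelihood
    return updated / updated.sum()      # normalize so b' is a probability distribution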
“…Various types of reward modification in POMDPs have been investigated in previous research efforts (Lee et al., 2018; Kim et al., 2019). Typically, the reward function in POMDPs is designed to solve the stochastic shortest path problem, where the goal is to compute a feedback plan that reaches a target state from a known initial state by maximizing the expected total reward.…”
Section: Related Work (mentioning)
confidence: 99%
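A common way to realize the stochastic-shortest-path reward design mentioned above is a per-step cost plus a terminal bonus at the target and a penalty for failure states, so that maximizing expected total reward favors short, low-risk paths. The grid cells and numeric values below are illustrative placeholders, not taken from the cited works.

# Hypothetical grid-world reward in the stochastic-shortest-path style.
GOAL = (4, 4)            # target cell (placeholder)
STEP_COST = -1.0         # per-step cost encourages shorter paths
COLLISION_COST = -100.0  # penalty for entering an obstacle cell
GOAL_REWARD = 50.0       # terminal bonus for reaching the target

def reward(cell, obstacle_cells):
    """Reward whose maximization yields short, collision-averse paths to GOAL."""
    if cell == GOAL:
        return GOAL_REWARD
    if cell in obstacle_cells:
        return COLLISION_COST
    return STEP_COST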
“…For replanning purposes, forward search algorithms can be used in an RHC scheme. Recently, methods using RHC have been extended to the belief space as well as to dynamic environments (Agha‐mohammadi, Agarwal, Kim, Chakravorty, & Amato, ; Chakravorty & Erwin, ; Erez & Smart, ; He, Brunskill, & Roy, ; Kim, Thakker, & Agha‐mohammadi, ; Platt, Tedrake, Kaelbling, & Lozano‐Perez, ; Toit & Burdick, ). In an RHC scheme, optimization is performed only within a limited horizon; the system optimizes over that horizon, executes the next immediate action, and then moves the optimization horizon one step forward before repeating the process.…”
Section: Related Work (mentioning)
confidence: 99%
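The RHC loop described in the excerpt above can be sketched schematically as follows. plan_over_horizon, execute_and_update_belief, and at_goal are hypothetical stand-ins for a belief-space planner, the action execution plus belief update, and a termination test; none of them refer to an actual API from the cited works.

# Schematic receding-horizon control loop: optimize over a short horizon,
# execute only the first planned action, shift the horizon forward, and repeat.
def receding_horizon_control(belief, horizon=5, max_steps=200):
    for _ in range(max_steps):
        plan = plan_over_horizon(belief, horizon)             # optimize within the limited horizon
        belief = execute_and_update_belief(belief, plan[0])   # take the next immediate action
        if at_goal(belief):                                   # stop once the goal is reached
            break
    return belief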
“…For example, the works in [8], [9] improved the performance of VO localization by actively choosing the timing and camera direction to obtain an optimal image sequence using a predictive perception technique. This problem is typically approached as a Partially Observable Markov Decision Process (POMDP), or belief-space planning [10], [11], [12], [13], [14], where the planner chooses optimal actions under motion and sensing uncertainty.…”
Section: Introduction (mentioning)
confidence: 99%