“…This localization solution has the advantages of being both cheap and ubiquitous, and has the potential to provide position and orientation (pose) estimates which are on-par in terms of accuracy with more expensive sensors such as LiDAR. To date, various algorithms are available for VINS problems including visual-inertial (VI)-SLAM [19,45] and visual-inertial odometry (VIO) [30,29,22], such as the extended Kalman filter (EKF) [30,20,14,22,17,16,50,37], unscented Kalman filter (UKF) [10,4], and batch or slidingwindow optimization methods [46,18,21,33,52,45,40], among which the EKF-based approaches remain arguably the most popular for resource constrained devices because of their efficiency. While current approaches can perform well over a short period of time in a small-scale environment (e.g., see [13,22,15]), they are not robust and accurate enough for long-term, large-scale deployments in challenging environments, due to their limited available resources of sensing, memory and computation, which, if not properly addressed, often result in short mission duration or intractable real-time estimator performance.…”