This study proposes a novel approach for a vision-based navigation problem using semantically segmented aerial images generated by a convolutional neural network. Vision-based navigation provides a position solution by matching an aerial image to a georeferenced database, and it has been increasingly studied for global navigation satellite system–denied environments. Aerial images include a vast amount of information that infers the position where they are located. However, it also includes features that disturb the estimation accuracy. The progress of convolutional neural network may provide a promising solution for extracting only helpful features for this purpose. Therefore, segmented images are modeled as a Gaussian mixture model, and the [Formula: see text] distance for a quantitative discrepancy between two images is established. This allows us to compare the two images quickly with improved accuracy. In addition, a framework of a particle filter is applied to estimate the position using an inertial navigation system. It employs the [Formula: see text] distance as a measurement, and the particles tend to converge to the true position. Flight test experiments were conducted to verify that the proposed approach achieved distance error of less than 10 m.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.