Deep EndoVO: A recurrent convolutional neural network (RCNN) based visual odometry approach for endoscopic capsule robots

Turan, Mehmet; Almalioglu, Yasin; Araújo, Hélder; Konukoğlu, Ender; Sitti, Metin

doi:10.1016/j.neucom.2017.10.014

Cited by 106 publications

(64 citation statements)

References 36 publications


“…To solve this, we apply GANs that provide sharper and more accurate depth maps. The second issue of the aforementioned unsupervised techniques is the fact that they only employ CNNs that only analyse just-in-moment information to estimate camera pose [5], [7]. We address this issue by employing a CNN-RNN architecture to capture temporal relations across frames.…”

Section: Related Work (mentioning)

confidence: 99%

GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks

Almalioglu

Saputra

Gusmão

et al. 2019

2019 International Conference on Robotics and Automation (ICRA)

Self Cite


In the last decade, supervised deep learning approaches have been extensively employed in visual odometry (VO) applications, which is not feasible in environments where labelled data is not abundant. On the other hand, unsupervised deep learning approaches for localization and mapping in unknown environments from unlabelled data have received comparatively less attention in VO research. In this study, we propose a generative unsupervised learning framework that predicts 6-DoF pose camera motion and monocular depth map of the scene from unlabelled RGB image sequences, using deep convolutional Generative Adversarial Networks (GANs). We create a supervisory signal by warping view sequences and assigning the re-projection minimization to the objective loss function that is adopted in multi-view pose estimation and single-view depth generation network. Detailed quantitative and qualitative evaluations of the proposed framework on the KITTI [1] and Cityscapes [2] datasets show that the proposed method outperforms both existing traditional and unsupervised deep VO methods providing better results for both pose estimation and depth recovery.


“…As an emerging example, various diseases such as colorectal cancer and inflammatory bowel disease are diagnosed by the usage of swallowable capsule endoscopes, which are noninvasive, painless, suitable to be used for long duration screening purposes which can access difficult body parts (e.g., small intestines) better than standard endoscopy. Such benefits make swallowable, non-tethered capsule endoscopes an exciting alternative over standard endoscopy [1], [2].…”

Section: Introduction (mentioning)

confidence: 99%

Unsupervised Odometry and Depth Learning for Endoscopic Capsule Robots

Turan

Örnek

Ibrahimli

et al. 2018

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Self Cite


In the last decade, many medical companies and research groups have tried to convert passive capsule endoscopes, an emerging and minimally invasive diagnostic technology, into actively steerable endoscopic capsule robots which will provide more intuitive disease detection, targeted drug delivery and biopsy-like operations in the gastrointestinal (GI) tract. In this study, we introduce a fully unsupervised, real-time odometry and depth learner for monocular endoscopic capsule robots. We establish the supervision by warping view sequences and assigning the re-projection minimization to the loss function, which we adopt in the multi-view pose estimation and single-view depth estimation network. Detailed quantitative and qualitative analyses of the proposed framework performed on non-rigidly deformable ex-vivo porcine stomach datasets prove the effectiveness of the method in terms of motion estimation and depth recovery.


“…There is a need to develop more general information processing methods for classification and categorization across a broad range of data types. While many researchers have successfully used deep learning for classification problems (e.g., see [9,23,28,30,51]), the central problem remains as to which deep learning architecture (DNN, CNN, or RNN) and structure (how many nodes (units) and hidden layers) is more efficient for different types of data and applications. The favored approach to this problem is trial and error for the specific application and dataset.…”

Section: Introduction (mentioning)

confidence: 99%

RMDL

Kowsari

Heidarysafa

Brown

et al. 2018

Proceedings of the 2nd International Conference on Information System and Data Mining


The continually increasing number of complex datasets each year necessitates ever improving machine learning methods for robust and accurate categorization of these data. This paper introduces Random Multimodel Deep Learning (RMDL): a new ensemble, deep learning approach for classification. Deep learning models have achieved state-of-the-art results across many domains. RMDL solves the problem of finding the best deep learning structure and architecture while simultaneously improving robustness and accuracy through ensembles of deep learning architectures. RMDL can accept as input a variety of data, including text, video, images, and symbolic data. This paper describes RMDL and shows test results for image and text data including MNIST, CIFAR-10, WOS, Reuters, IMDB, and 20newsgroup. These test results show that RMDL produces consistently better performance than standard methods over a broad range of data types and classification problems.
