2011 International Conference on Computer Vision
DOI: 10.1109/iccv.2011.6126233
Real-time indoor scene understanding using Bayesian filtering with motion cues

Abstract: We present a method whereby an embodied agent using visual perception can efficiently create a model of a local indoor environment from its experience of moving within it. Our method uses motion cues to compute likelihoods of indoor structure hypotheses, based on simple, generic geometric knowledge about points, lines, planes, and motion. We present a single-image analysis, not to attempt to identify a single accurate model, but to propose a set of plausible hypotheses about the structure of the environment fr…
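The abstract describes weighting competing structure hypotheses by how well they explain observed motion. A minimal sketch of that Bayes-filter reweighting step is below, assuming a generic per-hypothesis likelihood; it illustrates the idea only and is not the authors' implementation, and all numbers are placeholders.

```python
# Minimal sketch (not the authors' implementation) of the Bayesian
# filtering idea from the abstract: keep a posterior over a fixed set of
# indoor-structure hypotheses and reweight each by how well it explains
# the observed 2D motion of tracked features.
import numpy as np

def update_posterior(weights, likelihoods, eps=1e-12):
    """One Bayes-filter step: w_i <- w_i * p(observation | hypothesis_i)."""
    w = np.asarray(weights, dtype=float) * np.asarray(likelihoods, dtype=float)
    return w / max(w.sum(), eps)  # renormalize to a probability vector

# Hypothetical example with three competing layout hypotheses.
prior = np.full(3, 1.0 / 3.0)
# Placeholder likelihoods of the current frame's feature motions under
# each hypothesis; a real system derives these from scene geometry.
obs_likelihood = np.array([0.02, 0.30, 0.10])
print(update_posterior(prior, obs_likelihood))  # hypothesis 2 gains weight
```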


Cited by 58 publications (53 citation statements)
References 24 publications
“…This is because both proposed point sampling techniques ensure that points in each sample are chosen with high likelihood from the same planar surface, and that only one model per sample is built for the…

Table 1. ARI results (larger is better) on sequences from three different data sets: the Michigan indoor (Mich:) data set [31], the Michigan-Milan (MM:) data set [7], and a new data set collected on our campus (New:). The algorithms considered are: T-linkage (T-L), Manhattan World-constrained T-linkage (MW), Weak Manhattan World-constrained T-linkage (WMW) [26], MW with samples constrained by the orientation map (MW-OM), and MW with samples from regions grown around each point (MW-RS).…”
Section: Results
confidence: 99%
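For context on the metric quoted above: ARI is the Adjusted Rand Index, which compares a predicted clustering (here, point-to-plane assignments) against ground truth. A small illustrative computation with scikit-learn follows; the toy labels are invented, and the cited paper does not specify its tooling.

```python
# Illustrative only: how ARI scores like those in Table 1 are computed in
# principle (the cited paper does not specify its tooling). Requires
# scikit-learn.
from sklearn.metrics import adjusted_rand_score

# Toy ground-truth plane assignments for eight feature points, versus a
# hypothetical clustering output; both label vectors are invented.
truth = [0, 0, 0, 1, 1, 2, 2, 2]
pred = [0, 0, 1, 1, 1, 2, 2, 0]
print(adjusted_rand_score(truth, pred))  # 1.0 is perfect, ~0.0 is chance
```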
“…Images in this data set contain an average of 2.5 visible planes, with 36 feature points per plane on average. (2) The Michigan indoor data set [31], for which camera calibration was available (although substantial residual radial distortion had to be removed via manual calibration). On average, 4.5 planar surfaces are visible in each image, and 33 feature points were detected per plane.…”
Section: Data Sets
confidence: 99%
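The excerpt mentions removing residual radial distortion once calibration parameters were recovered. A minimal sketch of that undistortion step using OpenCV is below; the camera matrix and distortion coefficients are assumed placeholder values, not the data set's actual calibration.

```python
# Sketch of removing radial distortion once calibration parameters are
# known, using OpenCV. K and dist are assumed placeholder values, not the
# data set's actual calibration; "frame.png" is a hypothetical filename.
import cv2
import numpy as np

K = np.array([[600.0, 0.0, 320.0],   # fx, skew, cx (assumed)
              [0.0, 600.0, 240.0],   # fy, cy (assumed)
              [0.0, 0.0, 1.0]])
dist = np.array([-0.25, 0.08, 0.0, 0.0, 0.0])  # k1, k2, p1, p2, k3

img = cv2.imread("frame.png")
undistorted = cv2.undistort(img, K, dist)  # remap pixels to remove distortion
cv2.imwrite("frame_undistorted.png", undistorted)
```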
“…[3] already demonstrated that 3D superpixels can be reconstructed in real time. [12,36] estimated a multi-view layout, without labeling the image, in real time. Finally, although there is no experimental evidence of real-time performance for Data-Driven Primitives, the method consists of HOG feature extraction and SVM classification.…”
Section: Results
confidence: 99%
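The Data-Driven Primitives pipeline referenced above reduces to HOG feature extraction followed by SVM classification. A generic sketch of such a pipeline with scikit-image and scikit-learn follows, on synthetic patches; it is not the cited authors' code, and all data and labels are invented for illustration.

```python
# Generic sketch of a HOG-features-plus-SVM patch classifier, the kind of
# pipeline the excerpt attributes to Data-Driven Primitives; not the cited
# authors' code. Patches and labels here are synthetic.
import numpy as np
from skimage.feature import hog
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
patches = rng.random((20, 64, 64))          # 20 synthetic grayscale patches
labels = np.tile([0, 1], 10)                # hypothetical binary labels

# Extract a HOG descriptor per patch, then train a linear SVM on them.
X = np.array([hog(p, pixels_per_cell=(8, 8), cells_per_block=(2, 2))
              for p in patches])
clf = LinearSVC().fit(X, labels)
print(clf.predict(X[:3]))                   # classify the first few patches
```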
“…[14,16,37] used the Manhattan assumption to fill textureless gaps in sparse 3D reconstructions. [25,3,4] and [12,36] have used super-pixels and indoor scene understanding, respectively, to fill textureless gaps in sparse 3D reconstructions. Our contribution is to fuse the previously mentioned cues and a new one, data-driven primitives, in a dense variational formulation.…”
Section: Data-driven Depth Cues
confidence: 99%
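As a toy illustration of the Manhattan-world assumption these methods rely on: under it, surface normals are constrained to three mutually orthogonal scene axes, so a noisy normal estimate can be snapped to the nearest axis. The sketch below assumes axis-aligned world coordinates and is a simplification, not any cited method's algorithm.

```python
# Toy illustration of the Manhattan-world assumption: snap a noisy surface
# normal to the nearest of three orthogonal scene axes (assumed here to be
# the coordinate axes). A simplification of the cited methods.
import numpy as np

AXES = np.eye(3)  # the three dominant Manhattan directions

def snap_to_manhattan(normal):
    n = normal / np.linalg.norm(normal)
    axis = AXES[np.argmax(np.abs(AXES @ n))]  # best |cosine| match
    return np.sign(n @ axis) * axis           # preserve orientation sign

print(snap_to_manhattan(np.array([0.9, 0.1, -0.05])))  # -> [1. 0. 0.]
```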
“…For an on-line mobile agent that perceives its local environment through a temporally continuous stream of images (e.g., a video), Tsai et al. [11] generate a set of hypotheses from the first frame of the video and use a Bayesian filter to evaluate the hypotheses on-line based on their ability to explain the 2D motions of a set of tracked features. Tsai and Kuipers [10] extended the real-time scene understanding method to generate child hypotheses on-line from existing hypotheses to describe the scene in more detail.…”
Section: Introduction
confidence: 99%
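The evaluation step this excerpt attributes to Tsai et al. [11] scores each hypothesis by how well it explains the 2D motion of tracked features. A hedged sketch of one plausible such likelihood is below, using a simple Gaussian model over reprojection residuals; the exact likelihood of [11] is not given here, and the coordinates are placeholders.

```python
# Hedged sketch of the evaluation step described above: score a layout
# hypothesis by how well its predicted 2D feature positions match the
# tracked ones, under a simple Gaussian error model. This is our
# assumption for illustration, not the exact likelihood of [11].
import numpy as np

def motion_likelihood(predicted_pts, tracked_pts, sigma=2.0):
    """Likelihood of tracked 2D points given a hypothesis' predictions."""
    residuals = np.linalg.norm(predicted_pts - tracked_pts, axis=1)  # pixels
    return np.exp(-0.5 * (residuals / sigma) ** 2).prod()

tracked = np.array([[100.0, 50.0], [200.0, 80.0]])   # placeholder tracks
hyp_a = np.array([[101.0, 49.0], [199.0, 81.0]])     # explains motion well
hyp_b = np.array([[110.0, 60.0], [190.0, 70.0]])     # explains it poorly
print(motion_likelihood(hyp_a, tracked), motion_likelihood(hyp_b, tracked))
```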