Michal Havlena scite author profile

Estimating a depth map from multiple views of a scene is a fundamental task in computer vision. As soon as more than two viewpoints are available, one faces the very basic question how to measure similarity across >2 image patches. Surprisingly, no direct solution exists, instead it is common to fall back to more or less robust averaging of two-view similarities. Encouraged by the success of machine learning, and in particular convolutional neural networks, we propose to learn a matching function which directly maps multiple image patches to a scalar similarity score. Experiments on several multi-view datasets demonstrate that this approach has advantages over methods based on pairwise patch similarity.

show abstract

Predicting Matchability

Hartmann

Havlena

Schindler

2014

View full text Add to dashboard Cite

From Google Street View to 3D city models

Torii

Havlena

Pajdla

2009

View full text Add to dashboard Cite

Efficient Structure from Motion by Graph Optimization

Havlena

Torii

Pajdla

2010

View full text Add to dashboard Cite

Abstract. We present an efficient structure from motion algorithm that can deal with large image collections in a fraction of time and effort of previous approaches while providing comparable quality of the scene and camera reconstruction. First, we employ fast image indexing using large image vocabularies to measure visual overlap of images without running actual image matching. Then, we select a small subset from the set of input images by computing its approximate minimal connected dominating set by a fast polynomial algorithm. Finally, we use task prioritization to avoid spending too much time in a few difficult matching problems instead of exploring other easier options. Thus we avoid wasting time on image pairs with low chance of success and avoid matching of highly redundant images of landmarks. We present results for several challenging sets of thousands of perspective as well as omnidirectional images.

show abstract

Beyond Novelty Detection: Incongruent Events, When General and Specific Classifiers Disagree

Weinshall

Zweig

Heřmanský

et al. 2012

IEEE Trans. Pattern Anal. Mach. Intell.

View full text Add to dashboard Cite

Unexpected stimuli are a challenge to any machine learning algorithm. Here, we identify distinct types of unexpected events when general-level and specific-level classifiers give conflicting predictions. We define a formal framework for the representation and processing of incongruent events: Starting from the notion of label hierarchy, we show how partial order on labels can be deduced from such hierarchies. For each event, we compute its probability in different ways, based on adjacent levels in the label hierarchy. An incongruent event is an event where the probability computed based on some more specific level is much smaller than the probability computed based on some more general level, leading to conflicting predictions. Algorithms are derived to detect incongruent events from different types of hierarchies, different applications, and a variety of data types. We present promising results for the detection of novel visual and audio objects, and new patterns of motion in video. We also discuss the detection of Out-Of- Vocabulary words in speech recognition, and the detection of incongruent events in a multimodal audiovisual scenario.

show abstract

Recent developments in large-scale tie-point matching

Hartmann

Havlena

Schindler

2016

ISPRS Journal of Photogrammetry and Remote Sensing

View full text Add to dashboard Cite

12 3 4 5

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Michal Havlena

Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition

Large-Scale Location Recognition and the Geometric Burstiness Problem

Learned Multi-patch Similarity

Predicting Matchability

From Google Street View to 3D city models

Efficient Structure from Motion by Graph Optimization

Beyond Novelty Detection: Incongruent Events, When General and Specific Classifiers Disagree

Recent developments in large-scale tie-point matching

Contact Info

Product

Resources

About