Optical Flow Estimation Using a Spatial Pyramid Network

Ranjan, Anurag; Black, Michael J.

doi:10.1109/cvpr.2017.291

Cited by 1,153 publications

(827 citation statements)

References 52 publications

Supporting

Mentioning

783

Contrasting

Unclassified

“…Results. We compare our approach with the state-of-the-art methods [43,34,21]. Table 4 shows that our method achieves improved performance on both datasets.…”

Section: Optical Flow Estimation (mentioning)

confidence: 99%

CrDoCo: Pixel-Level Domain Transfer With Cross-Domain Consistency

Chen, Lin, Yang, et al. 2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Figure 1: Applications of the proposed method. Our method has the applications ranging from semantic segmentation (top row), depth prediction (middle row), to optical flow estimation (bottom row).

Abstract: Unsupervised domain adaptation algorithms aim to transfer the knowledge learned from one domain to another (e.g., synthetic to real images). The adapted representations often do not capture pixel-level domain shifts that are crucial for dense prediction tasks (e.g., semantic segmentation). In this paper, we present a novel pixel-wise adversarial domain adaptation algorithm. By leveraging image-to-image translation methods for data augmentation, our key insight is that while the translated images between domains may differ in styles, their predictions for the task should be consistent. We exploit this property and introduce a cross-domain consistency loss that enforces our adapted model to produce consistent predictions. Through extensive experimental results, we show that our method compares favorably against the state-of-the-art on a wide variety of unsupervised domain adaptation tasks.

“…Analysis of the Results. Figure 4: On the left, we compare our flow network against the FlowNet [IMS*17] and SPyNet [RB17] by producing an HDR frame from the POKER FULLSHOT scene. Note that, we trained both the FlowNet and SPyNet networks in combination with our merge network (Sec.…”

Section: HDR Results (mentioning)

confidence: 99%

“…For the flow network, we build upon the hierarchical coarse-to-fine architecture, concurrently proposed by Ranjan and Black [RB17] and Wang et al. [WZK*17], and incorporate the three…”

mentioning

confidence: 99%

Deep HDR Video from Sequences with Alternating Exposures

Kalantari, Ramamoorthi 2019

Computer Graphics Forum

A practical way to generate a high dynamic range (HDR) video using off‐the‐shelf cameras is to capture a sequence with alternating exposures and reconstruct the missing content at each frame. Unfortunately, existing approaches are typically slow and are not able to handle challenging cases. In this paper, we propose a learning‐based approach to address this difficult problem. To do this, we use two sequential convolutional neural networks (CNN) to model the entire HDR video reconstruction process. In the first step, we align the neighboring frames to the current frame by estimating the flows between them using a network, which is specifically designed for this application. We then combine the aligned and current images using another CNN to produce the final HDR frame. We perform an end‐to‐end training by minimizing the error between the reconstructed and ground truth HDR images on a set of training scenes. We produce our training data synthetically from existing HDR video datasets and simulate the imperfections of standard digital cameras using a simple approach. Experimental results demonstrate that our approach produces high‐quality HDR videos and is an order of magnitude faster than the state‐of‐the‐art techniques for sequences with two and three alternating exposures.

“…By considering state of the art computer vision approaches [16], our model (average EPE for all the sequences, aEPE=0.71 pixel) performs better than some algorithms, e.g. FlowNetC (aEPE=0.93 pixel), but other algorithms outperform it, e.g.…”

Section: Results (mentioning)

confidence: 99%

“…Very few attempts have been made to incorporate these ideas into spatio-temporal filter based models, and given the recent growth in neuroscience, it is very interesting to revisit this model incorporating the new findings and examining the efficacy. Differently from FFV1MT and Spynet [16], which only rely on scale space for diffusion of non-local cues, our AMPD model provides a clue on the potential role played by the recurrent interactions in solving the blank wall problem by non local cue propagation. It is also worth noting that bilateral filtering based techniques are gaining popularity in semantic segmentation using convolutional neural networks.…”

Section: Results (mentioning)

confidence: 99%

Adaptive Motion Pooling and Diffusion for Optical Flow Computation

Medathati, Chessa, Masson, et al. 2017

New Trends in Image Analysis and Processing – ICIAP 2017

Abstract. We propose to extend a state of the art bio-inspired model for optic flow computation through adaptive processing by focusing on the role of local context indicative of the local velocity estimates reliability. We set a network structure representative of cortical areas V1, V2 and MT, and incorporate three functional principles observed in primate visual system: contrast adaptation, adaptive afferent pooling and MT diffusion that are adaptive dependent upon the 2D image structure (Adaptive Motion Pooling and Diffusion, AMPD). We assess the AMPD performance on Middlebury optical flow estimation dataset, showing that the proposed AMPD model performs better than the baseline one and its overall performance is comparable with many computer vision methods.
