2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.291
|View full text |Cite
|
Sign up to set email alerts
|

Optical Flow Estimation Using a Spatial Pyramid Network

Abstract: We learn to compute optical flow by combining a classical spatial-pyramid formulation with deep learning. This estimates large motions in a coarse-to-fine approach by warping one image of a pair at each pyramid level by the current flow estimate and computing an update to the flow. Instead of the standard minimization of an objective function at each pyramid level, we train one deep network per level to compute the flow update. Unlike the recent FlowNet approach, the networks do not need to deal with large mot… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

3
783
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 1,153 publications
(827 citation statements)
references
References 52 publications
3
783
0
1
Order By: Relevance
“…Results. We compare our approach with the state-of-theart methods [43,34,21]. Table 4 shows that our method achieves improved performance on both datasets.…”
Section: Optical Flow Estimationmentioning
confidence: 99%
“…Results. We compare our approach with the state-of-theart methods [43,34,21]. Table 4 shows that our method achieves improved performance on both datasets.…”
Section: Optical Flow Estimationmentioning
confidence: 99%
“…Analysis of the Results Figure 4: On the left, we compare our flow network against the FlowNet [IMS * 17] and SPyNet [RB17] by producing an HDR frame from the POKER FULLSHOT scene. Note that, we trained both the FlowNet and SPyNet networks in combination with our merge network (Sec.…”
Section: Hdr Resultsmentioning
confidence: 99%
“…For the flow network, we build upon the hierarchical coarse-to-fine architecture, concurrently proposed by Ranjan and Black[RB17] andWang et al [WZK * 17], and incorporate the three c 2019 The Author(s) Computer Graphics Forum c 2019 The Eurographics Association and John Wiley & Sons Ltd.…”
mentioning
confidence: 99%
“…By considering state of the art computer vision approaches [16], our model (average EPE for all the sequences, aEPE=0.71 pixel) performs better than some algorithms, e.g. FlowNetC (aEPE=0.93 pixel), but other algorithms outperform it, e.g.…”
Section: Resultsmentioning
confidence: 99%
“…Very few attempts have been made to incorporate these ideas into spatio-temporal filter based models, and given the recent growth in neuroscience, it is very interesting to revisit this model incorporating the new findings and examining the efficacy. Differently from FFV1MT and Spynet [16], which only rely on scale space for diffusion of non-local cues, our AMPD model provides a clue on the potential role played by the recurrent interactions in solving the blank wall problem by non local cue propagation. It is also worth noting that bilateral filtering based techniques are gaining popularity in semantic segmentation using convolutional neural networks.…”
Section: Resultsmentioning
confidence: 99%