2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00399
|View full text |Cite
|
Sign up to set email alerts
|

BMN: Boundary-Matching Network for Temporal Action Proposal Generation

Abstract: Temporal action proposal generation is an challenging and promising task which aims to locate temporal regions in real-world videos where action or event may occur. Current bottom-up proposal generation methods can generate proposals with precise boundary, but cannot efficiently generate adequately reliable confidence scores for retrieving proposals. To address these difficulties, we introduce the Boundary-Matching (BM) mechanism to evaluate confidence scores of densely distributed proposals, which denote a pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
547
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 561 publications
(562 citation statements)
references
References 29 publications
4
547
0
Order By: Relevance
“…For clarification, +ODTC denotes first performing ordering-dependency regularization then executing taskconsistency method, while +TCOD is the other way round. (4) R-C3D [74], BSN [40] and BMN [39] with TC and OD. We further plugged our TC and OD methods into these action detection models to verify their generalization ability.…”
Section: Evaluation On Step Localizationmentioning
confidence: 99%
See 2 more Smart Citations
“…For clarification, +ODTC denotes first performing ordering-dependency regularization then executing taskconsistency method, while +TCOD is the other way round. (4) R-C3D [74], BSN [40] and BMN [39] with TC and OD. We further plugged our TC and OD methods into these action detection models to verify their generalization ability.…”
Section: Evaluation On Step Localizationmentioning
confidence: 99%
“…(2) Can the proposed task-consistency and orderingdependency methods be applied to other action detection models? Since our proposed TC and OD are two plug-and-play methods, we futher validate them on the R-C3D [74], BSN [40] and BMN [39] models. From Table 8 we can see that both TC and OD could improve the performance of various basic models, which further demonstrate the effectiveness of our proposed methods.…”
Section: Evaluation On Step Localizationmentioning
confidence: 99%
See 1 more Smart Citation
“…Instead, we employ an average pooling layer to turn thef into a series of consecutive basic momentsf base ∈ R L×d , where L ≪ T is the numbers of basic moments. With these low-resolution basic moments, we construct the 2D candidate map F 0 c ∈ R L×L×d as the candidate-level representation inspired from [20,48]. Specifically, we denote the (i, j) t h element of F 0 c as F 0 c i j ∈ R d .…”
Section: 32mentioning
confidence: 99%
“…CDC [16] predicts per-frame confidence scores using 3D convolutional neural networks. BSN [10] and BMN [9] adopt 2D convolutions to estimate actionness, starting time, and ending time at each frame. These methods can be applicable to informative channel identification by using their per-channel classification as a measure of channel importance.…”
Section: Related Workmentioning
confidence: 99%