2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00913
|View full text |Cite
|
Sign up to set email alerts
|

Gradient Forward-Propagation for Large-Scale Temporal Video Modelling

Abstract: How can neural networks be trained on large-volume temporal data efficiently? To compute the gradients required to update parameters, backpropagation blocks computations until the forward and backward passes are completed. For temporal signals, this introduces high latency and hinders real-time learning. It also creates a coupling between consecutive layers, which limits model parallelism and increases memory consumption. In this paper, we build upon Sideways, which avoids blocking by propagating approximate g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 46 publications
0
1
0
Order By: Relevance
“…Sparse Network [14] is applied to image recognition models but can only save the memory theoretically. Sideways [35,36] reduce the memory cost by overwriting activations whenever new ones become available but can only be applied to causal models. Regarding video specific methods, a popular paradigm for training temporal action detectors is to build the model upon pre-extracted features for temporal modeling and reasoning ("Freeze Backbone" in Fig.…”
Section: Related Workmentioning
confidence: 99%
“…Sparse Network [14] is applied to image recognition models but can only save the memory theoretically. Sideways [35,36] reduce the memory cost by overwriting activations whenever new ones become available but can only be applied to causal models. Regarding video specific methods, a popular paradigm for training temporal action detectors is to build the model upon pre-extracted features for temporal modeling and reasoning ("Freeze Backbone" in Fig.…”
Section: Related Workmentioning
confidence: 99%