2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.01404
|View full text |Cite
|
Sign up to set email alerts
|

Improving Action Segmentation via Graph-Based Temporal Reasoning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
48
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 97 publications
(51 citation statements)
references
References 55 publications
0
48
0
Order By: Relevance
“…Recently, models that use graph convolutional networks have shown promise. Examples include those proposed by Zeng et al (2019) [44] and Huang et al (2020) [244].…”
Section: ) Temporal Action Localization/detection Modelsmentioning
confidence: 99%
“…Recently, models that use graph convolutional networks have shown promise. Examples include those proposed by Zeng et al (2019) [44] and Huang et al (2020) [244].…”
Section: ) Temporal Action Localization/detection Modelsmentioning
confidence: 99%
“…In addition, there exist several approaches to improve the performance of action segmentation models such as MS-TCN [3,26,9,10]. Chen et al [3] proposed to apply selfsupervised domain adaptation techniques when training a model such as MS-TCN, and it exploits unlabeled videos to boost the performance of action segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…Chen et al [3] proposed to apply selfsupervised domain adaptation techniques when training a model such as MS-TCN, and it exploits unlabeled videos to boost the performance of action segmentation. Wang et al [26] suggested a framework named boundary-aware cascade network (BCN), and Yifei et al [9] suggested a graphbased temporal reasoning module (GTRM). These [26,9] can be easily attached to various action segmentation models to improve performance.…”
Section: Related Workmentioning
confidence: 99%
“…NS-CL builds an object-based scene representation and translates sentences into symbolic programs, allowing question and answering about the elements of the scene. Another work used a graph-based network module to detect action in videos, by reasoning over the temporal relations present in each video [16].…”
Section: Neuro-symbolic Modelsmentioning
confidence: 99%