2019
DOI: 10.48550/arxiv.1909.09278
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Forecasting Future Action Sequences with Neural Memory Networks

Abstract: We propose a novel neural memory network based framework for future action sequence forecasting. This is a challenging task where we have to consider short-term, within sequence relationships as well as relationships in between sequences, to understand how sequences of actions evolve over time. To capture these relationships effectively, we introduce neural memory networks to our modelling scheme. We show the significance of using two input streams, the observed frames and the corresponding action labels, whic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
1
1
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 17 publications
0
5
0
Order By: Relevance
“…An extension of the initial work [100] used the temporal point process for the same task [105], where the intensity parameter λ is dynamically calculated from past observations as Though the Poisson process has been known for modelling time arrival events, not all researchers adopt it. An alternative research direction modeled time as a real-valued output directly from regression models, e.g., using an MLP to generate the output [1,2,30,61,101] as shown in Figure 3.8: For any nextstep action, the "length of future action" and "remaining length of current action" will be regressed from fully-connected layers. In comparison to previous Poisson methods [100,105], these approaches choose deterministic time models rather than the Poisson distribution and therefore their outputs for a given input will be fixed.…”
Section: Add_milk Crack_egg Add_buttermentioning
confidence: 99%
See 2 more Smart Citations
“…An extension of the initial work [100] used the temporal point process for the same task [105], where the intensity parameter λ is dynamically calculated from past observations as Though the Poisson process has been known for modelling time arrival events, not all researchers adopt it. An alternative research direction modeled time as a real-valued output directly from regression models, e.g., using an MLP to generate the output [1,2,30,61,101] as shown in Figure 3.8: For any nextstep action, the "length of future action" and "remaining length of current action" will be regressed from fully-connected layers. In comparison to previous Poisson methods [100,105], these approaches choose deterministic time models rather than the Poisson distribution and therefore their outputs for a given input will be fixed.…”
Section: Add_milk Crack_egg Add_buttermentioning
confidence: 99%
“…To obtain the whole scope of the future, results of next-step prediction are reused as input for the next-next. This procedure repeats until it reaches the end of sequence (e.g., [30,101,105,110]). Alternatively, other approaches generate the rest of the sequence in one shot (e.g., [1,183]).…”
Section: Add_milk Crack_egg Add_buttermentioning
confidence: 99%
See 1 more Smart Citation
“…Action recognition can be broadly grouped into two main categories: Discrete (Delaitre et al, 2010;Simonyan and Zisserman, 2014;Gammulle et al, 2017Gammulle et al, , 2019b and continuous fine-grained action recognition (Lea et al, 2017;Ni et al, 2014;Zhou et al, 2014Zhou et al, , 2015Gammulle et al, 2020Gammulle et al, , 2019a approaches. Discrete action recognition is performed on pre-segmented video sequences containing one action per video while continuous fine-grained models are evaluated on untrimmed video sequences containing more than one action per video.…”
Section: Related Workmentioning
confidence: 99%
“…Despite the widespread use of these tools in the behavioral neuroscience community, there is no existing research attempting to forecast future behaviors from behavioral video data. There is a large machine learning literature dedicated to time series forecasting in the context of human activity, from forecasting of future human poses [33, 34, 35] to prediction of specific activities [36, 37, 38, 39, 40, 41, 42, 43, 44, 45], as well as prediction of future semantic information in video [46]. Given that locomotion patterns are implicated in – and defining criteria of – neurobehavioral disorders [47, 48], it is useful to predict future behaviors from behavioral video, both in order to understand the antecedent patterns that yield adverse (or favorable) behaviors and to automatically intervene in the experiment in real-time.…”
Section: Introductionmentioning
confidence: 99%