“…However, thus far, the best approach for action recognition is arguably the DT approach, which is based on descriptions of the trajectories of densely sampled, tracked feature points. When obtaining these trajectories, the following spatiotemporal features are used: the trajectory histograms of oriented gradients (… Following the introduction of the original DT, dense sampling approaches for action recognition were also proposed in [15,5,14,7,18]. These studies improved the DT in various ways, for example, by introducing mid-level trajectory clustering (… Recent approaches have assigned human-object interactions to the IDT framework (Zhou et al., 2014, 2015).…”