“…Although many have used HoG/HoF descriptors [18,22,17,4], they aggregate them into a static signature, whereas our previous analysis and [36] suggest retaining their temporal evolution. However, rather than averaging by spatial binning (that presumes ergodicity), we prefer to use at least a crude approximation of the prior dP(g, w) in the form of samples {g(t_j)}, {w(x, t_j)} inferred during the training phase.…”
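
The contrast between a static signature and a retained temporal evolution can be illustrated with a small sketch. It is not the method of the quoted paper: the two-bin "histograms", the function names `static_signature` and `temporal_signature`, and the toy sequences are all hypothetical, chosen only to show that averaging over time can make two different motions indistinguishable.

```python
from typing import List, Sequence

Descriptor = Sequence[float]  # one per-frame histogram (hypothetical toy stand-in for HoG/HoF)


def static_signature(frames: Sequence[Descriptor]) -> List[float]:
    """Average the per-frame descriptors over time, discarding their order."""
    n = len(frames)
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / n for d in range(dim)]


def temporal_signature(frames: Sequence[Descriptor]) -> List[List[float]]:
    """Keep the per-frame descriptors as an ordered sequence."""
    return [list(f) for f in frames]


# Two toy sequences with the same frames in opposite temporal order.
rising = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
falling = list(reversed(rising))

# The time-averaged signature cannot tell the two sequences apart ...
same_static = static_signature(rising) == static_signature(falling)

# ... whereas the ordered sequence of descriptors can.
same_temporal = temporal_signature(rising) == temporal_signature(falling)

print(same_static, same_temporal)
```

In this toy case the averaged signatures coincide while the ordered sequences differ, which is the kind of information loss the excerpt attributes to aggregating descriptors into a static signature.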