2017
DOI: 10.1007/s10851-017-0766-9
|View full text |Cite
|
Sign up to set email alerts
|

Spatio-Temporal Scale Selection in Video Data

Abstract: This work presents a theory and methodology for simultaneous detection of local spatial and temporal scales in video data. The underlying idea is that if we process video data by spatio-temporal receptive fields at multiple spatial and temporal scales, we would like to generate hypotheses about the spatial extent and the temporal duration of the underlying spatio-temporal image structures that gave rise to the feature responses. For two types of spatio-temporal scale-space representations, (i) a non-causal Gau… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
2
1

Relationship

5
3

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 120 publications
(157 reference statements)
0
6
0
Order By: Relevance
“…When computing receptive field responses over multiple spatial and temporal scales, there is an issue about how the receptive field responses should be normalized so as to enable appropriate comparisons between receptive field responses at different scales. Issues of scale normalisation of the derivative based receptive fields defined from scalespace operations are treated in [36,58,59] regarding spatial receptive fields and in [21,38,55] regarding spatio-temporal receptive fields. a) Scale-normalized spatial receptive fields.…”
Section: Scale Normalisation Of Spatial and Spatio-temporal Receptive Fieldsmentioning
confidence: 99%
See 1 more Smart Citation
“…When computing receptive field responses over multiple spatial and temporal scales, there is an issue about how the receptive field responses should be normalized so as to enable appropriate comparisons between receptive field responses at different scales. Issues of scale normalisation of the derivative based receptive fields defined from scalespace operations are treated in [36,58,59] regarding spatial receptive fields and in [21,38,55] regarding spatio-temporal receptive fields. a) Scale-normalized spatial receptive fields.…”
Section: Scale Normalisation Of Spatial and Spatio-temporal Receptive Fieldsmentioning
confidence: 99%
“…Several of these figures are similar to figures in [19] , [20] and in [21] . These results have, however, also been cleaned by replacing the previous time-causal models in [19] , [20] with the much better time-causal theory in [21] , and updating the distribution parameter c from the previous default value to the value 2 found more suitable for computer vision algorithms that operate on these time-causal spatio-temporal receptive fields with real-time requirements of shorter temporal delays [38] .…”
Section: Introductionmentioning
confidence: 99%
“…of the scale-normalized spatio-temporal quasi quadrature entity (73) f (x, y, t) = g(x, y; s 0 ) g(t; τ 0 ) = 1 (2π) 3/2 s 0 √ τ 0 e −(x 2 +y 2 )/2s0 e −t 2 /2τ0 , for which the spatio-temporal scale-space representation is of the form (113) L(x, y, t; s 0 , τ 0 ) = g(x, y; s 0 + s) g(t; τ 0 + τ ), the spatial and temporal scale estimates will according to the theoretical analysis in (Lindeberg [44,45]) be given by ŝ = γ s s 0 2 − γ s s 0 , If we want to calibrate the spatial and temporal scale estimates such that the spatial and temporal scale estimates are equal to ŝ = s 0 and τ = τ 0 for a Gaussian blink of spatial extent s 0 and temporal duration τ 0 , we should therefore calibrate the phase compensated scale estimates ŝQ (x,y,t)L,comp and τQ (x,y,t)L,comp according to ŝQ the spatial scale estimate for a diffuse Gaussian edge will by combination of (106) with (120) be given by (123) ŝ = 2 + Γ s Γ s s 0 , whereas the temporal scale estimate for a Gaussian onset ramp will by combination of (107) with (121) be given by (124) τ = 1 + Γ τ Γ τ τ 0 .…”
Section: Dependency Of the Ratio Between The Scale Calibration Factorsmentioning
confidence: 99%
“…Specifically, such scale estimates transform in a scale-covariant way under spatial scaling transformations of the image domain, which is a highly desirable property of a scale selection mechanism, since it implies that the scale estimates will automatically follow local scale variations in the image data. Corresponding local scale selection mechanisms can also be expressed over temporal and spatio-temporal domains (Lindeberg [46,44,45]).…”
mentioning
confidence: 99%
“…On the other hand, Geert Willems, Tinne Tuytelaars and Luc van Gool proposed "An efficient dense and scale-invariant spatiotemporal-temporal interest point detector" [12]. Meantime, Tony Lindeberg proposed "Spatio-temporal scale selection in video data" [13]. Also, I. Everts, J. van Gemert and T. Gevers proposed "Evaluation of color spatiotemporal interest points for human action recognition" [14].…”
Section: Introductionmentioning
confidence: 99%