2014
DOI: 10.1117/12.2050413
|View full text |Cite
|
Sign up to set email alerts
|

Activity recognition using Video Event Segmentation with Text (VEST)

Abstract: Multi-Intelligence (multi-INT) data includes video, text, and signals that require analysis by operators. Analysis methods include information fusion approaches such as filtering, correlation, and association. In this paper, we discuss the Video Event Segmentation with Text (VEST) method, which provides event boundaries of an activity to compile related message and video clips for future interest. VEST infers meaningful activities by clustering multiple streams of time-sequenced multi-INT intelligence data and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4
1

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 15 publications
0
3
0
Order By: Relevance
“…Although POL/AD is comprehensive, the approach is unfortunately too expensive to be implemented on fog nodes which host edge units' data streams. Labeling partial video segments rather than bounding boxes in video frames, anomaly analysis segments the video where an action such as moving, stealing, or incident [17], [42]. Video range labeling using a refined Recurrent Neural Network (RNN) also translates to a more accurate rare instance detection along with outputs of the bounding box around the anomalous object [26].…”
Section: B Safety Modeling and Anomaly Detectionmentioning
confidence: 99%
“…Although POL/AD is comprehensive, the approach is unfortunately too expensive to be implemented on fog nodes which host edge units' data streams. Labeling partial video segments rather than bounding boxes in video frames, anomaly analysis segments the video where an action such as moving, stealing, or incident [17], [42]. Video range labeling using a refined Recurrent Neural Network (RNN) also translates to a more accurate rare instance detection along with outputs of the bounding box around the anomalous object [26].…”
Section: B Safety Modeling and Anomaly Detectionmentioning
confidence: 99%
“…Individual video clips are exploited for multiple mover tracking, player classification, and team relationships. The text information helps in the segmenting of important activities as surrounded by events boundaries (Holloway, et al, 2014). Note that the same stimulus could be used by a coach trying to instruct lessons learned to his/her players (offense and defense) as well as game highlights.…”
Section: Example 1: Quest Narrativementioning
confidence: 99%
“…The user is looking at video that includes target tracking, space-time correlation, and clustering. Graphical fusion aids in text-to-video association for event and activity based intelligence (ABI) detection [30]. ABI [31] can support a User-Defined Operating Picture (UDOP) but requires visualization of graphical information fusion results that link the text-and tracking-derived objects graphs.…”
Section: Introductionmentioning
confidence: 99%