2014
DOI: 10.1016/j.image.2014.03.004

Stereo object tracking with fusion of texture, color and disparity information

Abstract: A novel method for visual object tracking in stereo videos is proposed, which fuses an appearance-based representation of the object based on Local Steering Kernel features with 2D color-disparity histogram information. The algorithm employs Kalman filtering for object position prediction and a sampling technique for selecting the candidate object regions of interest in the left and right channels. Disparity information is exploited for matching corresponding regions in the left and right video frames. As trac…
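The abstract mentions Kalman filtering for predicting the object's position between frames. A minimal sketch of that idea, using a constant-velocity motion model, is shown below; the state layout, noise covariances, and class name are illustrative assumptions, not the paper's actual parameters.

```python
import numpy as np

class KalmanTracker:
    """Constant-velocity Kalman filter for 2D position prediction.

    State vector: [x, y, vx, vy]. Only the position (x, y) is observed,
    e.g. from the tracker's appearance-based detection in each frame.
    """

    def __init__(self, x0, y0, dt=1.0):
        self.x = np.array([x0, y0, 0.0, 0.0])           # initial state
        self.P = np.eye(4) * 10.0                       # state covariance
        self.F = np.array([[1, 0, dt, 0],
                           [0, 1, 0, dt],
                           [0, 0, 1,  0],
                           [0, 0, 0,  1]], dtype=float)  # motion model
        self.H = np.array([[1, 0, 0, 0],
                           [0, 1, 0, 0]], dtype=float)   # observe position only
        self.Q = np.eye(4) * 0.01                        # process noise (assumed)
        self.R = np.eye(2) * 1.0                         # measurement noise (assumed)

    def predict(self):
        """Propagate the state one frame ahead; returns predicted (x, y)."""
        self.x = self.F @ self.x
        self.P = self.F @ self.P @ self.F.T + self.Q
        return self.x[:2]

    def update(self, zx, zy):
        """Correct the state with a measured position (zx, zy)."""
        z = np.array([zx, zy])
        innovation = z - self.H @ self.x
        S = self.H @ self.P @ self.H.T + self.R
        K = self.P @ self.H.T @ np.linalg.inv(S)         # Kalman gain
        self.x = self.x + K @ innovation
        self.P = (np.eye(4) - K @ self.H) @ self.P
```

In a stereo setting such as the paper's, one such filter per channel (or a shared filter on rectified coordinates) can supply the predicted region around which candidate regions of interest are sampled.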

Cited by 18 publications (17 citation statements)
References 56 publications
“…The video stream is first segmented into shots. Then, face detection and tracking [28] [29] [30] is performed on the segmented video clips. Clustering is applied on the extracted facial images, in order to determine which images belong to the same character.…”
Section: Related Work A. Video Summarization
confidence: 99%
“…As in the case of speakers, each face appearance consists simply of a video segment that starts and ends at the temporal boundaries of an uninterrupted face appearance. Such data may have been acquired through the successive application of face detection [52], face tracking [53], face clustering [54] and label propagation [55] algorithms.…”
Section: Multimodal Shot Pruning (MSP)
confidence: 99%
“…As in the case of speakers, each face appearance consists simply of a video segment that starts and ends at the temporal boundaries of an uninterrupted face appearance. Such data may have been acquired through the successive application of face detection [12], face tracking [13], face clustering [14] and label propagation [15] algorithms. Despite these algorithmic prerequisites, no extra data modalities (such as the movie script) are required, beyond the film itself.…”
Section: Multimodal Shot Pruning (MSP)
confidence: 99%
“…The intuition behind the modalities fusion is that one can perform a similar to speaker diarization analysis upon the visual data: face clustering. In more detail, assume that faces are detected in the frames of a movie and then the detected faces are tracked over time, resulting in a number of video facial trajectories [13], [2], [5]. A representative face is selected to represent a facial trajectory.…”
Section: Introduction
confidence: 99%