2012
DOI: 10.1007/s11042-012-1085-1

Multimodal extraction of events and of information about the recording activity in user generated videos

Abstract: In this work we propose methods that exploit context sensor data modalities for the task of detecting interesting events and extracting high-level contextual information about the recording activity in user generated videos. Indeed, most camera-enabled electronic devices contain various auxiliary sensors such as accelerometers, compasses, GPS receivers, etc. Data captured by these sensors during the media acquisition have already been used to limit camera degradations such as shake and also to provide some bas…


Cited by 12 publications (27 citation statements)
References 33 publications (46 reference statements)
“…This information is used to annotate the generated video stream with motion and environmental information, such as the position of the device. Based on these data, user behavior that leads to degraded video quality, such as camera shaking or a wrong orientation of the phone, can also be detected directly on the user's device without analyzing (or transmitting) the video data itself [5]. In previous work [10], we found that degradations such as camera shake have a large impact on the QoE.…”
Section: Prediction of Quality of Experience
confidence: 94%
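The on-device shake detection described in this citation can be sketched from accelerometer data alone: a rolling standard deviation of the acceleration magnitude rises sharply when the camera shakes. This is a minimal illustrative sketch, not the method from the cited papers; the window size and threshold are assumed values for demonstration only.

```python
import math

def shake_scores(accel_samples, window=30):
    """Rolling standard deviation of acceleration magnitude.

    accel_samples: list of (ax, ay, az) tuples in m/s^2, sampled at a
    fixed rate. Returns one score per sliding window; high values
    suggest camera shake, near-zero values suggest a steady hold.
    """
    mags = [math.sqrt(ax * ax + ay * ay + az * az)
            for ax, ay, az in accel_samples]
    scores = []
    for i in range(len(mags) - window + 1):
        w = mags[i:i + window]
        mean = sum(w) / window
        var = sum((m - mean) ** 2 for m in w) / window
        scores.append(math.sqrt(var))
    return scores

def is_shaky(accel_samples, threshold=2.0):
    """Flag a clip segment as shaky if any window exceeds the threshold."""
    scores = shake_scores(accel_samples)
    return bool(scores) and max(scores) > threshold
```

Because only a handful of floating-point samples per window are processed, such a check is cheap enough to run continuously on the recording device, which is exactly why the citation notes that the video itself need not be analyzed or transmitted.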
“…Also, software-based sensors indicate the current video recording properties or the energy level of the device. Here, patterns can be used to identify recording degradations based on sensor readings, as shown by CriCri [5].…”
Section: Composition and Orchestration Service
confidence: 99%
“…MoviMash is an advanced existing approach for live video composition. CriCri [3] shows a first step towards replacing video analysis with mechanisms that leverage the different sensors in the recording devices to compose the video. Without considering visual features, the resulting quality of such approaches degrades [4], but the speed increases.…”
Section: Background and Related Work
confidence: 99%
“…Common artifacts in compressing video streams, such as blockiness, are mitigated by the agreed video parameters (frame rate, resolution, and target bitrate). Thus, CrowdRecord investigates degradations commonly occurring during recording with retail smartphones, to (1) ensure the correct camera orientation, (2) identify and measure camera shaking, (3) identify the field of interest of a recording, (4) find under-/overexposed frame regions, and (5) measure frame sharpness. Streams that fail in one of these categories are not considered for composition.…”
Section: CrowdRecord
confidence: 99%
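Two of the per-frame checks listed above, exposure (4) and sharpness (5), can be sketched with standard image statistics: counting pixels near pure black or white, and taking the variance of a Laplacian response (low variance suggests blur). This is a generic illustration with NumPy, not CrowdRecord's actual implementation; the cut-off values are assumptions chosen for the example.

```python
import numpy as np

def exposure_ok(gray, low=16, high=239, max_bad_fraction=0.5):
    """Reject frames where too many pixels are near black or white.

    gray: 2-D uint8 array (grayscale frame). Returns True when the
    fraction of clipped pixels is at most max_bad_fraction.
    """
    bad = float(np.mean((gray < low) | (gray > high)))
    return bad <= max_bad_fraction

def sharpness(gray):
    """Variance of a simple 4-neighbour Laplacian response.

    A flat or blurred frame yields a value near zero; detailed,
    in-focus frames yield large values.
    """
    g = gray.astype(np.float64)
    lap = (-4 * g[1:-1, 1:-1]
           + g[:-2, 1:-1] + g[2:, 1:-1]
           + g[1:-1, :-2] + g[1:-1, 2:])
    return float(lap.var())
```

In a pipeline like the one described, frames failing either check would simply exclude their stream from composition; thresholds for the sharpness score would have to be calibrated per device, since absolute Laplacian variance depends on sensor noise and scene content.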