Dynamic Bayesian networks for meeting structuring

Dielmann, Alfred; Renals, Steve

doi:10.1109/icassp.2004.1327189

Cited by 48 publications

(45 citation statements)

References 7 publications

“…Previously, we have outlined a meeting action recognition framework based on acoustic and lexical related features and a layered multistream dynamic Bayesian network model [19], [21]. This model combines the advantages of independent feature-stream processing together with a structured approach.…”

Section: B Group Action Recognition (mentioning)

confidence: 99%

“…The vector consists of all possible products of the six sound activity locations during a time window of three frames [19] where each vector highlights the turn taking interaction pattern around the time . Considering, for simplicity, a smaller turn taking matrix evaluated only on two frames the diagonal elements highlight whether a speaker active at time , is still speaking at time .…”

Section: B Speaker Turn Features (mentioning)

confidence: 99%
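
The speaker turn feature quoted above (all products of the sound activity values over a short window of frames) can be illustrated with a minimal sketch. This is an assumption-laden reading of the quoted statement, not the authors' implementation: the function name `turn_taking_features`, the list-of-lists input layout, and the ordering of the output are all hypothetical, and the alignment of the window relative to the current frame is left to the caller because the quote does not preserve it.

```python
from itertools import product


def turn_taking_features(frames):
    """Return all cross-frame products of per-location activity values.

    `frames` is a list of consecutive frames (hypothetical layout); each
    frame is a list holding one activity value per sound location.
    One product is emitted for every way of picking one location per frame.
    """
    n_locations = len(frames[0])
    features = []
    # Enumerate every tuple of location indices, one index per frame.
    for locations in product(range(n_locations), repeat=len(frames)):
        value = 1.0
        for frame, loc in zip(frames, locations):
            value *= frame[loc]
        features.append(value)
    return features


# Six locations over three frames give 6 ** 3 = 216 products.
three_frames = [[1, 0, 0, 0, 0, 0]] * 3
print(len(turn_taking_features(three_frames)))

# Two-frame, two-location case: the "diagonal" entries (same location in
# both frames) are non-zero only if that location stays active.
print(turn_taking_features([[1, 0], [1, 0]]))
```

In the two-frame case the first and last entries play the role of the diagonal elements mentioned in the quote: they are non-zero only when the same location is active in both frames, while the off-diagonal entries respond to a change of active location.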

Automatic Meeting Segmentation Using Dynamic Bayesian Networks

Dielmann

Renals

2007

IEEE Trans. Multimedia

Abstract: Multiparty meetings are a ubiquitous feature of organizations, and there are considerable economic benefits that would arise from their automatic analysis and structuring. In this paper, we are concerned with the segmentation and structuring of meetings (recorded using multiple cameras and microphones) into sequences of group meeting actions such as monologue, discussion and presentation. We outline four families of multimodal features based on speaker turns, lexical transcription, prosody, and visual motion that are extracted from the raw audio and video recordings. We relate these low-level features to more complex group behaviors using a multistream modelling framework based on multistream dynamic Bayesian networks (DBNs). This results in an effective approach to the segmentation problem, resulting in an action error rate of 12.2%, compared with 43% using an approach based on hidden Markov models. Moreover, the multistream DBN developed here leaves scope for many further improvements and extensions.

“…Dielmann et al [15] proposed two approaches for meeting structuring from audio-only features using multilevel Dynamic Bayesian Networks (DBNs). The first DBN decomposed the group activities as sequences of sub-actions with no explicit meaning.…”

Section: Turn-taking Patterns (mentioning)

confidence: 99%

Analyzing Group Interactions in Conversations: a Review

Gática-Pérez

2006

2006 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems

Abstract: Multiparty face-to-face conversations in professional and social settings represent an emerging research domain for which automatic activity-based analysis is relevant for scientific and practical reasons. The activity patterns emerging from groups engaged in conversations are intrinsically multimodal and thus constitute interesting target problems for multistream and multisensor fusion techniques. In this paper, a summarized review of the literature on automatic analysis of group activities in face-to-face conversational settings is presented. A basic categorization of group activities is proposed based on their typical temporal scale, and existing works are then discussed for various types of activities and trends including addressing, turn taking, interest, and dominance.

“…Hierarchical Hidden Markov Models (HHMMs) and layered hidden Markov models have been used to model various phenomena that exhibit stochastic structures at several different levels in areas such as speech and text recognition, modeling of group actions in meetings and extracting context from video, [12], [15]- [18]. Zhang et al used a two-layer HMM to model individual and group actions during meetings in [16].…”

Section: Theoretical Background and Related Work (mentioning)

confidence: 99%

Layered HMM for Motion Intention Recognition

Aarno

Kragić

2006

2006 IEEE/RSJ International Conference on Intelligent Robots and Systems

Abstract: Acquiring, representing and modeling human skills is one of the key research areas in teleoperation, programming-by-demonstration and human-machine collaborative settings. One of the common approaches is to divide the task that the operator is executing into several subtasks in order to provide manageable modeling. In this paper we consider the use of a Layered Hidden Markov Model (LHMM) to model human skills. We evaluate a gestem classifier that classifies motions into basic action-primitives, or gestems. The gestem classifiers are then used in a LHMM to model a simulated teleoperated task. We investigate the online and offline classification performance with respect to noise, number of gestems, type of HMM and the available number of training sequences. We also apply the LHMM to data recorded during the execution of a trajectory-tracking task in 2D and 3D with a robotic manipulator in order to give qualitative as well as quantitative results for the proposed approach. The results indicate that the LHMM is suitable for modeling teleoperative trajectory-tracking tasks and that the difference in classification performance between one- and multi-dimensional HMMs for gestem classification is small. It can also be seen that the LHMM is robust w.r.t. misclassifications in the underlying gestem classifiers.
