MUMBAI: multi-person, multimodal board game affect and interaction analysis dataset

Doyran, Metehan; Baki, Pinar; Ergin, Kübra; Türkmen, Batıkan; Salah, Alkım Almila Akdağ; Bakkes, Sander; Kaya, Heysem; Poppe, Ronald; Salah, Albert Ali

doi:10.1007/s12193-021-00364-0

Cited by 22 publications

(16 citation statements)

References 68 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Usually, third-view datasets consist of structured interactions where participants need to follow basic directives which favor spontaneous and fluent interactions. Despite the fact that conversations are the most common interaction structure, there are datasets which aim at fostering specific social signals like leadership, competitiveness, empathy, or affect, and therefore engage the participants in competitive/cooperative scenarios (Hung and Chittaranjan, 2010;Sanchez-Cortes et al, 2012;Rehg et al, 2013;Ringeval et al, 2013;Vella and Paggio, 2013;Bambach et al, 2015;Salter et al, 2015;Edwards et al, 2016;Beyan et al, 2016;Georgakis et al, 2017;Doyran et al, 2021;. Other datasets, instead, record in-the-wild interactions during the so-called cocktail parties (Alameda-Pineda et al, 2016;Cabrera-Quiros et al, 2018) and represent very interesting benchmarks to study group dynamics.…”

Section: Datasetsmentioning

confidence: 99%

“…The latter is considerably less frequent due to its tedious manual annotation process (McCowan et al, 2005;Douglas-Cowie et al, 2007;McKeown et al, 2010;Lücking et al, 2012;Vella and Paggio, 2013;Vandeventer et al, 2015;Naim et al, 2015;Chou et al, 2017;Paggio and Navarretta, 2017;Cafaro et al, 2017;Joo et al, 2019b;Kossaifi et al, 2019;Chen et al, 2020;Khan et al, 2020;. The most frequent low-level annotations that the datasets provide are the participants' body poses and facial expressions (Douglas-Cowie et al, 2007;Rehg et al, 2013;Bilakhia et al, 2015;Vandeventer et al, 2015;Naim et al, 2015;Edwards et al, 2016;Cafaro et al, 2017;Feng et al, 2017;Georgakis et al, 2017;Paggio and Navarretta, 2017;Bozkurt et al, 2017;Andriluka et al, 2018;von Marcard et al, 2018;Mehta et al, 2018;Lemaignan et al, 2018;Joo et al, 2019b;Kossaifi et al, 2019;Schiphorst et al, 2020;Doyran et al, 2021;. Given their annotation complexity, they are usually automatically retrieved with tools like OpenPose (Cao et al, 2019), and manually fixed or discarded.…”

Section: Datasetsmentioning

confidence: 99%

“…Indeed, some of the datasets have been complementary annotated and added in posterior studies. As a result, most common high-level labels consist of elicited emotions (McCowan et al, 2005;Douglas-Cowie et al, 2007;van Son et al, 2008;McKeown et al, 2010;Naim et al, 2015;Vandeventer et al, 2015;Chou et al, 2017;Paggio and Navarretta, 2017;Maman et al, 2020;Doyran et al, 2021), action labels (Soomro et al, 2012;Yonetani et al, 2016;Silva et al, 2018;Abebe et al, 2018;Carreira et al, 2019;Zhao et al, 2019;Schiphorst et al, 2020;Monfort et al, 2020;Martín-Martín et al, 2021), and social cues/signals (Hung and Chittaranjan, 2010;Sanchez-Cortes et al, 2012;Ringeval et al, 2013;Vandeventer et al, 2015;Shukla et al, 2016;Bozkurt et al, 2017;Cafaro et al, 2017;Feng et al, 2017;Lemaignan et al, 2018;Cabrera-Quiros et al, 2018;Celiktutan et al, 2019;Chen et al, 2020;Maman et al, 2020).…”

Section: Datasetsmentioning

confidence: 99%

See 2 more Smart Citations

Didn't see that coming: a survey on non-verbal social human behavior forecasting

Barquero¹,

Núñez²,

Escalera³

et al. 2022

Preprint

View full text Add to dashboard Cite

Non-verbal social human behavior forecasting has increasingly attracted the interest of the research community in recent years. Its direct applications to human-robot interaction and socially-aware human motion generation make it a very attractive field. In this survey, we define the behavior forecasting problem for multiple interactive agents in a generic way that aims at unifying the fields of social signals prediction and human motion forecasting, traditionally separated. We hold that both problem formulations refer to the same conceptual problem, and identify many shared fundamental challenges: future stochasticity, context awareness, history exploitation, etc. We also propose a taxonomy that comprises methods published in the last 5 years in a very informative way and describes the current main concerns of the community with regard to this problem. In order to promote further research on this field, we also provide a summarized and friendly overview of audiovisual datasets featuring non-acted social interactions. Finally, we describe the most common metrics used in this task and their particular issues.

show abstract

Section: Datasetsmentioning

confidence: 99%

Section: Datasetsmentioning

confidence: 99%

Section: Datasetsmentioning

confidence: 99%

See 1 more Smart Citation

Didn't see that coming: a survey on non-verbal social human behavior forecasting

Barquero¹,

Núñez²,

Escalera³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Facial expression analysis and video games have been combined in multiple studies discussing various topics, such as affective gaming [22,27], game personalisation [7], player affect evaluation [24,28] and alternative gameplay mechanisms [25]. Recently, Doyran et al [9] published a rich dataset that enables multi-modal, multi-player affect and interaction analysis through capturing the facial expressions of board game players.…”

Section: Related Workmentioning

confidence: 99%

Correlating Facial Expressions and Subjective Player Experiences in Competitive Hearthstone

Blom

Kosa

Bakkes

et al. 2021

The 16th International Conference on the Foundations of Digital Games (FDG) 2021

View full text Add to dashboard Cite

General rightsCopyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.• Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal Take down policyIf you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

show abstract

“…Music works contain rich human emotions, and emotions play an indispensable role in the transmission of musical emotions and understanding and appreciation of music [1][2][3]. With the current development of Internet technology and artificial intelligence, the amount of digital music is growing rapidly.…”

Section: Introductionmentioning

confidence: 99%

Multimodal Music Emotion Recognition Method Based on the Combination of Knowledge Distillation and Transfer Learning

Tong

2022

Scientific Programming

View full text Add to dashboard Cite

The main difficulty of music emotion recognition is the lack of sufficient labeled data. Only the labeled data with unbalanced categories are used to train the emotion recognition model. Not only is accurate labeling of emotion categories costly and time-consuming, but it also requires extensive musical background for labelers At the same time, the emotion of music is often affected by many factors. Singing methods, music styles, arrangement methods, lyrics, and other factors will affect the expression of music emotions. This paper proposes a multimodal method based on the combination of knowledge distillation and music style transfer learning and verifies the effectiveness of the method on 20,000 songs. Experiments show that compared with traditional methods, such as single audio, single lyric, and single audio with multimodal lyric methods, the method proposed in this paper has significantly improved the accuracy of emotion recognition, and the generalization ability has been significantly improved.

show abstract

MUMBAI: multi-person, multimodal board game affect and interaction analysis dataset

Cited by 22 publications

References 68 publications

Didn't see that coming: a survey on non-verbal social human behavior forecasting

Didn't see that coming: a survey on non-verbal social human behavior forecasting

Correlating Facial Expressions and Subjective Player Experiences in Competitive Hearthstone

Multimodal Music Emotion Recognition Method Based on the Combination of Knowledge Distillation and Transfer Learning

Contact Info

Product

Resources

About