Companion Publication of the 2021 International Conference on Multimodal Interaction
DOI: 10.1145/3461615.3486575
Addressing Data Scarcity in Multimodal User State Recognition by Combining Semi-Supervised and Supervised Learning

Abstract: Detecting the mental states of human users is crucial for the development of cooperative and intelligent robots, as it enables the robot to understand the user's intentions and desires. Despite their importance, it is difficult to obtain large amounts of high-quality data for training automatic recognition algorithms, as the time and effort required to collect and label such data are prohibitively high. In this paper, we present a multimodal machine learning approach for detecting dis-/agreement and confusion states…
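The abstract stops short of architectural details, but the core idea of training on a small labeled set together with a larger unlabeled one can be sketched. The PyTorch snippet below is a hypothetical illustration of combining a supervised cross-entropy loss with a pseudo-label loss over fused audio/video features; the class names, feature dimensions, confidence threshold, and late-fusion strategy are assumptions and are not taken from the paper.

```python
# Hypothetical sketch: supervised + pseudo-label training over fused multimodal features.
# Not the architecture from the paper; names, dimensions, and thresholds are assumed.
import torch
import torch.nn as nn
import torch.nn.functional as F

class FusionClassifier(nn.Module):
    """Late-fusion classifier over pre-extracted audio and video feature vectors."""
    def __init__(self, audio_dim=128, video_dim=256, num_classes=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(audio_dim + video_dim, 256), nn.ReLU(),
            nn.Linear(256, num_classes),
        )

    def forward(self, audio_feat, video_feat):
        return self.net(torch.cat([audio_feat, video_feat], dim=-1))

def training_step(model, labeled_batch, unlabeled_batch, optimizer,
                  pseudo_threshold=0.9, unsup_weight=0.5):
    a_l, v_l, y = labeled_batch      # labeled audio/video features and class labels
    a_u, v_u = unlabeled_batch       # unlabeled audio/video features

    # Supervised branch: standard cross-entropy on the labeled examples.
    sup_loss = F.cross_entropy(model(a_l, v_l), y)

    # Semi-supervised branch: keep only confident pseudo-labels on unlabeled data.
    with torch.no_grad():
        probs = F.softmax(model(a_u, v_u), dim=-1)
        conf, pseudo_y = probs.max(dim=-1)
        mask = conf > pseudo_threshold
    if mask.any():
        unsup_loss = F.cross_entropy(model(a_u[mask], v_u[mask]), pseudo_y[mask])
    else:
        unsup_loss = torch.zeros((), device=a_u.device)

    loss = sup_loss + unsup_weight * unsup_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```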

Cited by 2 publications (1 citation statement)
References 33 publications (34 reference statements)
“…Furthermore, AL techniques can encounter challenges with technically complex data, as designing an effective query strategy is non-trivial, and uninformative examples may be selected. Recently, Voß and colleagues [48] tackled multimodal dis-/agreement classification in human-robot interactions and YouTube videos using semi-supervised deep architectures. While their work demonstrates promising results, it still relies on a supervised branch for the final classification, necessitating a significant amount of labeled examples to generalize effectively.…”
Section: Related Work
confidence: 99%
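For context on the query-strategy difficulty the citing authors mention, the sketch below shows a basic least-confidence active-learning selector over the same kind of fused multimodal model. It is a hypothetical PyTorch illustration (the `select_queries` helper, its parameters, and the budget are invented here) and does not come from either cited paper.

```python
# Hypothetical sketch: least-confidence query selection for active learning.
import torch
import torch.nn.functional as F

def select_queries(model, unlabeled_audio, unlabeled_video, budget=16):
    """Return indices of the `budget` least-confident unlabeled examples."""
    model.eval()
    with torch.no_grad():
        probs = F.softmax(model(unlabeled_audio, unlabeled_video), dim=-1)
    confidence, _ = probs.max(dim=-1)                 # confidence of the top prediction
    k = min(budget, confidence.numel())
    return torch.topk(-confidence, k=k).indices       # least confident examples first
```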