2021
DOI: 10.48550/arxiv.2101.06053
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements

Abstract: Truly real-life data presents a strong, but exciting challenge for sentiment and emotion research. The high variety of possible 'in-the-wild' properties makes large datasets such as these indispensable with respect to building robust machine learning models. A sufficient quantity of data covering a deep variety in the challenges of each modality to force the exploratory analysis of the interplay of all modalities has not yet been made available in this context. In this contribution, we present MuSe-CaR, a firs… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(11 citation statements)
references
References 69 publications
(103 reference statements)
0
11
0
Order By: Relevance
“…. $15.00 https://doi.org/10.1145/3453892.3461009 fusion (LF) [3,9] approaches are the most prominent techniques. As discussed in [4] by N.Majumder et al, sequential (or hierarchical in their case) late fusion can filter out inter-modal correlation.…”
Section: Related Workmentioning
confidence: 99%
“…. $15.00 https://doi.org/10.1145/3453892.3461009 fusion (LF) [3,9] approaches are the most prominent techniques. As discussed in [4] by N.Majumder et al, sequential (or hierarchical in their case) late fusion can filter out inter-modal correlation.…”
Section: Related Workmentioning
confidence: 99%
“…The MuSe-CaR [30] is a multimodal data set gathered in-the-wild from English YouTube videos centred around car reviews. It was created with different computational tasks in mind, allowing researchers to improve the machine's understanding of how sentiment and topics are connected.…”
Section: Muse-carmentioning
confidence: 99%
“…These pipelines provide timestamps for each word (start and end point of an utterance) through which all words articulated within a segment can be assigned to it. Due to the in-the-wild factors, the error rate of the automatic transcriptions is estimated to be relatively high and specified at around 25 % on a subset of 10 hand-transcribed videos by the authors of [30].…”
Section: Muse-carmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, we utilize a richly annotated data set of ca. 600 h of continuous annotations (Stappen et al, 2021), and derive cross-task features from this initial correlation analysis. Second, we compare these engineered, lean features, to a computationally intensive feature selection approach and to all features when predicting selected engagement indicators (i.e., views, likes, number of comments, likes of the comments).…”
Section: Introductionmentioning
confidence: 99%