Proceedings of the 6th International Conference on Multimodal Interfaces 2004
DOI: 10.1145/1027933.1027958

Bimodal HCI-related affect recognition

Abstract: Perhaps the most fundamental application of affective computing would be Human-Computer Interaction (HCI) in which the computer is able to detect and track the user's affective states and provide corresponding feedback. The human multi-sensor affect system defines the expectation of a multimodal affect analyzer. In this paper, we present our efforts toward audio-visual HCI-related affect recognition. With HCI applications in mind, we take into account some special affective states which indicate users' cognitive/m…


Cited by 51 publications (42 citation statements); references 12 publications.
“…They could improve the performance of decision-level fusion by considering the dominant modality, determined by empirical studies, when a significant discrepancy between the outputs of the unimodal classifiers is observed. Recently, a large-scale audio-visual dataset was collected by Zeng et al. (2004), which contains four HCI-related affective responses (confusion, interest, boredom, and frustration) in addition to seven affects (the six basic emotions + neutral). To classify the 11 emotions subject-dependently, they used the SNoW (Sparse Network of Winnows) classifier with Naive Bayes as the update rule and achieved a recognition accuracy of almost 90% through bimodal fusion, while the unimodal classifiers yielded only 45-56%.…”
Section: Automatic Emotion Recognition Using Speech and Biosignals
confidence: 99%
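The fusion scheme this excerpt describes (combining unimodal audio and video classifier outputs, and falling back on an empirically determined dominant modality when the two outputs disagree strongly) could be sketched roughly as below. The function name, the L1 disagreement measure, and the threshold value are illustrative assumptions, not the cited authors' implementation.

```python
import numpy as np

def fuse_decisions(p_audio, p_video, dominant="video", disagreement_thr=0.5):
    """Decision-level fusion of two unimodal posteriors (hypothetical sketch).

    p_audio, p_video : per-class probabilities from each unimodal classifier
        (same class order).
    dominant : modality trusted when the classifiers disagree strongly,
        assumed to be determined beforehand by empirical studies.
    disagreement_thr : assumed L1-distance threshold marking a
        "significant discrepancy" between the unimodal outputs.
    """
    p_audio = np.asarray(p_audio, dtype=float)
    p_video = np.asarray(p_video, dtype=float)

    # Measure how much the two modalities disagree.
    discrepancy = np.abs(p_audio - p_video).sum()

    if discrepancy > disagreement_thr:
        # Significant discrepancy: fall back on the dominant modality.
        fused = p_video if dominant == "video" else p_audio
    else:
        # Otherwise combine the posteriors (product rule, renormalised).
        fused = p_audio * p_video
        fused /= fused.sum()

    return int(np.argmax(fused)), fused

# Example: an 11-class affect problem (6 basic emotions + neutral + 4 cognitive states).
rng = np.random.default_rng(0)
pa = rng.dirichlet(np.ones(11))
pv = rng.dirichlet(np.ones(11))
label, posterior = fuse_decisions(pa, pv)
print(label, posterior.round(3))
```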
“…The number of 30 features seems to be a good point, where the training and testing errors are about 0.14 and 0.16 on the evaluation axis, and 0.11 and 0.13 on the activation axis. These coarse-emotion-category-based performances are well above our previous fine-emotion-category-based classification at the frame level [11]. …”
Section: Methods
confidence: 57%
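A minimal sketch of the kind of feature-count sweep this excerpt reports, recording training and testing error as the number of selected features varies; the synthetic data, feature selector, and classifier below are placeholders, not the authors' actual frame-level setup.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in for frame-level audio-visual features on one coarse
# affect dimension (e.g. positive vs. negative evaluation).
X, y = make_classification(n_samples=2000, n_features=60, n_informative=30,
                           random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

# Sweep the number of selected features and track train/test error.
for k in (10, 20, 30, 40, 50):
    selector = SelectKBest(f_classif, k=k).fit(X_tr, y_tr)
    clf = LogisticRegression(max_iter=1000).fit(selector.transform(X_tr), y_tr)
    err_tr = 1 - clf.score(selector.transform(X_tr), y_tr)
    err_te = 1 - clf.score(selector.transform(X_te), y_te)
    print(f"k={k:2d}  train error={err_tr:.3f}  test error={err_te:.3f}")
```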
“…In addition, [11] considered four HCI-related cognitive states besides the six basic emotions. In contrast to those fine categories of emotion representation, this paper explores coarse categories for automatic affect recognition.…”
Section: Related Work
confidence: 99%
“…This variant is manifested in very different application areas, including, among others, data input (audio-visual speech recognition), person identification [39], and emotion recognition [44].…”
Section: Known Uses
confidence: 99%