2020
DOI: 10.1177/1754073919898526
|View full text |Cite
|
Sign up to set email alerts
|

A Review on Five Recent and Near-Future Developments in Computational Processing of Emotion in the Human Voice

Abstract: We provide a short review on the recent and near-future developments of computational processing of emotion in the voice, highlighting (a) self-learning of representations moving continuously away from traditional expert-crafted or brute-forced feature representations to end-to-end learning, (b) a movement towards the coupling of analysis and synthesis of emotional voices to foster better mutual understanding, (c) weakly supervised learning at a large scale, (d) transfer learning from related domains such as s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
24
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 46 publications
(25 citation statements)
references
References 44 publications
0
24
0
Order By: Relevance
“…To effectively establish an emotional identification and expression system, emotional identification and synthesis based on DL have considerable potential in human–machine interactions (Schuller and Schuller, 2021 ). Recognizing emotions through the automatic extraction of acoustic features and generating expressions through emotions are the main strategies for relevant research development.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…To effectively establish an emotional identification and expression system, emotional identification and synthesis based on DL have considerable potential in human–machine interactions (Schuller and Schuller, 2021 ). Recognizing emotions through the automatic extraction of acoustic features and generating expressions through emotions are the main strategies for relevant research development.…”
Section: Literature Reviewmentioning
confidence: 99%
“…It has been proven that a generative adversarial network (GAN) can improve the machine's performance in emotional analysis tasks (Han et al, 2019 ). Additionally, people begin to think about transfer learning applications in relevant tasks and voice emotional computing modes (Schuller and Schuller, 2021 ).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Expressions can be produced spontaneously in ecologically valid scenarios (Tracy & Matsumoto, 2008), with poses in controlled laboratory settings (Cordaro et al, 2018), or elicited with stimuli that are meant to induce specific emotions (Levenson et al, 1991). The expressive cues, such as facial muscle movements or vocalizations, are then empirically mapped using established coding schemes (Dael et al, 2012a(Dael et al, , 2012bEkman et al, 2002) and automatic detection software using algorithms (Krumhuber et al, 2019;Schuller & Schuller, 2020). A broad range of positive emotions have been found to have specific nonverbal cues (for a review, see Sauter, 2017).…”
Section: Nonverbal Expressions Of Positive Emotionsmentioning
confidence: 99%
“…Expressions can be produced spontaneously in ecologically valid scenarios (Tracy & Matsumoto, 2008), with poses in controlled laboratory settings (Cordaro et al, 2018), or elicited with stimuli that are meant to induce specific emotions (Levenson et al, 1991). The expressive cues, such as facial muscle movements or vocalizations, are then empirically mapped using established coding schemes (Ekman et al, 2002; and automatic detection software using algorithms (Schuller & Schuller, 2020;Krumhuber et al, 2019). A broad range of positive emotions have been found to have specific nonverbal cues (for a review, see Sauter, 2017).…”
Section: Nonverbal Expressions Of Positive Emotionsmentioning
confidence: 99%