2018
DOI: 10.3390/s18113744
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments

Abstract: Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a large amount of emotionally-balanced data samples from the target user when creating the personalized training model. Such research is difficult to apply in real environments due to the difficulty of collecting numer… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
10
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(10 citation statements)
references
References 39 publications
0
10
0
Order By: Relevance
“…Furthermore, collecting SER speech samples and tagging them with emotion labels is time-consuming and expensive. Thus, to overcome the limitations of volume and diversity of labeled speech samples for deep-learning SER models, studies have been performed using data augmentation [ 11 , 12 , 40 , 41 , 42 ], active learning [ 12 , 43 ] based on collected datasets, and domain adaptation [ 8 , 9 , 10 , 13 , 14 , 15 , 16 ] to adapt the existing SER datasets to the target domains.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Furthermore, collecting SER speech samples and tagging them with emotion labels is time-consuming and expensive. Thus, to overcome the limitations of volume and diversity of labeled speech samples for deep-learning SER models, studies have been performed using data augmentation [ 11 , 12 , 40 , 41 , 42 ], active learning [ 12 , 43 ] based on collected datasets, and domain adaptation [ 8 , 9 , 10 , 13 , 14 , 15 , 16 ] to adapt the existing SER datasets to the target domains.…”
Section: Related Workmentioning
confidence: 99%
“…Active-learning methods have been used to present greedy selection methods of speech samples to construct an initial SER model suitable for a target speaker based on limited samples [ 12 , 43 ]. Abdelwahab et al [ 43 ] proposed the active learning of greedy sampling to select the most informative samples to improve the performance of DNN-based SER models.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Thus arose the need to develop solutions around smart home concepts using hardware and software capable of capturing residents' behavior and understanding their activities, informing them of risk situations, or taking action for their satisfaction [9]. Event recognition and emotion recognition are also part of this technology concept [10]. The smart home is considered as a technology that can help reduce the cost of living and care for the elderly and disabled population, and improve their quality of life.…”
Section: Introductionmentioning
confidence: 99%