2021
DOI: 10.1002/int.22505
|View full text |Cite
|
Sign up to set email alerts
|

Optimal feature selection based speech emotion recognition using two‐stream deep convolutional neural network

Abstract: Speech signal processing is an active area of research, the most dominant source of exchanging information among human beings, and the best way for human–computer interaction (HCI). Human behavior assessments and emotion recognition from a speech signal, such as speech emotion recognition (SER) is an emerging HCI area of exploration with various real time claims. The performance of an efficient SER system depends on feature learning, which include salient and discriminative information such as high‐level deep … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
26
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 75 publications
(26 citation statements)
references
References 68 publications
(79 reference statements)
0
26
0
Order By: Relevance
“…Codeword HASH CRC , and bl is given in (5), where bl corresponds to the size of the message block D i . Since the codeword in the inter-codeword verification resetting group lacks the covert message section, the capacity ρ is evaluated on average and is expressed by (6). Finally, the throughput of the CTC is calculated based on ρ and ψ, as shown in (7).…”
Section: Throughputmentioning
confidence: 99%
See 2 more Smart Citations
“…Codeword HASH CRC , and bl is given in (5), where bl corresponds to the size of the message block D i . Since the codeword in the inter-codeword verification resetting group lacks the covert message section, the capacity ρ is evaluated on average and is expressed by (6). Finally, the throughput of the CTC is calculated based on ρ and ψ, as shown in (7).…”
Section: Throughputmentioning
confidence: 99%
“…[1][2][3] The deployment of Voice over Long Term Evolution (VoLTE) for next-generation audio and video communication over mobile networks has significantly increased the available data rates and quality of service (QoS). [4][5][6] The core network of this technology is a packet-switched network, that is also part of the 5G standard. In recent years, some research efforts have focused on constructing covert timing channels (CTCs) over wireless and VoLTE networks.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Considering the excellent performance of deep learning technologies, 30,31 the deep learning is utilized to capture forgery traces. Relying on the deep analysis of forgery methods, different traces have been utilized for forgery localization, such as noise level, 3,5,32,33 unnatural boundaries, 34 contrast/brightness inconsistencies, [35][36][37] pixels periodic correlation caused by interpolation, 1,3,19 and so on.…”
Section: Introductionmentioning
confidence: 99%
“…[27] It has been widely used in image recognition [28,29] and speech recognition. [30] Recently, the inverse design of many metasurface structures has been realized by utilizing DNN. [31][32][33][34] In these studies, most DNNs adopted inverse design to obtain suitable parameters according to the design goal.…”
mentioning
confidence: 99%