2020
DOI: 10.21608/ejle.2020.22022.1002
|View full text |Cite
|
Sign up to set email alerts
|

Noise-Robust Speech Recognition System based on Multimodal Audio-Visual Approach Using Different Deep Learning Classification Techniques

Abstract: Multimodal speech recognition is proved to be one of the most promising solutions for designing a robust speech recognition system, especially when the audio signal is affected by noise. The visual signal can be used to obtain more information to enhance the recognition accuracy in a noisy system, whereas the reliability of the visual signal is not affected by the acoustic noise. The critical stage in designing a robust speech recognition system is the choice of an appropriate feature extraction method for bot… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 32 publications
(39 reference statements)
0
1
0
Order By: Relevance
“…PCA uses statistical tools to identify noise and redundancy in the dataset [30]. It keeps the necessary parts that have more variation of the data and removes the unnecessary parts with fewer variations, therefore speeding up the training and testing time of the machine learning algorithm.…”
Section: Model Setupmentioning
confidence: 99%
“…PCA uses statistical tools to identify noise and redundancy in the dataset [30]. It keeps the necessary parts that have more variation of the data and removes the unnecessary parts with fewer variations, therefore speeding up the training and testing time of the machine learning algorithm.…”
Section: Model Setupmentioning
confidence: 99%
“…Alternatively, it is known that the CNN model is a deep learning algorithm that can perform complex tasks with images, videos, texts, and sounds that are inspired by the human visual system [3]. CNN's achieved great success in image recognition [4], and recently they are widely adopted in ASR systems [5]- [9]. Most leading technology companies like Google, Facebook, Microsoft, IBM, Yahoo!, Twitter and Adobe, have initiated research and development projects [10]- [13] which employs CNN for image recognition products and services.…”
Section: Introductionmentioning
confidence: 99%