2017
DOI: 10.1007/978-3-319-66429-3_76
|View full text |Cite
|
Sign up to set email alerts
|

Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
3
2

Relationship

2
8

Authors

Journals

citations
Cited by 16 publications
(8 citation statements)
references
References 12 publications
0
7
0
1
Order By: Relevance
“…Every practical lip-reading system necessarily include 4 main processing stages, such as image acquisition, region of interest (ROI) localization, feature extraction and speech recognition. More detailed description of the used VSR system is described in the work (Ivanko et al, 2017).…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Every practical lip-reading system necessarily include 4 main processing stages, such as image acquisition, region of interest (ROI) localization, feature extraction and speech recognition. More detailed description of the used VSR system is described in the work (Ivanko et al, 2017).…”
Section: Proposed Methodsmentioning
confidence: 99%
“…In traditional methods of AVSR, a transform (e.g., principal component analysis (PCA) [ 27 ], linear discriminant analysis (LDA) [ 28 ], or t-distributed stochastic neighbor embedding (t-SNE) [ 29 ]) is usually applied to the detected ROI for video and concatenated mel-frequency cepstral coefficients (MFCCs) for audio, followed by a deep autoencoder to extract bottleneck features [ 30 , 31 , 32 ]. Then, extracted features are fed to a classifier such as support vector machine (SVM) [ 33 ], hidden markov model (HMM) [ 34 ], coupled HMM [ 35 ], etc.…”
Section: Related Workmentioning
confidence: 99%
“…Followed in the second stage by a sequence model (e.g. Hidden Markov Model) to tackle the temporal dependency from the extracted features for classification [8][9][10].…”
Section: Related Workmentioning
confidence: 99%