Proceedings of the 31st International Conference on Computer Graphics and Vision. Volume 2, 2021
DOI: 10.20948/graphicon-2021-3027-905-916

Development of Visual and Audio Speech Recognition Systems Using Deep Neural Networks

Abstract: In this paper we design end-to-end neural networks for the low-resource lip-reading and audio speech recognition tasks using 3D CNNs, pre-trained CNN weights of several state-of-the-art models (e.g. VGG19, InceptionV3, MobileNetV2) and LSTMs. We present two phrase-level speech recognition pipelines: one for lip-reading and one for acoustic speech recognition. We evaluate different combinations of front-end and back-end modules on the RUSAVIC dataset. We compare our results with the traditional 2D CNN approach and de…

Cited by 2 publications
References 34 publications