2018 Eleventh International Conference on Contemporary Computing (IC3) 2018
DOI: 10.1109/ic3.2018.8530509
|View full text |Cite
|
Sign up to set email alerts
|

A Lip Reading Model Using CNN with Batch Normalization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 22 publications
(6 citation statements)
references
References 8 publications
0
6
0
Order By: Relevance
“…Likewise, the derived gradient was used for updating the weights, following the Adam optimizer, as expressed in Eq. ( 5) [22].…”
Section: F Optimized Hyperparametersmentioning
confidence: 99%
“…Likewise, the derived gradient was used for updating the weights, following the Adam optimizer, as expressed in Eq. ( 5) [22].…”
Section: F Optimized Hyperparametersmentioning
confidence: 99%
“…H. Gupta et al [158] proposed a lip-reading model using CNN batch normalization for audio-less video data. The Haar Cascade algorithm is employed to extract the lip region from each individual frontal facial image in the video sequence and combine them into a single image.…”
Section: Sequence Of Video Framesmentioning
confidence: 99%
“…For instance, IAN [143] utilizes 3D-ResNet [138] for visual representation. DNF [149] subtly designs 2D-CNN with the 1D temporal convolution, which has become one of the mainstream baseline methods. Although CNN-based methods can effectively capture spatial features in gesture images, they are limited in handling the temporal dynamics of gestures directly, and 3D-CNN-based methods involve significant computational overhead.…”
Section: Sign Language Recognitionmentioning
confidence: 99%