2002
DOI: 10.1155/s1110865702206162
|View full text |Cite
|
Sign up to set email alerts
|

Audio-Visual Speech Recognition Using MPEG-4 Compliant Visual Features

Abstract: The most important challenges in AVSR and the focus of most research are the features that are extracted, and when combined give better results. The other challenge is the resulted feature here of nature are large in size, then prefers here to reduce the features by use of an appropriate way to reduce these data with ensure have their properties after downsizing. The System that is presented in this research is for recognition a group of Arabic words voices, from one to ten words. In the acoustic parts the fea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
33
0

Year Published

2006
2006
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 47 publications
(33 citation statements)
references
References 28 publications
0
33
0
Order By: Relevance
“…There exist many techniques in the literature that attempt to solve the lip segmentation/tracking problem [12], [29]- [35]. The performance of these techniques usually depend on acquisition specifics such as image quality, resolution, head pose and illumination conditions.…”
Section: Extraction Of Contour-based Motion Features 1) Lip Contoumentioning
confidence: 99%
See 2 more Smart Citations
“…There exist many techniques in the literature that attempt to solve the lip segmentation/tracking problem [12], [29]- [35]. The performance of these techniques usually depend on acquisition specifics such as image quality, resolution, head pose and illumination conditions.…”
Section: Extraction Of Contour-based Motion Features 1) Lip Contoumentioning
confidence: 99%
“…Deformable templates [4], [5], active shape models (ASM) [6], [10], [11], and snakes [12] have been used to obtain different lip geometry features; however, they all suffer from complex feature extraction and training procedures. In [5], Gaussian mixture models (GMM) are used to model both the lip and the non-lip region, and lip tracking is performed by deformable templates.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…Though the discriminant analysis is followed by a smoothing step, the segmentation remains noisy. Snakes [16] have been widely used for lip segmentation ( [1] [20]) because snakes can take into account in a same framework smoothing and elasticity constraints. Snake-based methods yield to interesting results but the main drawback is the tuning of several parameters.…”
Section: Introductionmentioning
confidence: 99%
“…Significant research has been carried out to accurately obtain the outer lip contour. One of the most popular approaches is using snakes (Kass et al 1988), which have the ability to take smoothing and elasticity constraints into account (Terzopoulos and Waters 1993;Aleksic et al 2002). Another popular approach is using active shape models and appearance shape models.…”
Section: Lip Readingmentioning
confidence: 99%