Proceedings of First Signal Processing Society Workshop on Multimedia Signal Processing
DOI: 10.1109/mmsp.1997.602606
|View full text |Cite
|
Sign up to set email alerts
|

Frame rate and viseme analysis for multimedia applications

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(13 citation statements)
references
References 6 publications
0
13
0
Order By: Relevance
“…-linguistic -the classes of visemes are defined on the basis of an intuitive linguistic classification of groups of phonemes according to their expected visual realization, -data driven -the classes of visemes are defined on the basis of data acquired through parameter extraction and clustering [40].…”
Section: Viseme Classification Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…-linguistic -the classes of visemes are defined on the basis of an intuitive linguistic classification of groups of phonemes according to their expected visual realization, -data driven -the classes of visemes are defined on the basis of data acquired through parameter extraction and clustering [40].…”
Section: Viseme Classification Methodsmentioning
confidence: 99%
“…17 and 18 three most important parameters used in many implementations during parameter extraction from the lip area are shown [1,22,28,40]. They are geometrical parameters: the outer horizontal aperture, the outer vertical aperture and the angle of lip opening.…”
Section: Preparation Of Visual Feature Parameter Vectormentioning
confidence: 99%
“…For instance (Dupont & Luettin, 2000) and (Luettin et al, 1996) combine ASM with PCA features and (Chiou & Hwang, 1997) combines snake features with PCA. It was shown that the tongue, teeth and cavity have great influence on lip reading (Williams et al, 1998), therefore, the addition of these appearance related elements has significant influence on the performance of lip reading (Chitu et al, 2007). A special example is the so called Active Appearance Models (AAM) (Cootes et al, 1998) which combines the ASM method with texture based information to accurately detect the shape of the mouth or the face.…”
Section: Feature Vectors Definitionmentioning
confidence: 99%
“…The teeth, the tongue and the cavity were shown to be of great importance for lip reading by humans (Williams et al, 1998). Also other face elements were shown to be important during face to face communication; however, their exact influence is not completely elucidated.…”
Section: Introductionmentioning
confidence: 99%
“…The concept of visual phoneme does not suggest an explicit definition of lips' structure during phoneme utterance. The visemes are formed based on human perceptions which are categorized using confusion matrix where the most accurately detected visemes form a phoneme-viseme table (Williams, Rutledge, Garstecki, & Katsaggelos, 1997). The deficiency of this method can be observed by the fact that there are various phoneme-viseme tables used (Goldschen, Garcia, & Petajan, 1994;Hazen, Saenko, La, & Glass, 2004;Jiang, Alwan, Auer, & Bernstein, 2001).…”
Section: Introductionmentioning
confidence: 99%