2007 IEEE 9th Workshop on Multimedia Signal Processing 2007
DOI: 10.1109/mmsp.2007.4412866
|View full text |Cite
|
Sign up to set email alerts
|

An Embedded System for In-Vehicle Visual Speech Activity Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2010
2010
2019
2019

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 5 publications
0
4
0
Order By: Relevance
“…(1) Feature extraction extracts attention-related visual features (ostensive-stimuli) from an image sequence and/or audio features from a sound stream. Various visual features are often chosen to be used as stimuli for the attention system such as the distance between a robot and a person [12,13], the head direction of the people participating in an interaction [14,15,16,17,18,19], and/or visual speaking status detection [20,21,22,23,24,25]. When audio features are used for the attention model, the direction of a sound source and the distance to a sound source are usually adopted [26,27,28].…”
Section: Introductionmentioning
confidence: 99%
“…(1) Feature extraction extracts attention-related visual features (ostensive-stimuli) from an image sequence and/or audio features from a sound stream. Various visual features are often chosen to be used as stimuli for the attention system such as the distance between a robot and a person [12,13], the head direction of the people participating in an interaction [14,15,16,17,18,19], and/or visual speaking status detection [20,21,22,23,24,25]. When audio features are used for the attention model, the direction of a sound source and the distance to a sound source are usually adopted [26,27,28].…”
Section: Introductionmentioning
confidence: 99%
“…However, systems have only been examined in unrealistic scenarios. There are few attempts 978-1-4244-7167-6/10/$26.00 ©201 0 IEEE to incorporate the visual modality [3,4] in real-time sys tem. Recently, one notable attempt has been the work of Libal et.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, one notable attempt has been the work of Libal et. al [4], who developed a real-time system to recognize visual speech activity on low cost embedded platforms. This system uses a camera mounted on the rearview mirror to monitor the driver.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we propose an extraction method of lip movement images from successive image frames in the speech activity extraction process [5] which is preprocessing phase of speech recognition. The image frames are acquired from the PC image camera.…”
Section: Introductionmentioning
confidence: 99%