Ninth IEEE International Symposium on Multimedia (ISM 2007) 2007
DOI: 10.1109/ism.2007.4412354
|View full text |Cite
|
Sign up to set email alerts
|

Multi-stream Asynchrony Modeling for Audio-Visual Speech Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2008
2008
2018
2018

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…Usually the visual features are handled in a separate stream in a multi-stream HMM (MSHMM) [3]. However, since regular multi-stream HMM handles both channels synchronously and there may be asynchrony between audio and video channels, some extensions like coupled hidden Markov models (CHMM) [4], product hidden Markov models (PHMM) [3] and multi-stream asynchrony dynamic Bayesian networks [5] have been proposed to take this asynchrony into consideration. Also as This research is supported by The Scientific and Technological Research Council of Turkey (TUBITAK) under the scientific and technological research support program (code 1001), project number 107E015 entitled "Novel Approaches in Audio Visual Speech Recognition".…”
Section: Introductionmentioning
confidence: 99%
“…Usually the visual features are handled in a separate stream in a multi-stream HMM (MSHMM) [3]. However, since regular multi-stream HMM handles both channels synchronously and there may be asynchrony between audio and video channels, some extensions like coupled hidden Markov models (CHMM) [4], product hidden Markov models (PHMM) [3] and multi-stream asynchrony dynamic Bayesian networks [5] have been proposed to take this asynchrony into consideration. Also as This research is supported by The Scientific and Technological Research Council of Turkey (TUBITAK) under the scientific and technological research support program (code 1001), project number 107E015 entitled "Novel Approaches in Audio Visual Speech Recognition".…”
Section: Introductionmentioning
confidence: 99%