Proceedings of the Ninth International Symposium on Consumer Electronics, 2005. (ISCE 2005).
DOI: 10.1109/isce.2005.1502369
|View full text |Cite
|
Sign up to set email alerts
|

Relative timing of sound and vision: evaluation and correction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 2 publications
0
2
0
Order By: Relevance
“…We newly captured speech scenes of two persons, X and Y (the same persons in the training data), and segmented them into interval sequences 1 According to the research on the over all timing tolerance between video and audio by A. Peregudov et al [13], the thresholds of acceptability are about +90 ms (sound leading) to −185 ms (sound delayed). For these reason, we decided the standard deviation of the Gaussian, σ, as 2σ 100 ms ( the frame rate was 60 fps, therefore σ = 3 frames 50 ms ).…”
Section: Speaker Detection Using the Timing Structurementioning
confidence: 99%
See 1 more Smart Citation
“…We newly captured speech scenes of two persons, X and Y (the same persons in the training data), and segmented them into interval sequences 1 According to the research on the over all timing tolerance between video and audio by A. Peregudov et al [13], the thresholds of acceptability are about +90 ms (sound leading) to −185 ms (sound delayed). For these reason, we decided the standard deviation of the Gaussian, σ, as 2σ 100 ms ( the frame rate was 60 fps, therefore σ = 3 frames 50 ms ).…”
Section: Speaker Detection Using the Timing Structurementioning
confidence: 99%
“…However, we human consider that these temporal differences are perfectly normal. In fact, it is known that some temporal variance is allowed in our speech perception [13]. Frame-wise integration methods are often used in speech recognition, however, they sometimes fail to describe such loose synchronization.…”
Section: Introductionmentioning
confidence: 99%