IEEE International Conference on Acoustics Speech and Signal Processing 2002
DOI: 10.1109/icassp.2002.5745041
|View full text |Cite
|
Sign up to set email alerts
|

Foveated multipoint videoconferencing at low bit rates

Abstract: Multipoint videoconferencing (MPVC) involves three or more participants engaged in video communication over a network. A video server combines the video streams from each participant and then broadcasts the resulting stream to all participants. In this paper, we propose to use foveation, which is non-uniform resolution representation of an image reflecting the sampling in the retina, to reduce the bandwidth requirements of MPVC. We develop foveated MPVC algorithms for variable and constant bit rate MPVC. We sh… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
5
0

Year Published

2005
2005
2022
2022

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 7 publications
0
5
0
Order By: Relevance
“…In the first of these, a foveated retinal sampling geometry is used to either apply a foveating coordinate transformation on an original uniform resolution image [33], or to average and map local pixel groups into superpixels [34], [35]. Filter-based methods process images with space-varying low-pass filter with cut-off frequencies determined by foveated resolution-reduction protocols [36], [37]. Multiresolution methods foveation involves decomposing images into bandpass scales, and only retaining scales specified by a foveal fall-off function defined relative to a measured or presumed fixation point [4], [38].…”
Section: A Foveated Video Compressionmentioning
confidence: 99%
“…In the first of these, a foveated retinal sampling geometry is used to either apply a foveating coordinate transformation on an original uniform resolution image [33], or to average and map local pixel groups into superpixels [34], [35]. Filter-based methods process images with space-varying low-pass filter with cut-off frequencies determined by foveated resolution-reduction protocols [36], [37]. Multiresolution methods foveation involves decomposing images into bandpass scales, and only retaining scales specified by a foveal fall-off function defined relative to a measured or presumed fixation point [4], [38].…”
Section: A Foveated Video Compressionmentioning
confidence: 99%
“…In addition, identifying a dominant speaker requires periodical analysis of conversational patterns from different clients and the ensuing unequal rate control. The rate control typically applies a form of foveating such that the visual clarity of a dominant speaker will appear sharper, relative to the nondominant speakers [29]. Research into unequal rate control for a dominant speaker applied dynamic bit allocation and dynamic region of interest transcoding [31,32,19,11].…”
Section: Related Workmentioning
confidence: 99%
“…29), Y represents a set of speech durations of a loudest speaker during a video communication session, with y i ∈ Y. For instance, inFig.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…However, in some applications, users focus more on some regions of images and expect better quality in those regions. For example, in a videoconference environment, more user attention is paid to the face region of the speaker than other regions [28]. Yet another example is in the remote education applications, where students focus mostly on the teacher or some specific region of the blackboard or the lecture slide.…”
Section: Foveation-based Rate Shapingmentioning
confidence: 99%