Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2006
DOI: 10.1109/glocom.2006.603
|View full text |Cite
|
Sign up to set email alerts
|

SPCp1-01: Voice Activity Detection for VoIP-An Information Theoretic Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
12
0

Year Published

2008
2008
2012
2012

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(13 citation statements)
references
References 11 publications
1
12
0
Order By: Relevance
“…In [4], authors used the entropy measure to distinguish between speech and silence as a robust extension to the 3GPP standard. Nevertheless, the system assumes close-talking microphones and during tests ignores the effect of reverberation.…”
Section: Introductionmentioning
confidence: 99%
“…In [4], authors used the entropy measure to distinguish between speech and silence as a robust extension to the 3GPP standard. Nevertheless, the system assumes close-talking microphones and during tests ignores the effect of reverberation.…”
Section: Introductionmentioning
confidence: 99%
“…Higher playout buffer size offers increased tolerance towards jitter but increases mouth to ear delay. One simple way to reduce the delay at the playout buffer is to detect the talk spurts [2] and transmit only those segments. This scheme, while reducing the bandwidth, avoids building up of playout buffer.…”
Section: Introductionmentioning
confidence: 99%
“…Krätzer, Dittmann, and Vogel [14] argued that the inactive voice of a speech was not suitable for a being used as a cover object for steganography owing to an obvious distortion of the original speech. By contrast, Huang et al [15] suggested an algorithm for embedding information in some parameters of the speech frame encoded by ITU G.723.1 codec, without leading to distinction between inactive voices and active voices. These are computationally complex and require training and building a model.…”
Section: Related Workmentioning
confidence: 99%
“…Having a packet size equivalent to 10 ms allows the VoIP system to start playing the audio at the receiver's end after 30-40 ms from the time the queue start building up. If the frame duration were 50 ms, an initial delay would be of 150-200 ms, which is unsuitable since, maximum round trip delay within 400 ms [15] for a good quality speech. Therefore, the frame duration must be chosen properly.…”
Section: 1choice Of Frame Durationmentioning
confidence: 99%