2006
DOI: 10.1016/j.specom.2005.11.003
|View full text |Cite
|
Sign up to set email alerts
|

Efficient scalable encoding for distributed speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
10
0

Year Published

2007
2007
2016
2016

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 36 publications
0
10
0
Order By: Relevance
“…What is more exciting is the possibility of distributing the processing work between a low resource client and a server machine without any change in implementation, leveraging the network transparency provided by libcppa. For instance, a smartphone can run early stages of the recognition pipeline, such as audio acquisition and feature extraction, while a remote server does the processor-intensive decoding [6]. …”
Section: Introductionmentioning
confidence: 99%
“…What is more exciting is the possibility of distributing the processing work between a low resource client and a server machine without any change in implementation, leveraging the network transparency provided by libcppa. For instance, a smartphone can run early stages of the recognition pipeline, such as audio acquisition and feature extraction, while a remote server does the processor-intensive decoding [6]. …”
Section: Introductionmentioning
confidence: 99%
“…The resulting source encoding bit-rate is 4.4 kbps. The authors in [5] present a scalable predictive approach, in which each feature is independently quantized using a uniform scalar quantizer, providing flexibility in optimizing the DSR system to the changing bandwidth requirement and server load, while achieving the best possible recognition performance. However, the performance is considerably reduced using the state of the art speech codec's at low bit-rate condition.…”
Section: Introductionmentioning
confidence: 99%
“…Bits allocation of the proposed encoder. 5 9 9 v 6 , v 7 , v 8 9 9 v 9 , v 10 , v 11 9 9 v 12 9 9 v 13 9 9…”
mentioning
confidence: 99%
“…These portable devices are typically small in size and difficult to manipulate, thus current IUIs available for them are limited. Thus, as a promising user interface to make them easier to use, speech recognition can take the place of the keyboard or touch pad on these devices since voice input only requires a microphone (Srinivasamurthy et al, 2006). A major problem, however, is that the computational complexity of speech recognition is too high for most portable devices.…”
Section: Introductionmentioning
confidence: 99%
“…Several coding methods for compressing MFCCs have been proposed in literature (Srinivasamurthy et al, 2006;Kiss, 2000;Zhu and Alwan, 2001;Hirsch, 1998;So and Paliwal, 2006;Borgstrom and Alwan, 2007;Digalakis et al, 1999;Ramaswamy and Gopalakrishnan, 1998;Kiss and Kapanen, 1999). In early attempts at compression, scalar quantization and vector quantization were applied to MFCCs, and the word error rates (WERs) were measured according to various bit-rates (Digalakis et al, 1999).…”
Section: Introductionmentioning
confidence: 99%