2005
DOI: 10.1145/1075389.1075390
|View full text |Cite
|
Sign up to set email alerts
|

Automatic summarization of voicemail messages using lexical and prosodic features

Abstract: This aticle presents trainable methods for extracting principal content words from voicemail messages. The short text summaries generated are suitable for mobile messaging applications. The system uses a set of classifiers to identify the summary words with each word described by a vector of lexical and prosodic features. We use an ROC-based algorithm, Parcel, to select input features (and classifiers). We have performed a series of objective and subjective evaluations using unseen data from two different spee… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
41
0

Year Published

2005
2005
2015
2015

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 48 publications
(42 citation statements)
references
References 26 publications
1
41
0
Order By: Relevance
“…The formulae for the derivatives when the generative model is an HMM may be found in [6]. Let (23) so that the diagonal covariance GMM likelihood is (24) where is the set of parameters in the GMM, . In particular, is the prior of the th Gaussian component of the GMM, is the mean vector of the th component, and is the corresponding diagonal covariance vector.…”
Section: B Computing the Score-vectorsmentioning
confidence: 99%
See 1 more Smart Citation
“…The formulae for the derivatives when the generative model is an HMM may be found in [6]. Let (23) so that the diagonal covariance GMM likelihood is (24) where is the set of parameters in the GMM, . In particular, is the prior of the th Gaussian component of the GMM, is the mean vector of the th component, and is the corresponding diagonal covariance vector.…”
Section: B Computing the Score-vectorsmentioning
confidence: 99%
“…Cepstral mean subtraction was applied to remove the effects of the communication channel. Silence frames within each utterance were segmented out using a multilayer perceptron pre-trained on a different dataset [23].…”
Section: E Xperimentsmentioning
confidence: 99%
“…all shouted segments, because shouting may indicate extra importance in some situations. To determine the vocal effort of a speaker, LPC (Linear Predictive Coding) analysis is performed using several different orders (13)(14)(15)(16)(17)(18)(19). The best fit to the incoming spectrum is then determined, and used for inverse-filtering.…”
Section: Vocal Effort Estimationmentioning
confidence: 99%
“…There has been much significant progress made in speech summarization for English or Japanese text and audio sources (Hori and Furui, 2003;Inoue et al, 2004;Koumpis and Renals, 2005;Maskey and Hirschberg, 2003;Maskey and Hirschberg, 2005). Some research efforts have focused on summarizing Mandarin sources Huang et al, 2005), which are dependent on lexical features.…”
Section: Introductionmentioning
confidence: 99%
“…Some research efforts have focused on summarizing Mandarin sources Huang et al, 2005), which are dependent on lexical features. Considering the difficulty in obtaining high quality transcriptions, some researchers proposed speech summarization systems with non-lexical features (Inoue et al, 2004;Koumpis and Renals, 2005;Maskey and Hirschberg, 2003;. However, there does not exist any empirical study on speech summarization without lexical features for Mandarin Chinese sources.…”
Section: Introductionmentioning
confidence: 99%