Automatic summarization of voicemail messages using lexical and prosodic features

Koumpis, Konstantinos; Renals, Steve

doi:10.1145/1075389.1075390

Cited by 48 publications

(42 citation statements)

References 26 publications

“…The formulae for the derivatives when the generative model is an HMM may be found in [6]. Let […] (23) so that the diagonal covariance GMM likelihood is […] (24), where […] is the set of parameters in the GMM. In particular, […] is the prior of the […]th Gaussian component of the GMM, […] is the mean vector of the […]th component, and […] is the corresponding diagonal covariance vector.…”

Section: B Computing the Score-vectors (mentioning)

confidence: 99%
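
The quoted passage refers to a diagonal covariance GMM likelihood, but its equations did not survive on this page. As a rough illustration only, the standard form of that likelihood (a prior-weighted sum of Gaussians with per-dimension variances) can be sketched as below. The function name, argument names, and data layout are assumptions made for this example and are not taken from the cited paper.

```python
import math

def diag_gmm_log_likelihood(x, priors, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM.

    x:         list of D floats (hypothetical feature vector)
    priors:    list of K component priors, assumed to sum to 1
    means:     K lists of D floats
    variances: K lists of D positive floats (diagonal covariances)
    """
    log_terms = []
    for prior, mean, var in zip(priors, means, variances):
        # Log of a Gaussian with diagonal covariance: sum over dimensions.
        log_gauss = 0.0
        for x_d, m_d, v_d in zip(x, mean, var):
            log_gauss += -0.5 * (math.log(2.0 * math.pi * v_d)
                                 + (x_d - m_d) ** 2 / v_d)
        log_terms.append(math.log(prior) + log_gauss)
    # Log-sum-exp over components keeps the sum numerically stable.
    peak = max(log_terms)
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))
```

Working in the log domain is a design choice for this sketch: products of many small per-dimension densities underflow quickly in plain floating point.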

Speaker verification using sequence discriminant support vector machines

Wan

Renals

2005

IEEE Trans. Speech Audio Process.

167

Abstract: This paper presents a text-independent speaker verification system using support vector machines (SVMs) with score-space kernels. Score-space kernels generalize Fisher kernels and are based on underlying generative models such as Gaussian mixture models (GMMs). This approach provides direct discrimination between whole sequences, in contrast with the frame-level approaches at the heart of most current systems. The resultant SVMs have a very high dimensionality since it is related to the number of parameters in the underlying generative model. To address problems that arise in the resultant optimization we introduce a technique called spherical normalization that preconditions the Hessian matrix. We have performed speaker verification experiments using the PolyVar database. The SVM system presented here reduces the relative error rates by 34% compared to a GMM likelihood ratio system. Index Terms: Fisher kernel, score-space kernel, speaker verification, support vector machine.

“…Cepstral mean subtraction was applied to remove the effects of the communication channel. Silence frames within each utterance were segmented out using a multilayer perceptron pre-trained on a different dataset [23].…”

Section: Experiments (mentioning)

confidence: 99%
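
The quoted passage mentions cepstral mean subtraction for removing channel effects. A minimal sketch of the usual per-utterance form of that technique follows: the mean of each cepstral coefficient over the utterance is subtracted from every frame. The function name and the list-of-lists frame layout are assumptions for this example. The citing paper's exact procedure is not given on this page.

```python
def cepstral_mean_subtraction(frames):
    """Subtract the per-utterance mean from each cepstral coefficient.

    frames: list of equal-length lists, one cepstral vector per frame
            (hypothetical layout chosen for this sketch).
    Returns a new list of mean-normalized frames.
    """
    if not frames:
        return []
    n_frames = len(frames)
    dim = len(frames[0])
    # Mean of each coefficient across all frames of the utterance.
    mean = [sum(frame[d] for frame in frames) / n_frames for d in range(dim)]
    return [[frame[d] - mean[d] for d in range(dim)] for frame in frames]
```

A stationary channel adds a roughly constant offset in the cepstral domain, which is why removing the utterance-level mean reduces its effect.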

Speaker verification using sequence discriminant support vector machines

Wan

Renals

2005

IEEE Trans. Speech Audio Process.

167

“…all shouted segments, because shouting may indicate extra importance in some situations. To determine the vocal effort of a speaker, LPC (Linear Predictive Coding) analysis is performed using several different orders (13-19). The best fit to the incoming spectrum is then determined, and used for inverse-filtering.…”

Section: Vocal Effort Estimation (mentioning)

confidence: 99%
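
The quoted passage describes running LPC analysis at several orders, choosing the best fit, and inverse-filtering with it. The sketch below shows one way such a pipeline could look, using the autocorrelation method with the Levinson-Durbin recursion. The passage does not say how the "best fit" is determined, so the AIC-style score used here is purely an assumption, as are all function names and the absence of windowing or pre-emphasis.

```python
import math

def autocorrelation(signal, max_lag):
    """Biased, unnormalized autocorrelation for lags 0..max_lag."""
    n = len(signal)
    return [sum(signal[t] * signal[t - lag] for t in range(lag, n))
            for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve the LPC normal equations; returns (coeffs, residual_energy).

    coeffs[k-1] multiplies s[t-k] in the linear prediction of s[t].
    """
    a = [0.0] * (order + 1)
    error = r[0]
    for i in range(1, order + 1):
        if error <= 0.0:  # degenerate (e.g. silent) frame: stop early
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / error
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        error *= 1.0 - k * k
    return a[1:], error

def inverse_filter(signal, coeffs):
    """Prediction residual e[t] = s[t] - sum_k coeffs[k-1] * s[t-k]."""
    return [s - sum(c * signal[t - k]
                    for k, c in enumerate(coeffs, start=1) if t - k >= 0)
            for t, s in enumerate(signal)]

def select_lpc_order(signal, orders=range(13, 20)):
    """Pick the order with the lowest AIC-style score (assumed criterion)."""
    n = len(signal)
    r = autocorrelation(signal, max(orders))
    best = None
    for p in orders:
        coeffs, error = levinson_durbin(r, p)
        # Penalize higher orders, since residual energy never increases with p.
        score = n * math.log(max(error, 1e-300)) + 2 * p
        if best is None or score < best[0]:
            best = (score, p, coeffs)
    return best[1], best[2]
```

Some penalty on model order is needed in any such comparison, because residual energy alone would always favor the highest order tried.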

Automatic Detection, Indexing, and Retrieval of Multiple Attributes from Cross‐Lingual Multimedia Data

Goodman

Boykin

et al. 2012

Multimedia Information Extraction

“…There has been much significant progress made in speech summarization for English or Japanese text and audio sources (Hori and Furui, 2003; Inoue et al, 2004; Koumpis and Renals, 2005; Maskey and Hirschberg, 2003; Maskey and Hirschberg, 2005). Some research efforts have focused on summarizing Mandarin sources Huang et al, 2005), which are dependent on lexical features.…”

Section: Introduction (mentioning)

confidence: 99%

“…Some research efforts have focused on summarizing Mandarin sources Huang et al, 2005), which are dependent on lexical features. Considering the difficulty in obtaining high quality transcriptions, some researchers proposed speech summarization systems with non-lexical features (Inoue et al, 2004; Koumpis and Renals, 2005; Maskey and Hirschberg, 2003;. However, there does not exist any empirical study on speech summarization without lexical features for Mandarin Chinese sources.…”

Section: Introduction (mentioning)

confidence: 99%

Automatic acquisition of grammatical types for nouns

Bel

Espeja

Marimon

2007

Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics

The work we present here is concerned with the acquisition of deep grammatical information for nouns in Spanish. The aim is to build a learner that can handle noise, but, more interestingly, that is able to overcome the problem of sparse data, especially important in the case of nouns. We have based our work on two main points. Firstly, we have used distributional evidences as features. Secondly, we made the learner deal with all occurrences of a word as a single complex unit. The obtained results show that grammatical features of nouns is a level of generalization that can be successfully approached with a Decision Tree learner.
