2012 IEEE Spoken Language Technology Workshop (SLT)
DOI: 10.1109/slt.2012.6424226

Modeling multiword phrases with constrained phrase trees for improved topic modeling of conversational speech

Abstract: Latent topic modeling has proven to be an effective means for learning the underlying semantic content within document collections. Latent topic modeling has traditionally been applied to bag-of-words representations, which discard word-sequence information that can aid semantic understanding. In this work we introduce a method for efficiently incorporating arbitrarily long word sequences into a topic modeling approach. This method iteratively constructs a constrained set of phrase trees in an unsupervised fashion…
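The construction of the constrained phrase trees is not reproduced in this truncated abstract. As a rough, hedged illustration of the general idea (not the paper's exact algorithm), the sketch below iteratively merges frequent adjacent word pairs into single phrase tokens so that multiword phrases can enter an otherwise standard bag-of-words topic model; the thresholds, round count, and the optional gensim calls are assumptions.

```python
# A minimal sketch (not the paper's exact algorithm): iteratively merge frequent
# adjacent word pairs into single phrase tokens, then fit a standard bag-of-words
# topic model on the phrase-augmented documents. Parameter values are illustrative.
from collections import Counter

def merge_phrases(docs, min_count=3, rounds=2):
    """Greedily join frequent adjacent word pairs into 'w1_w2' tokens.

    Repeating the pass lets phrases grow beyond two words, loosely mirroring
    the iterative construction of longer word sequences described in the paper.
    """
    for _ in range(rounds):
        bigram_counts = Counter(
            (doc[i], doc[i + 1]) for doc in docs for i in range(len(doc) - 1)
        )
        keep = {bg for bg, c in bigram_counts.items() if c >= min_count}
        merged_docs = []
        for doc in docs:
            merged, i = [], 0
            while i < len(doc):
                if i + 1 < len(doc) and (doc[i], doc[i + 1]) in keep:
                    merged.append(doc[i] + "_" + doc[i + 1])
                    i += 2
                else:
                    merged.append(doc[i])
                    i += 1
            merged_docs.append(merged)
        docs = merged_docs
    return docs

# The phrase-merged documents can then be fed to any bag-of-words topic model
# (PLSA, LDA, ...), e.g. with gensim:
#   from gensim.corpora import Dictionary
#   from gensim.models import LdaModel
#   docs = merge_phrases(tokenized_docs)
#   dictionary = Dictionary(docs)
#   lda = LdaModel([dictionary.doc2bow(d) for d in docs],
#                  num_topics=10, id2word=dictionary)
```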

Cited by 4 publications (3 citation statements)
References 12 publications (6 reference statements)
“…Its value ranges between 0 and 1, with 1 representing a perfect mapping between the true topics and the latent topics. Table 3 shows the NMI scores for a uniform random assignment of documents to latent topics, the hard agglomerative clustering used for initialization, both latent models, and a phrase-based PLSA model applied to text transcripts of the data [10]. Both models do a surprisingly good job of learning latent topics with a strong mapping to the true topics, given the fully unsupervised nature of the system.…”
Section: Methods
confidence: 99%
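For reference, the normalized mutual information (NMI) score discussed in this passage can be computed between true and latent topic assignments as in the sketch below; the label arrays and the use of scikit-learn are illustrative, not taken from the cited work.

```python
# A minimal sketch of the NMI evaluation described above: compare true topic
# labels with latent-topic assignments. The label arrays here are illustrative.
from sklearn.metrics import normalized_mutual_info_score

true_topics   = [0, 0, 1, 1, 2, 2]   # ground-truth topic per document
latent_topics = [1, 1, 0, 0, 2, 2]   # most probable latent topic per document

# NMI is invariant to label permutation; a perfect one-to-one mapping scores 1.0.
nmi = normalized_mutual_info_score(true_topics, latent_topics)
print(f"NMI = {nmi:.3f}")  # 1.0 for this perfectly aligned example
```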
“…Make a set of all keywords for both RRL and LRL. 4. Train a Convolutional Neural Network (CNN) using audio waveforms in the RRL.…”
Section: Model Training
confidence: 99%
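The cited work's network architecture is not given here; the sketch below is only an assumed, minimal example of a 1D convolutional network trained directly on raw audio waveforms, with layer sizes, sample rate, and the number of keyword classes chosen for illustration.

```python
# A minimal sketch (architecture details are assumed, not from the cited work):
# a small 1D CNN that maps raw audio waveforms to keyword classes.
import torch
import torch.nn as nn

class WaveformCNN(nn.Module):
    def __init__(self, num_keywords: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=80, stride=4), nn.ReLU(), nn.MaxPool1d(4),
            nn.Conv1d(16, 32, kernel_size=3), nn.ReLU(), nn.MaxPool1d(4),
            nn.AdaptiveAvgPool1d(1),            # collapse the time axis
        )
        self.classifier = nn.Linear(32, num_keywords)

    def forward(self, waveform: torch.Tensor) -> torch.Tensor:
        # waveform: (batch, 1, num_samples), e.g. 1 s of 16 kHz audio
        return self.classifier(self.features(waveform).squeeze(-1))

model = WaveformCNN(num_keywords=10)
logits = model(torch.randn(8, 1, 16000))                 # dummy batch of one-second clips
loss = nn.CrossEntropyLoss()(logits, torch.randint(0, 10, (8,)))
loss.backward()                                          # one illustrative backward pass
```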
“…Topic detection is a heavily studied problem, including methods specialized for both text [2] and speech [3] sources. Topic detection and tracking from speech is most accurately performed when one can first perform automatic speech recognition (ASR), then apply text-oriented topic detection methods such as latent Dirichlet allocation [2] or partial semantic parse [4]. It has been demonstrated that ASR-based topic detection outperforms methods without transcription, even when the ASR output has a relatively high error rate [5,6].…”
Section: Introduction
confidence: 99%
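The ASR-then-topic-detection pipeline described in this passage can be sketched as below; `transcribe` is a hypothetical placeholder for any ASR system, and the gensim LDA parameters are illustrative rather than those of any cited system.

```python
# A minimal sketch of the ASR-then-topic-detection pipeline described above.
# `transcribe` is a placeholder for any ASR system; the gensim calls are
# standard LDA usage with illustrative parameter values.
from gensim.corpora import Dictionary
from gensim.models import LdaModel

def transcribe(audio_path: str) -> list[str]:
    """Placeholder ASR step: return a token list for one recording."""
    raise NotImplementedError("plug in an ASR system here")

def detect_topics(audio_paths, num_topics=10):
    docs = [transcribe(p) for p in audio_paths]          # 1) speech -> text
    dictionary = Dictionary(docs)                        # 2) bag-of-words vocabulary
    corpus = [dictionary.doc2bow(d) for d in docs]
    lda = LdaModel(corpus, num_topics=num_topics,        # 3) latent Dirichlet allocation
                   id2word=dictionary, passes=5)
    # Return the most probable latent topic per recording
    return [max(lda[bow], key=lambda t: t[1])[0] for bow in corpus]
```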