Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, 2015
DOI: 10.3115/v1/p15-2080
User Based Aggregation for Biterm Topic Model

Abstract: Biterm Topic Model (BTM) is designed to model the generative process of word co-occurrence patterns in short texts such as tweets. However, two aspects of BTM may restrict its performance: 1) user individualities are ignored in order to obtain corpus-level word co-occurrence patterns; and 2) the strong assumption that two co-occurring words will be assigned the same topic label cannot distinguish background words from topical words. In this paper, we propose the Twitter-BTM model to address these issues by con…
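The "biterms" the abstract refers to are unordered pairs of words co-occurring in the same short text. A minimal sketch of biterm extraction, assuming whitespace tokenization (the function name and preprocessing are illustrative, not the authors' code):

```python
from itertools import combinations

def extract_biterms(text):
    """Illustrative sketch: enumerate all unordered word pairs (biterms)
    in a short text, the units whose generation BTM models directly."""
    words = text.lower().split()
    # Every pair of token positions yields one biterm; sorting the pair
    # makes the biterm order-independent.
    return [tuple(sorted(pair)) for pair in combinations(words, 2)]

biterms = extract_biterms("topic models for short texts")
```

A 5-word text yields C(5, 2) = 10 biterms, which is why BTM recovers richer co-occurrence statistics from short documents than a per-document word-based model would.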

Cited by 30 publications (24 citation statements)
References 16 publications
“…BTM directly models the generation of biterms (pairs of words) in the whole corpus. However, the assumption that pairs of co-occurring words should be assigned to the same topic might be too strong (Chen et al., 2015).…”
Section: Related Work
confidence: 99%
“…Recent work on learning user representations with multitask deep learning techniques (Li et al., 2015) suggests that learning a nonlinear mapping from observed views to the latent space can learn high-quality user representations. One issue with GCCA is scalability: solving for G relies on an SVD of a large matrix that must be loaded into memory.…”
Section: Results
confidence: 99%
“…One related work was proposed by Zhang and Wang (2015), which employed bidirectional RNN to learn patterns of relations from raw text data. Although bidirectional RNN has access to both past and future context information, the range of context is limited due to the vanishing gradient problem.…”
Section: Related Work
confidence: 99%
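The vanishing-gradient limitation mentioned in the snippet above can be shown numerically: gradients backpropagated through a linear recurrence shrink geometrically when the recurrent weight's spectral radius is below 1. The weight matrix and horizon below are an assumed toy example, not taken from the cited paper:

```python
import numpy as np

# Recurrent weight with spectral radius 0.9 (< 1), so each backprop step
# scales the gradient by at most 0.9.
W = 0.9 * np.eye(4)
grad = np.ones(4)

norms = []
for t in range(50):  # backpropagate through 50 time steps
    grad = W.T @ grad
    norms.append(np.linalg.norm(grad))

# The gradient norm decays roughly as 0.9**t, so signals from distant
# time steps contribute almost nothing to the update.
```

This geometric decay is why a plain bidirectional RNN, despite seeing both directions, effectively uses only a limited context window.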