Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining 2019
DOI: 10.1145/3289600.3290957

Sparsemax and Relaxed Wasserstein for Topic Sparsity

Abstract: Topic sparsity refers to the observation that individual documents usually focus on a few salient topics rather than covering a wide variety of topics, and that a real topic adopts a narrow range of terms rather than a wide coverage of the vocabulary. Understanding topic sparsity is especially important for analyzing user-generated web content and social media, which typically take the form of extremely short posts and discussions. As topic sparsity of individual documents in online social media increases, so…

Cited by 27 publications (24 citation statements) · References 24 publications

“…Other than Dirichlet, [Miao et al, 2017] introduces a Gaussian softmax (GSM) function in the encoder: $q_z := \mathrm{softmax}(\mathcal{N}(\mu, \mathrm{diag}_K(\sigma^2)))$, and [Silveira et al, 2018] proposes a logistic-normal mixture distribution for the prior of z. To further enhance the sparsity in z, [Lin et al, 2019] replaces the softmax in GSM with the sparsemax function.…”
Section: Variants of Distributions (mentioning)
confidence: 99%
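As an illustration of the substitution described in the statement above, here is a minimal sketch (not the authors' implementation) of sparsemax (Martins and Astudillo, 2016) applied to a reparameterized Gaussian sample, which is the replacement for softmax in the GSM encoder; the function names and the NumPy formulation are assumptions for illustration only.

```python
import numpy as np

def sparsemax(z):
    # Sparsemax (Martins and Astudillo, 2016): Euclidean projection of z onto
    # the probability simplex; unlike softmax, it can return exact zeros.
    z = np.asarray(z, dtype=float)
    z_sorted = np.sort(z)[::-1]            # sort in decreasing order
    cumsum = np.cumsum(z_sorted)
    k = np.arange(1, z.size + 1)
    support = 1 + k * z_sorted > cumsum    # prefix of indices kept in the support
    k_max = k[support][-1]
    tau = (cumsum[support][-1] - 1.0) / k_max
    return np.maximum(z - tau, 0.0)

def gaussian_sparsemax(mu, log_var, rng=np.random.default_rng(0)):
    # GSM-style encoder output with sparsemax in place of softmax:
    # sample z ~ N(mu, diag(sigma^2)) via the reparameterization trick,
    # then map it to a sparse topic distribution.
    eps = rng.standard_normal(np.shape(mu))
    z = np.asarray(mu) + np.exp(0.5 * np.asarray(log_var)) * eps
    return sparsemax(z)
```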
“…It significantly improved the flexibility and efficiency of the original sparse topic model. Most recently, Lin et al [7] proposed Neural SparseMax Document and Topic Models, which utilized sparsemax to directly control the sparsity of the topics. It outperformed previous neural sparse topic methods such as [6] in quality and stability.…”
Section: Related Work (mentioning)
confidence: 99%
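To make "directly control the sparsity of the topics" concrete, here is a small comparison reusing the `sparsemax` sketch above (the logits are arbitrary illustrative values, not from the paper): softmax keeps every topic strictly positive, while sparsemax zeroes out low-scoring topics exactly.

```python
logits = np.array([1.2, 1.0, 0.1, -1.5, -3.0])

dense = np.exp(logits) / np.exp(logits).sum()   # softmax: all entries strictly positive
sparse = sparsemax(logits)                      # sparsemax: [0.6, 0.4, 0.0, 0.0, 0.0]

print("softmax  :", np.round(dense, 3))
print("sparsemax:", np.round(sparse, 3))
```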
“…There has been much research devoted to combining deep learning techniques with topic models. In these models [5,4,6,7,8,9], backpropagation is used to automatically update the parameters during training, since the probabilistic mixtures in the generative process of traditional topic models are replaced by deep neural networks. Thus, they can be derived easily and extended flexibly with neural network structures and backpropagation.…”
Section: Introduction (mentioning)
confidence: 99%
“…This ensures that each document covers limited topics (Chen et al, 2020b). The sparsity of topics encourages interpretable topics (Card et al, 2018), which corresponds with the intuition that a document usually focuses on several salient topics instead of covering a wide variety of topics (Lin et al, 2019). Although NMF has given sparseness to V_t, a more direct control over such properties of the representation is still needed (Hoyer, 2004).…”
Section: Objective Function (mentioning)
confidence: 99%
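The Hoyer (2004) reference in the statement above is the standard source for a quantitative sparseness measure on nonnegative factors; a minimal sketch of that measure follows. The function name is an assumption, and applying it column-wise to a factor matrix such as V_t is only one possible convention, not something stated in the quoted paper.

```python
import numpy as np

def hoyer_sparseness(x):
    # Hoyer (2004): sparseness(x) = (sqrt(n) - ||x||_1 / ||x||_2) / (sqrt(n) - 1),
    # where n is the dimension of x; 0 for a uniform vector, 1 for a single nonzero entry.
    x = np.asarray(x, dtype=float)
    n = x.size
    l1 = np.abs(x).sum()
    l2 = np.linalg.norm(x)
    return (np.sqrt(n) - l1 / l2) / (np.sqrt(n) - 1.0)
```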
“…Considering that it is important to discover discriminative topics, we also adopt the topic uniqueness (TU) score (Nan et al, 2019) to measure the diversity of topics. In addition, the sparsity scores of the document-topic distribution (TS-U) and the topic-word distribution (TS-V) proposed by Lin et al (2019) are further used to measure the topic sparsity quantitatively. Particularly, we use 1e-20 as the threshold to count the number of zero values in the document-topic and topic-word distributions.…”
Section: Evaluation Metrics (mentioning)
confidence: 99%
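A minimal sketch of how such a threshold-based sparsity score could be computed; only the 1e-20 threshold comes from the statement above, while the function name and the choice to report the fraction of near-zero entries are assumptions.

```python
import numpy as np

def threshold_sparsity(dist, threshold=1e-20):
    # Fraction of entries counted as zero in a document-topic (TS-U) or
    # topic-word (TS-V) distribution matrix, using the quoted 1e-20 threshold.
    dist = np.asarray(dist, dtype=float)
    return float((dist < threshold).sum()) / dist.size
```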