2020
DOI: 10.3390/e22040394

Analyzing the Influence of Hyper-parameters and Regularizers of Topic Modeling in Terms of Renyi Entropy

Abstract: Topic modeling is a popular technique for clustering large collections of text documents. A variety of different types of regularization is implemented in topic modeling. In this paper, we propose a novel approach for analyzing the influence of different regularization types on results of topic modeling. Based on Renyi entropy, this approach is inspired by the concepts from statistical physics, where an inferred topical structure of a collection can be considered an information statistical system residing in a…

Citations: cited by 12 publications (22 citation statements)
References: 25 publications
“…Therefore, we conclude that this model leads to distortions caused by its type of regularization. This echoes with work [15], where more types of regularization were studied, and where it was demonstrated that regularization can lead to distorted results. However, it is beyond the scope of this paper to study the influence of regularization on the Renyi entropy.…”
Section: Results | Citation type: supporting
confidence: 60%
“…Moreover, the renormalization procedure allows us to significantly speed up this search. However, the location of minimum Renyi entropy may significantly depend on the type of topic model, i.e., on the type of regularization used in the model [15], which causes difficulties when searching for the number of topics for unmarked datasets leading to the problem of choosing a topic model. In this subsection, we would like to demonstrate the influence of model type on the results of Renyi entropy approach and show how the renormalization procedure can be applied for quickly selecting the number of topics.…”
Section: Results | Citation type: mentioning
confidence: 99%