2020
DOI: 10.3390/e22050556

Renormalization Analysis of Topic Models

Abstract: In practice, to build a machine learning model of big data, one needs to tune model parameters. The process of parameter tuning involves an extremely time-consuming and computationally expensive grid search. However, the theory of statistical physics provides techniques that allow us to optimize this process. The paper shows that a function of the output of topic modeling demonstrates self-similar behavior under variation of the number of clusters. Such behavior allows the use of a renormalization technique. A combinati…
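The renormalization idea in the abstract amounts to coarse-graining an already fitted topic solution instead of re-fitting the model for every candidate number of topics. Below is a minimal Python sketch of one such coarse-graining step, assuming topics are merged by averaging their word distributions; the function name and the merge rule are illustrative assumptions, not taken verbatim from the paper.

```python
import numpy as np

def merge_topics(phi: np.ndarray, i: int, j: int) -> np.ndarray:
    """One coarse-graining (renormalization) step on a topic solution.

    phi : (W, T) matrix whose columns are the word distributions of T
          topics (each column sums to 1). Topics i < j are merged,
          giving a (W, T-1) matrix. Averaging the two columns keeps
          the merged column normalized.
    """
    assert i < j
    merged = (phi[:, i] + phi[:, j]) / 2.0  # sum of two distributions, renormalized
    phi = np.delete(phi, j, axis=1)         # drop topic j (returns a new array)
    phi[:, i] = merged                      # replace topic i with the merged topic
    return phi
```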

Cited by 6 publications (7 citation statements) · References 39 publications
“…Thus, the complex system's entropy differences can be measured to discover when an information maximum is reached. Koltcov's research considers entropy as negative information; thus, the maximum entropy corresponds to the minimum of information [14,15]. Therefore, the value of T corresponding to the smallest entropy value can be considered the "true number of topics," representing the maximum valid information generated by the topic model.…”
Section: Step 3: Rényi Entropy with Renormalization Analysis
confidence: 99%
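The selection rule the citing authors describe reduces to taking the argmin of the entropy curve over candidate topic numbers. A toy Python illustration, with a placeholder entropy curve standing in for values computed from actual topic solutions:

```python
import numpy as np

# Placeholder Renyi-entropy curve over candidate topic numbers T = 2..50;
# in practice each value would be computed from a fitted topic solution.
T_grid = np.arange(2, 51)
entropy = np.abs(np.log(T_grid / 15.0))  # toy curve with a minimum near T = 15

T_true = int(T_grid[np.argmin(entropy)])  # entropy minimum = estimated "true" T
print(T_true)  # -> 15
```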
“…Based on Koltcov's research [14], the Rényi entropy can be expressed as follows:

S_q^R = \frac{\ln(Z_q)}{q - 1},

where q = 1/T is called the deformation parameter and T is the number of topics. Z_q is the partition function of a topic solution, which is shown below:…”
Section: Step 3: Rényi Entropy with Renormalization Analysis
confidence: 99%
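A hedged Python sketch of this computation, following the definitions in Koltcov's Rényi-entropy papers as I read them: the energy is E = −ln(P̃), where P̃ is the average probability mass of words above the uniform level 1/W, the density of states is ρ = Ñ/(WT), and the partition function becomes Z_q = P̃^q · ρ. All variable names are mine, and the exact form of Z_q is an assumption.

```python
import numpy as np

def renyi_entropy(phi: np.ndarray) -> float:
    """Renyi entropy of a topic solution, per Koltcov-style definitions.

    phi : (W, T) topic-word matrix; column t is the word distribution
    of topic t. Words with probability above the uniform level 1/W
    are treated as informative.
    """
    W, T = phi.shape
    q = 1.0 / T                      # deformation parameter
    mask = phi > 1.0 / W             # above-uniform-threshold entries
    P_tilde = phi[mask].sum() / T    # average informative mass; energy E = -ln(P_tilde)
    rho = mask.sum() / (W * T)       # density-of-states estimate; S = ln(rho)
    Z_q = (P_tilde ** q) * rho       # partition function: exp(-q*E + S)  [assumption]
    return float(np.log(Z_q) / (q - 1.0))
```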
“…From a text mining perspective, topics in a text corpus can be viewed as probability distributions over the terms present in the corpus, or as clusters that assign weights to those terms [1,2]. The Latent Dirichlet Allocation (LDA) model uses a probabilistic generative model to produce topics [3,4]. The principle is that each document is assumed to be generated from multiple topics according to a random probability distribution, and each topic is in turn a random probability distribution over words.…”
Section: Introduction
confidence: 99%
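For concreteness, this is one standard way to fit such an LDA topic solution with scikit-learn; row-normalizing `components_` gives the topic-word distributions (the corpus here is a stand-in, and note the rows-are-topics orientation, the transpose of the (W, T) convention used in the sketches above).

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = [
    "cats and dogs are common pets",
    "the stock market fell on inflation news",
    "neural topic models cluster words into topics",
]  # stand-in corpus

X = CountVectorizer().fit_transform(docs)   # document-term counts
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)

# Topic-word distributions (rows are topics; transpose for a (W, T) phi).
phi = lda.components_ / lda.components_.sum(axis=1, keepdims=True)
theta = lda.transform(X)                    # document-topic distributions
```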