2016 IEEE Tenth International Conference on Semantic Computing (ICSC)
DOI: 10.1109/icsc.2016.34

Semantic Tagging Using Topic Models Exploiting Wikipedia Category Network

Abstract: In this paper we propose a probabilistic topic model that incorporates DBpedia knowledge for tagging Web pages and online documents with the topics discovered in them. Our method integrates the DBpedia hierarchical category network with statistical topic models, treating DBpedia categories as topics. We have conducted extensive experiments on two different datasets to demonstrate the effectiveness of our method.
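To make the category-as-topic idea concrete, the following is a minimal sketch, not the authors' implementation: the categories, vocabulary, documents, and Wikipedia-derived pseudo-counts are all invented for illustration. It shows an LDA-style collapsed Gibbs sampler in which each topic is a fixed DBpedia category and each category's word distribution is biased by Wikipedia-derived pseudo-counts.

import numpy as np

CATEGORIES = ["Sports", "Politics"]          # hypothetical DBpedia categories
VOCAB = ["match", "goal", "vote", "policy"]  # toy vocabulary
W, K = len(VOCAB), len(CATEGORIES)

# Hypothetical prior: pseudo-counts of each word under each category,
# e.g. derived from articles filed under that Wikipedia category.
wiki_prior = np.array([[5.0, 5.0, 0.1, 0.1],   # Sports
                       [0.1, 0.1, 5.0, 5.0]])  # Politics
alpha, beta = 0.1, 0.01                        # Dirichlet hyperparameters

docs = [[0, 1, 1, 0], [2, 3, 2, 3], [0, 2, 1, 3]]  # word ids per document

rng = np.random.default_rng(0)
z = [[int(rng.integers(K)) for _ in d] for d in docs]  # random init
ndk = np.zeros((len(docs), K))   # document-category counts
nkw = np.zeros((K, W))           # category-word counts
for d, doc in enumerate(docs):
    for i, w in enumerate(doc):
        ndk[d, z[d][i]] += 1
        nkw[z[d][i], w] += 1

for _ in range(200):             # collapsed Gibbs sampling
    for d, doc in enumerate(docs):
        for i, w in enumerate(doc):
            k = z[d][i]
            ndk[d, k] -= 1
            nkw[k, w] -= 1
            # The Wikipedia prior enters as extra pseudo-counts on nkw.
            p = (ndk[d] + alpha) * (nkw[:, w] + wiki_prior[:, w] + beta) \
                / (nkw.sum(axis=1) + wiki_prior.sum(axis=1) + W * beta)
            k = int(rng.choice(K, p=p / p.sum()))
            z[d][i] = k
            ndk[d, k] += 1
            nkw[k, w] += 1

for d in range(len(docs)):       # tag = most probable category per document
    print(f"doc {d}: top category = {CATEGORIES[int(ndk[d].argmax())]}")

The only change relative to plain LDA in this sketch is the wiki_prior term, which acts as asymmetric pseudo-counts on each category's word distribution; tagging a document then amounts to reading off its most probable categories.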

Cited by 14 publications (15 citation statements)
References 18 publications
“…In [24] the authors developed a correlated tag learning (CTL) model for semi-structured corpora, based on the topic model, to enable the construction of a correlation graph among tags via a logistic normal participation process. In [25] the authors proposed a probabilistic topic model that incorporates DBpedia knowledge for tagging web pages and online documents with the topics discovered in them. The model learns the probability distribution of each category over the words using statistical topic models, taking into account prior knowledge from Wikipedia about the words and their associated probabilities in the various categories [25].…”
Section: Latent Topic Decomposition
confidence: 99%
“…In [25] the authors proposed a probabilistic topic model that incorporates DBpedia knowledge for tagging web pages and online documents with the topics discovered in them. The model learns the probability distribution of each category over the words using statistical topic models, taking into account prior knowledge from Wikipedia about the words and their associated probabilities in the various categories [25]. In [26] the authors proposed Wikipedia-category-concept mention latent Dirichlet allocation (WCM-LDA), which not only models the relationship between words and topics but also uses the concept and category knowledge of entities to model the semantic relation between entities and topics.…”
Section: Latent Topic Decomposition
confidence: 99%
“…Secondly, as described above, the tags were a set of semantic topic distributions learned from plain text, so the correlations should be modeled at the semantic level; considering only co-occurrences was not enough. Allahyari et al. (2016) proposed a probabilistic topic model that incorporates DBpedia knowledge for tagging Web pages and online documents with the topics discovered in them.…”
Section: Tagging
confidence: 99%
“…They proposed a probabilistic topic model that incorporates DBpedia knowledge into the topic model for tagging Web pages and online documents with the topics discovered in them. They learn the probability distribution of each category over the words using statistical topic models, taking into account prior knowledge from Wikipedia about the words and their associated probabilities in the various categories [2]. There were two main limitations in this approach.…”
Section: Tagging
confidence: 99%