Proceedings of the 21st ACM International Conference on Information and Knowledge Management 2012
DOI: 10.1145/2396761.2396860
|View full text |Cite
|
Sign up to set email alerts
|

The generalized dirichlet distribution in enhanced topic detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
22
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 23 publications
(22 citation statements)
references
References 14 publications
0
22
0
Order By: Relevance
“…For this application, we fit a GD-LDA model developed by authors of [5] to extract the topics from the corpus set. This consists in all the processed text entry notes (noun phrases + terms) of all the patients.…”
Section: Topic Based Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…For this application, we fit a GD-LDA model developed by authors of [5] to extract the topics from the corpus set. This consists in all the processed text entry notes (noun phrases + terms) of all the patients.…”
Section: Topic Based Featuresmentioning
confidence: 99%
“…Each entry has an average length of 173 terms after constructing noun phrases, performing stemming, and removing stop words. We fit the GDLDA [5] model using all the text entries and K = [50, 75, 100] topics. Figure 3 shows some of the obtained topics and how these topics are aligned with symptoms and procedures for a particular disease.…”
Section: Experimental Settings and Numerical Feature Extractionmentioning
confidence: 99%
“…The interior nodes are distributions over topics called super-topics. Recently, in [6], the authors presented a new model to find correlation among topics in a corpus using the Generalized Dirichlet distribution model instead of the Dirichlet distribution.…”
Section: Related Workmentioning
confidence: 99%
“…Due to space constraint, we only show the graph obtained from our NTSeg model. Note that other models such as PAM, LDCC, LDSEG, GD-LDA [6] and CTM, only form unigrams in a topic leading to ambiguous interpretation. For example, presenting the unigram "confidence" will not be that insightful in a correlation graph.…”
Section: Correlation Graphmentioning
confidence: 99%
See 1 more Smart Citation