2021 IEEE International Conference on Data Mining (ICDM) 2021
DOI: 10.1109/icdm51629.2021.00017
|View full text |Cite
|
Sign up to set email alerts
|

Topic-Noise Models: Modeling Topic and Noise Distributions in Social Media Post Collections

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 19 publications
0
4
0
Order By: Relevance
“…Different strategies for mitigating noise have been proposed, including (1) modelling documents as a graph or network and removing noisy parts of the network before detecting topics and (2) adjusting generative probabilistic models to account for short, noisy text. A new approach that focuses on building topic‐noise models by approximating both the underlying topic and noise probability distributions is an important step for identifying topics in short, noisy posts (Churchill & Singh, 2021).…”
Section: Constructing Variables From Big Datamentioning
confidence: 99%
See 2 more Smart Citations
“…Different strategies for mitigating noise have been proposed, including (1) modelling documents as a graph or network and removing noisy parts of the network before detecting topics and (2) adjusting generative probabilistic models to account for short, noisy text. A new approach that focuses on building topic‐noise models by approximating both the underlying topic and noise probability distributions is an important step for identifying topics in short, noisy posts (Churchill & Singh, 2021).…”
Section: Constructing Variables From Big Datamentioning
confidence: 99%
“…Determining the topic(s) in an article or post is central to understanding how the dynamics of conversation about different topics are changing through time. While different approaches have been proposed for extracting topics 7. from the text (see surveys for more detail (Qiang et al, 2020;Churchill & Singh, in press), new methods that adequately handle the noise of social media text streams are still in their infancy (Churchill & Singh, 2020Dieng et al, 2019;Wang et al, 2018, Qiang et al, 2017. Different strategies for mitigating noise have been proposed, including (1) modelling documents as a graph or network and removing noisy parts of the network before detecting topics and (2) adjusting generative probabilistic models to account for short, noisy text.…”
Section: Cons Truc Ting Variab Le S From B Ig Datamentioning
confidence: 99%
See 1 more Smart Citation
“…This contrasts to our approach of using an unsupervised algorithm to generate comparisons between multiple topic models, although we do impose a binary classification of topics related to semantic vs logistical content. Churchill & Singh [31] consider topic models from the perspective of topic-noise, using pre-trained model in an ensemble with LDA to generate more diverse and coherent topics. This approach also uses the semantic information from word emebeddings but is specifically tuned to use topicnoise as a discriminator that can be used either during or after topic generation.…”
Section: Related Workmentioning
confidence: 99%