Twitter hashtags are typically used to categorize a tweet, to monitor ongoing conversations, and to facilitate accurate retrieval of posts. Hashtag hijacking occurs when a group of users starts using a trending hashtag to promote a topic that is substantially different from its recent context. Most of the prior research on hashtag hijacking has focused on manual monitoring of specific hashtags. We present a general framework based on multi-modal matrix factorization for automatically detecting hashtag hijacking from Twitter data, where the compromised hashtags and their underlying topics were unknown a priori.
CCS Concepts•Security and privacy → Social engineering attacks; Intrusion/anomaly detection and malware mitigation;
Past research has suggested an associative relationship between social media use and alcohol consumption, especially among the younger generations. The current study takes a generalizable approach to examining the prevalence of posting about alcohol on a popular social media platform, Twitter, as well as examining the predictors of a tweet’s virality. We content-analyzed more than 47.5 million tweets that were posted in March 2015 to explore the prevalence of alcohol-related references, and how alcohol-related references, tweet features (e.g., inclusion of hashtags, pictures, etc.), and user characteristics (e.g., number of followers) contribute to the tweet’s virality. Our findings showed that during March 2015, about two of every 100 tweets in the United States were alcohol-related; whereas the majority of those referenced intoxication. In addition to tweet features and user characteristics, the prevalence of alcohol-related references in a tweet positively predicted the number of likes it received, yet negatively predicted the number of retweets. Given prior evidence supporting the association between social media use and alcohol consumption, the prevalence of alcohol references in tweets and how that contributes to their virality offers insights into the widespread phenomenon of glorifying alcohol use and excessive drinking via social media, pointing to potential negative health consequences.
Patients with chronic health conditions use online health communities to seek support and information to help manage their condition. For clinically related topics, patients can benefit from getting opinions from clinical experts, and many are concerned about misinformation and biased information being spread online. However, a large volume of community posts makes it challenging for moderators and clinical experts, if there are any, to provide necessary information. Automatically identifying forum posts that need validated clinical resources can help online health communities efficiently manage content exchange. This automation can also assist patients in need of clinical expertise by getting proper help. We present our results on testing text classification models that efficiently and accurately identify community posts containing clinical topics. We annotated 1,817 posts comprised of 4,966 sentences of an existing online diabetes community. We found that our classifier performed the best (F-measure: 0.83, Precision: 0.79, Recall:0.86) when using Naïve Bayes algorithm, unigrams, bigrams, trigrams, and MetaMap Symantic Types. Training took 5 seconds. The classification process took a fraction of 1 second. We applied our classifier to another online diabetes community, and the results were: F-measure: 0.63, Precision: 0.57, Recall: 0.71. Our results show our model is feasible to scale to other forums on identifying posts containing clinical topic with common errors properly addressed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.