2019
DOI: 10.1177/0022042619833911
|View full text |Cite
|
Sign up to set email alerts
|

Choosing Your Platform for Social Media Drug Research and Improving Your Keyword Filter List

Abstract: Social media research often has two things in common: Twitter is the platform used and a keyword filter list is used to extract only relevant Tweets. Here we propose that (a) alternative platforms be considered more often when doing social media research, and (b) regardless of platform, researchers use word embeddings as a type of synonym discovery to improve their keyword filter list, both of which lead to more relevant data. We demonstrate the benefit of these proposals by comparing how successful our synony… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(14 citation statements)
references
References 17 publications
0
14
0
Order By: Relevance
“…In a nonmedical cannabis-related study, word embeddings created from Twitter and Reddit data sets discovered synonyms and slang terms that could not be identified using other means. The study recommends this method of synonym discovery in advance for any data collection based on keyword filtering [ 65 ].…”
Section: Resultsmentioning
confidence: 99%
“…In a nonmedical cannabis-related study, word embeddings created from Twitter and Reddit data sets discovered synonyms and slang terms that could not be identified using other means. The study recommends this method of synonym discovery in advance for any data collection based on keyword filtering [ 65 ].…”
Section: Resultsmentioning
confidence: 99%
“…Depending on the objective of the study, this could result in "contaminated" data that did not truly reflect the sentiments of the target study population, and could consequently lead to imprecise or incorrect conclusions. 20,21,39,40 Future studies may consider adopting the methods proposed by Kim et al 41 and Adams et al 42 on how to develop and iteratively refine search keywords (eg, through word embeddings) for retrieving the content of interest from social media platforms, and how to thoroughly evaluate the relevance, and comprehensiveness (if applicable), of the information retrieved using manually annotated data. Furthermore, few studies described how they handled special types of data such as images or videos, hashtags, emojis, and hyperlinks, which are commonly used in social media discourses and can in fact convey important information about the sentiments being expressed.…”
Section: Discussionmentioning
confidence: 99%
“…However, recently there has been a move towards a more data-driven means of iteratively identifying and evaluating keywords (and their associated synonyms), with word embeddings and other empirical synonym discovery methods (e.g. [108]). This shift towards a more principled method of selecting keywords for data sampling is to be welcomed.…”
Section: Discussionmentioning
confidence: 99%