2018
DOI: 10.1093/jamia/ocx093
|View full text |Cite
|
Sign up to set email alerts
|

Discovering foodborne illness in online restaurant reviews

Abstract: Our system has been instrumental in the identification of 10 outbreaks and 8523 complaints of foodborne illness associated with New York City restaurants since July 2012. Our evaluation has identified strong classifiers for both tasks, whose deployment will allow DOHMH epidemiologists to more effectively monitor Yelp for foodborne illness investigations.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

1
49
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 44 publications
(52 citation statements)
references
References 11 publications
1
49
0
Order By: Relevance
“…Popular applications of text classification in our daily life includes spam detection, opinion mining, and information filtering. In the food industry, for example, text classification has been used to identify unreported cases of foodborne illnesses from social media (Effland et al., ; Harris et al., ; Harrison et al., ; Sadilek et al., ). Text clustering: Text clustering is used for grouping objects (e.g., documents, paragraphs, sentences or terms) based on similarity between the objects (Aggarwal & Zhai, ). There are a variety of algorithms used for text clustering depending on how to calculate the similarity.…”
Section: Text Data Analysis Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Popular applications of text classification in our daily life includes spam detection, opinion mining, and information filtering. In the food industry, for example, text classification has been used to identify unreported cases of foodborne illnesses from social media (Effland et al., ; Harris et al., ; Harrison et al., ; Sadilek et al., ). Text clustering: Text clustering is used for grouping objects (e.g., documents, paragraphs, sentences or terms) based on similarity between the objects (Aggarwal & Zhai, ). There are a variety of algorithms used for text clustering depending on how to calculate the similarity.…”
Section: Text Data Analysis Methodsmentioning
confidence: 99%
“…Popular applications of text classification in our daily life includes spam detection, opinion mining, and information filtering. In the food industry, for example, text classification has been used to identify unreported cases of foodborne illnesses from social media (Effland et al, 2018;Harris et al, 2014;Harrison et al, 2014;Sadilek et al, 2016).…”
Section: Advanced Text Analysismentioning
confidence: 99%
“…This process of learning and intervening has already begun, utilizing social media. An innovative example of using big data analytics for targeted public health interventions was recently demonstrated by a team at Columbia University that developed a system that identifies foodborne illnesses in NYC restaurants by analyzing Yelp reviews . The system utilizes logistic regression trained with bias‐adjusted augmented data, which has identified 10 outbreaks and 8,253 complaints of foodborne illness associated with NYC restaurants since 2012.…”
Section: Three Major Challengesmentioning
confidence: 99%
“…An innovative example of using big data analytics for targeted public health interventions was recently demonstrated by a team at Columbia University that developed a system that identifies foodborne illnesses in NYC restaurants by analyzing Yelp reviews. 115 The system utilizes logistic regression trained with bias-adjusted augmented data, which has identified 10 outbreaks and 8,253 complaints of foodborne illness associated with NYC restaurants since 2012. Perhaps the best description of necessary pieces of a learning health care system was given by Dr. Friedman, who chairs the Department of Learning Health Sciences at the University of Michigan School of Medicine in Ann Harbor.…”
Section: Three Major Challengesmentioning
confidence: 99%
“…Public-private partnerships can encourage the use of advanced technology and have resulted in innovative approaches to capturing public health data. For example, the Centers for Disease Control and Prevention has recently partnered with Watson Health to explore ways to better manage data and respond to emergencies [23], and the NYC Health Department worked with Columbia University in using social media restaurant reviews to detect food poisoning outbreaks [24]. As such partnerships evolve, generating representative data that can inform public health agencies and policy-makers will be critical.…”
mentioning
confidence: 99%