2020
DOI: 10.1016/j.cities.2019.102410
|View full text |Cite
|
Sign up to set email alerts
|

Multi-class twitter data categorization and geocoding with a novel computing framework

Abstract: This study details the progress in transportation data analysis with a novel computing framework in keeping with the continuous evolution of the computing technology. The computing framework combines the Labelled Latent Dirichlet Allocation (L-LDA)-incorporated Support Vector Machine (SVM) classifier with the supporting computing strategy on publicly available Twitter data in determining transportation-related events to provide reliable information to travelers. The analytical approach includes analyzing tweet… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(3 citation statements)
references
References 26 publications
0
3
0
Order By: Relevance
“…The Twitter handles were removed from the collected dataset, making the data presented in this paper anonymous. Similar utilization of geotagged Twitter data has a strong precedent in emerging urban literature ( Córdoba et al, 2021 ; García-Palomares et al, 2018 ; Khan et al, 2020 ; Martín et al, 2019 ; Osorio-Arjona & García-Palomares, 2019 ). It must be noted that the study does not advocate that this sample is a homogeneous representation of all students; it might be subject to some biases but it remains within the specified criteria for the study.…”
Section: Methodsmentioning
confidence: 87%
“…The Twitter handles were removed from the collected dataset, making the data presented in this paper anonymous. Similar utilization of geotagged Twitter data has a strong precedent in emerging urban literature ( Córdoba et al, 2021 ; García-Palomares et al, 2018 ; Khan et al, 2020 ; Martín et al, 2019 ; Osorio-Arjona & García-Palomares, 2019 ). It must be noted that the study does not advocate that this sample is a homogeneous representation of all students; it might be subject to some biases but it remains within the specified criteria for the study.…”
Section: Methodsmentioning
confidence: 87%
“…Machine learning methods like support vector machines, logistic regression, and Naive Bayes were applied to study Twitter data for event detection around London [36]. A combination of Support vector machine classifier and Labeled Latent Dirichlet Allocation (L-LDA) was also used to analyze 700,010 tweets and extracted transportation-related information for New York City [37]. In [38], the authors proposed a unified statistical framework that combines the topic modeling based language model and hinge loss Markov random fields (HLMRFs) based model for the road traffic congestion monitoring using Twitter data.…”
Section: Approaches For the Selection Of Relevant Featuresmentioning
confidence: 99%
“…Based on the social media functions that previous research has described, this research will then conduct an in-depth analysis of the activities that occur on each account. Referring to what was conveyed byKhan et al (2020) regarding the opportunity for the presence of social media as an effective medium that can involve any and everyone, this study would provide practical advantages, namely going to contribute to the issue of using social media as a Covid-19 mitigation tool in terms of policy communication, and can give an overview of the extent of communication policies in handling Covid-19 have been developed in two cities, namely Samarinda City and Balikpapan City, through Twitter social media.…”
mentioning
confidence: 98%