2018
DOI: 10.1145/3156667
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting User and Venue Characteristics for Fine-Grained Tweet Geolocation

Abstract: Which venue is a tweet posted from? We call this a fine-grained geolocation problem. Given an observed tweet, the task is to infer its discrete posting venue, e.g., a specific restaurant. This recovers the venue context and differs from prior work, which geolocats tweets to location coordinates or cities/neighborhoods. First, we conduct empirical analysis to uncover venue and user characteristics for improving geolocation. For venues, we observe spatial homophily … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(20 citation statements)
references
References 34 publications
0
19
0
Order By: Relevance
“…Similar to the idea of local words, they prefer geospecific n-grams, i.e., those whose tweets are mostly located in a small eclipse on the map. Alternatively, Chong and Lim [77] apply a learning to rank method which encodes tweet content by a smoothed probability estimation that a word occurs at a venue. In their following work [78], word importance for different locations is distinguished.…”
Section: Word-or Location-centric Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Similar to the idea of local words, they prefer geospecific n-grams, i.e., those whose tweets are mostly located in a small eclipse on the map. Alternatively, Chong and Lim [77] apply a learning to rank method which encodes tweet content by a smoothed probability estimation that a word occurs at a venue. In their following work [78], word importance for different locations is distinguished.…”
Section: Word-or Location-centric Methodsmentioning
confidence: 99%
“…In experiments, they find that such a multi-indicator approach is more robust than single-indicator approaches, which is error-prone due to ambiguity. Chong and Lim [77] provide another angle to utilize the context information and observe that both venues' active time and users' visiting place histories could help on tweet location prediction. They investigate venues' active time and estimate the probability that a location is popular given a time by a smoothed kernel density estimation method.…”
Section: Inference Based On Tweet Contextsmentioning
confidence: 99%
“…Considering that the trackers can be devices that operate, specifically, in such a context, their sensors data can be integrated to those related to a group of entities in order to create functionalities aimed to specific groups of users. This is an approach that leads towards two interesting advantages: it is able to uncover implicit characteristics of the involved entities by following non canonical criteria [17,60]; each group of entities can be anonymously characterized on the basis of the sensors data of the entities that belong to it.…”
Section: Introductionmentioning
confidence: 99%
“…First, most of the previous studies on learning spatiotemporal embeddings neglect Non-GeoTagged Social Media (NGTSM) records, which is a large proportion of records compared to the GTSM records in social media 2 https://mktgathpu.wordpress.com/the-social-media-past-present-future 3 https://www.internetlivestats.com/twitter-statistics/ platforms. For instance, less than 5% of postings in Twitter are geotagged [10,11]. This percentage is expected to have a downward trend over the next few years as users become increasingly concerned about their privacy, which forces social media platforms to tighten their privacy agreements 4 .…”
Section: Introductionmentioning
confidence: 99%
“…Second, social media records generally come with the user indices, by which different users can be uniquely understood in an anonymous manner (i.e., a user index is a numerical identity given to each user as depicted in Table I and it does not reveal the actual identity of the user). Studies [10,12] report that there are spatially motivated user behaviors (e.g., spatially close users produce similar textual contents and users tend to visit venues that are near to each other), which are useful to understand the dynamics of spatiotemporal units. However, such user behaviors have not been exploited to learn representations for the spatiotemporal units.…”
Section: Introductionmentioning
confidence: 99%