Exploiting User and Venue Characteristics for Fine-Grained Tweet Geolocation

Chong, Wen-Haw; Lim, Ee-Peng

doi:10.1145/3156667

Cited by 22 publications

(20 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Similar to the idea of local words, they prefer geospecific n-grams, i.e., those whose tweets are mostly located in a small eclipse on the map. Alternatively, Chong and Lim [77] apply a learning to rank method which encodes tweet content by a smoothed probability estimation that a word occurs at a venue. In their following work [78], word importance for different locations is distinguished.…”

Section: Word-or Location-centric Methodsmentioning

confidence: 99%

“…In experiments, they find that such a multi-indicator approach is more robust than single-indicator approaches, which is error-prone due to ambiguity. Chong and Lim [77] provide another angle to utilize the context information and observe that both venues' active time and users' visiting place histories could help on tweet location prediction. They investigate venues' active time and estimate the probability that a location is popular given a time by a smoothed kernel density estimation method.…”

Section: Inference Based On Tweet Contextsmentioning

confidence: 99%

See 1 more Smart Citation

A Survey of Location Prediction on Twitter

Zheng

Han

Sun

2018

IEEE Trans. Knowl. Data Eng.

190

117

View full text Add to dashboard Cite

Locations, e.g., countries, states, cities, and point-of-interests, are central to news, emergency events, and people's daily lives. Automatic identification of locations associated with or mentioned in documents has been explored for decades. As one of the most popular online social network platforms, Twitter has attracted a large number of users who send millions of tweets on daily basis. Due to the world-wide coverage of its users and real-time freshness of tweets, location prediction on Twitter has gained significant attention in recent years. Research efforts are spent on dealing with new challenges and opportunities brought by the noisy, short, and context-rich nature of tweets. In this survey, we aim at offering an overall picture of location prediction on Twitter. Specifically, we concentrate on the prediction of user home locations, tweet locations, and mentioned locations. We first define the three tasks and review the evaluation metrics. By summarizing Twitter network, tweet content, and tweet context as potential inputs, we then structurally highlight how the problems depend on these inputs. Each dependency is illustrated by a comprehensive review of the corresponding strategies adopted in state-of-the-art approaches. In addition, we also briefly review two related problems, i.e., semantic location prediction and point-of-interest recommendation. Finally, we make a conclusion of the survey and list future research directions.

show abstract

Section: Word-or Location-centric Methodsmentioning

confidence: 99%

Section: Inference Based On Tweet Contextsmentioning

confidence: 99%

A Survey of Location Prediction on Twitter

Zheng

Han

Sun

2018

IEEE Trans. Knowl. Data Eng.

190

117

View full text Add to dashboard Cite

show abstract

“…Considering that the trackers can be devices that operate, specifically, in such a context, their sensors data can be integrated to those related to a group of entities in order to create functionalities aimed to specific groups of users. This is an approach that leads towards two interesting advantages: it is able to uncover implicit characteristics of the involved entities by following non canonical criteria [17,60]; each group of entities can be anonymously characterized on the basis of the sensors data of the entities that belong to it.…”

Section: Introductionmentioning

confidence: 99%

Internet of Entities (IoE): A Blockchain-based Distributed Paradigm for Data Exchange between Wireless-based Devices

Saia

Carta

Recupero

et al. 2019

Proceedings of the 8th International Conference on Sensor Networks

View full text Add to dashboard Cite

The exponential growth of wireless-based solutions, such as those related to the mobile smart devices (e.g., smart-phones and tablets) and Internet of Things (IoT) devices, has lead to countless advantages in every area of our society. Such a scenario has transformed the world a few decades back, dominated by latency, into a new world based on an efficient real-time interaction paradigm. Recently, cryptocurrency have contributed to this technological revolution, the fulcrum of which are a decentralization model and a certification function offered by the socalled blockchain infrastructure, which make it possible to certify the financial transactions, anonymously. However, it should be observed how this challenging scenario has generated new security problems directly related to the involved new technologies (e.g., e-commerce frauds, mobile bot-net attacks, blockchain DoS attacks, cryptocurrency scams, etc.). In this context, we can acknowledge that the scientific community efforts are usually oriented toward specific solutions, instead to exploit all the available technologies, synergistically, in order to define more efficient security paradigms. This paper aims to indicate a possible approach able to improve the security of people and things by introducing a novel blockchain-based distributed paradigm to security defined Internet of Entities (IoE). It represents an effective mechanism for the localization of people and things, which exploits both the huge number of existing wireless-based devices and the blockchain-based distributed ledger technology, overcoming the limits of traditional localization approaches, but without jeopardizing the user privacy. Its operation is based on two core elements with interchangeable roles, entities and trackers, which can be very common elements such as smart-phones, tablets, and IoT devices, and its implementation requires minimal efforts thanks to the existing infrastructures and devices. The possibility of including further information to those of localization, such as those generated by device sensors, gives rise to a novel and widely exploitable data environment, whose applications can be extended to contexts different from that of the localization of people and things, e.g., eHealth, Smart Cities, and so on.

show abstract

“…First, most of the previous studies on learning spatiotemporal embeddings neglect Non-GeoTagged Social Media (NGTSM) records, which is a large proportion of records compared to the GTSM records in social media 2 https://mktgathpu.wordpress.com/the-social-media-past-present-future 3 https://www.internetlivestats.com/twitter-statistics/ platforms. For instance, less than 5% of postings in Twitter are geotagged [10,11]. This percentage is expected to have a downward trend over the next few years as users become increasingly concerned about their privacy, which forces social media platforms to tighten their privacy agreements 4 .…”

Section: Introductionmentioning

confidence: 99%

“…Second, social media records generally come with the user indices, by which different users can be uniquely understood in an anonymous manner (i.e., a user index is a numerical identity given to each user as depicted in Table I and it does not reveal the actual identity of the user). Studies [10,12] report that there are spatially motivated user behaviors (e.g., spatially close users produce similar textual contents and users tend to visit venues that are near to each other), which are useful to understand the dynamics of spatiotemporal units. However, such user behaviors have not been exploited to learn representations for the spatiotemporal units.…”

Section: Introductionmentioning

confidence: 99%

USTAR: Online Multimodal Embedding for Modeling User-Guided Spatiotemporal Activity

Silva

Karunasekera

Leckie

et al. 2019

2019 IEEE International Conference on Big Data (Big Data)

View full text Add to dashboard Cite

Building spatiotemporal activity models for people's activities in urban spaces is important for understanding the everincreasing complexity of urban dynamics. With the emergence of Geo-Tagged Social Media (GTSM) records, previous studies demonstrate the potential of GTSM records for spatiotemporal activity modeling. State-of-the-art methods for this task embed different modalities (location, time, and text) of GTSM records into a single embedding space. However, they ignore Non-GeoTagged Social Media (NGTSM) records, which generally account for the majority of posts (e.g., more than 95% in Twitter), and could represent a great source of information to alleviate the sparsity of GTSM records. Furthermore, in the current spatiotemporal embedding techniques, less focus has been given to the users, who exhibit spatially motivated behaviors. To bridge this research gap, this work proposes USTAR, a novel online learning method for User-guided SpatioTemporal Activity Representation, which (1) embeds locations, time, and text along with users into the same embedding space to capture their correlations; (2) uses a novel collaborative filtering approach based on two different empirically studied user behaviors to incorporate both NGTSM and GTSM records in learning; and (3) introduces a novel sampling technique to learn spatiotemporal representations in an online fashion to accommodate recent information into the embedding space, while avoiding overfitting to recent records and frequently appearing units in social media streams. Our results show that USTAR substantially improves the state-of-the-art for region retrieval and keyword retrieval and its potential to be applied to other downstream applications such as local event detection.

show abstract

Exploiting User and Venue Characteristics for Fine-Grained Tweet Geolocation

Cited by 22 publications

References 34 publications

A Survey of Location Prediction on Twitter

A Survey of Location Prediction on Twitter

Internet of Entities (IoE): A Blockchain-based Distributed Paradigm for Data Exchange between Wireless-based Devices

USTAR: Online Multimodal Embedding for Modeling User-Guided Spatiotemporal Activity

Contact Info

Product

Resources

About