2017
DOI: 10.1186/s13673-016-0084-z
|View full text |Cite
|
Sign up to set email alerts
|

Activity-based Twitter sampling for content-based and user-centric prediction models

Abstract: BackgroundTwitter's public and open nature provides great opportunities for its users to actively participate in sharing their opinions and produce high quality content that is reflective of their tendencies and preferences in their day-to-day life [1]. This vast amount of publicly available user-generated content is applied to many applications ranging from tracking human social behavior [2][3][4], detecting events of interest [5][6][7], to smart business [8] where domain knowledge is collected through social… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 28 publications
0
4
0
Order By: Relevance
“…Tweepy is a Python library for accessing the standard real-time streaming Twitter API, 2 which allows to freely retrieve tweets that match a given query. If the query is too broad that it includes over 1% of the total number of tweets posted at that time worldwide, the query's response is sampled (Aghababaei and Makrehchi, 2017;Morstatter et al, 2014). The way in which Twitter samples the data is unpublished.…”
Section: Data Collectionmentioning
confidence: 99%
“…Tweepy is a Python library for accessing the standard real-time streaming Twitter API, 2 which allows to freely retrieve tweets that match a given query. If the query is too broad that it includes over 1% of the total number of tweets posted at that time worldwide, the query's response is sampled (Aghababaei and Makrehchi, 2017;Morstatter et al, 2014). The way in which Twitter samples the data is unpublished.…”
Section: Data Collectionmentioning
confidence: 99%
“…Feature-based approaches [2,3,[11][12][13][14][15] make the connection between the prediction and various types of hand-crafted features that are extracted from the information cascade, including the structural features of the social network, content features, temporal features, and user features. To predict the popularity of news articles in Yahoo News, Arapakis et al [16] used 10 different features that they extracted from the content of the news articles as well as external sources.…”
Section: Cascade Predictionmentioning
confidence: 99%
“…Having obtained the attended whole structure embedding ṡ and temporal embedding ḣ , we can feed these two embeddings into the inter-gate mechanism to effectively combine these two factors. The proposed inter-gate mechanism can capture the different (11) Fig. 3 Architecture of the Intra-attention Mechanism w.r.t.…”
Section: Inter-gate Mechanismmentioning
confidence: 99%
“…First, collecting data on all possible Twitter accounts (or any other social media accounts) poses prohibitive storage and bandwidth costs, even absent limitations imposed by the service provider; thus, selection of accounts (including the special case in which a census of a focal account set is attempted) is inevitable. Depending on the method employed (see Aghababaei and Makrehchi 2017), and the manner in which it is implemented, all accounts of interest may not be identified at the same time. This may lead to missingness during the initial observation period.…”
Section: Introductionmentioning
confidence: 99%