Representing the Twittersphere: Archiving a representative sample of Twitter data under resource constraints

Hino, Airo; Fahey, Robert A.

doi:10.1016/j.ijinfomgt.2019.01.019

Cited by 39 publications

(28 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Twitter also only provides access to approximately 1% of all the data sent on Twitter at that moment. Moreover, tweets older than 7 days cannot be retrieved by Search API (Hino and Fahey, 2019). To address such limitations as much as possible, we tried to collect live data in a longer time period (25 days) with more hashtags and keywords (n ¼ 94) using REST API.…”

Section: Data Collection and Preparationmentioning

confidence: 99%

Mapping Persian Twitter: Networks and mechanism of political communication in Iranian 2017 presidential election

Kermani

Adham

2021

Big Data & Society

View full text Add to dashboard Cite

This paper investigates the structure of networked publics and their sharing practices in Persian Twitter during a period surrounding Iran’s 2017 presidential election. Building on networked gatekeeping and framing theories, we used a mixed methodological approach to analyze a dataset of 2,596,284 Persian tweets. Results revealed that Twitter provided a space for Iranians to discuss public topics. However, this space is not necessarily used by voiceless and marginalized groups; and the uses are not limited to discussing controversial issues. The growing body of conservative crowdsourced elites emerged to defend the regime’s ideology. Moreover, the dominant networked frames were shaped around normal and routine subjects in an election time. Thus, Twitter was not a platform for only seeking liberal demands. It was to some extent used to serve the regime’s political interests. Furthermore, while many ordinary users rose to prominence, mainstream media continued to act as powerful players. This study contributes to the existing literature into networked practices, digital democracy, and citizen journalism; particularly in restrictive contexts.

show abstract

Section: Data Collection and Preparationmentioning

confidence: 99%

Mapping Persian Twitter: Networks and mechanism of political communication in Iranian 2017 presidential election

Kermani

Adham

2021

Big Data & Society

View full text Add to dashboard Cite

show abstract

“…Twitter's own API is the most potent available tool for collecting data generated through the interaction of Twitter users. Representing different demographic categories, Twitter data is a diverse and salient data source for researchers [21], [22] and policymakers [23].…”

Section: Introductionmentioning

confidence: 99%

Twitter and Research: A Systematic Literature Review Through Text Mining

et al. 2020

View full text Add to dashboard Cite

Researchers have collected Twitter data to study a wide range of topics. This growing body of literature, however, has not yet been reviewed systematically to synthesize Twitter-related papers. The existing literature review papers have been limited by constraints of traditional methods to manually select and analyze samples of topically related papers. The goals of this retrospective study are to identify dominant topics of Twitter-based research, summarize the temporal trend of topics, and interpret the evolution of topics withing the last ten years. This study systematically mines a large number of Twitter-based studies to characterize the relevant literature by an efficient and effective approach. This study collected relevant papers from three databases and applied text mining and trend analysis to detect semantic patterns and explore the yearly development of research themes across a decade. We found 38 topics in more than 18,000 manuscripts published between 2006 and 2019. By quantifying temporal trends, this study found that while 23.7% of topics did not show a significant trend (P => 0.05), 21% of topics had increasing trends and 55.3% of topics had decreasing trends that these hot and cold topics represent three categories: application, methodology, and technology. The contributions of this paper can be utilized in the growing field of Twitter-based research and are beneficial to researchers, educators, and publishers. INDEX TERMS Literature review, social media, survey, text mining, topic modeling, Twitter.

show abstract

“…Kinder-Kurlanda et al., 2017). Publicly posted tweets are collected in a continuous loop using the Twitter ‘statuses/user_timeline’ API to retrieve recent tweets for each account, thereby avoiding the sampling issues of other endpoints (Hino and Fahey, 2019). 2 Because of the large number of accounts under consideration, this approach leads to tweet volumes of 600,000–800,000 tweets per day.…”

Section: Methodsmentioning

confidence: 99%

Analysing discourse around COVID-19 in the Australian Twittersphere: A real-time corpus-based analysis

2021

View full text Add to dashboard Cite

Public discourse about the COVID-19 that appears on Twitter and other social media platforms provides useful insights into public concerns and responses to the pandemic. However, acknowledging that public discourse around COVID-19 is multi-faceted and evolves over time poses both analytical and ontological challenges. Studies that use text-mining approaches to analyse responses to major events commonly treat public discourse on social media as an undifferentiated whole, without systematically examining the extent to which that discourse consists of distinct sub-discourses or which phases characterize its development. They also confound structured behavioural data (i.e., tagging) with unstructured user-generated data (i.e., content of tweets) in their sampling methods. The present study aims to demonstrate how one might go about addressing both of these sets of challenges by combining corpus linguistic methods with a data-driven text-mining approach to gain a better understanding of how the public discourse around COVID-19 developed over time and what topics combine to form this discourse in the Australian Twittersphere over a period of nearly four months. By combining text mining and corpus linguistics, this study exemplifies how both approaches can complement each other productively.

show abstract

Representing the Twittersphere: Archiving a representative sample of Twitter data under resource constraints

Cited by 39 publications

References 18 publications

Mapping Persian Twitter: Networks and mechanism of political communication in Iranian 2017 presidential election

Mapping Persian Twitter: Networks and mechanism of political communication in Iranian 2017 presidential election

Twitter and Research: A Systematic Literature Review Through Text Mining

Analysing discourse around COVID-19 in the Australian Twittersphere: A real-time corpus-based analysis

Contact Info

Product

Resources

About