As Twitter becomes a more common means for officials to communicate with their constituents, it becomes more important that we understand how officials use these communication tools. Using data from 380 members of Congress' Twitter activity during the winter of 2012, we find that officials frequently use Twitter to advertise their political positions and to provide information but rarely to request political action from their constituents or to recognize the good work of others. We highlight a number of differences in communication frequency between men and women, Senators and Representatives, Republicans and Democrats. We provide groundwork for future research examining the behavior of public officials online and testing the predictive power of officials' social media behavior.
Online abusive behavior affects millions and the NLP community has attempted to mitigate this problem by developing technologies to detect abuse. However, current methods have largely focused on a narrow definition of abuse to detriment of victims who seek both validation and solutions. In this position paper, we argue that the community needs to make three substantive changes: (1) expanding our scope of problems to tackle both more subtle and more serious forms of abuse, (2) developing proactive technologies that counter or inhibit abuse before it harms, and (3) reframing our effort within a framework of justice to promote healthy communities.
The conventional understanding of how elected officials affect the policy agenda is based on the argument that they use symbols and rhetoric to propagate a policy problem, primarily through the traditional media. The arguments presented in this article are largely consistent with this claim but account for the function of social media. More specifically, and framed by indexing theory, we argue that social media enhances opportunities for policy agenda builders in the U.S. Congress to share information with journalists. Across the key policy issues of 2013, tests for congruence between politicians' Twitter posts and New York Times articles confirm a connection, particularly for the policy issue areas of the economy, immigration, health care, and marginalized groups. Simultaneous discussion and debate between Democrats and Republicans about a particular policy issue area, however, negatively impact how the New York Times indexes a particular issue.
Understanding how Members of Congress (MCs) distribute their political attention is key to a number of areas of political science research including agenda setting, framing, and issue evolution. Tweets illuminate what lawmakers are paying attention to by aggregating information from newsletters, press releases, and floor debates to provide a birds-eye view of a lawmaker's diverse agenda. In order to leverage this data efficiently, we trained a supervised machine learning classifier to label tweets according to the Comparative Agenda Project's Policy Codebook and used the results to examine the differential attention that policy topics receive from MCs. The classifier achieved an F1 score of 0.79 and a Cohen's kappa with human labelers of 0.78, suggesting good performance. Using this classifier, we labeled 1,485,834 original MC tweets (Retweets were excluded) and conducted a multinomial logistic regression to understand what influenced the policy areas MCs Tweeted about. Our model reveals differences in political attention along party, chamber, and gender lines and their interactions. Our approach allows us to study MCs' political attention in near real-time and to uncover both intra-and inter-group differences.
Social media data (SMD) offer researchers new opportunities to leverage those data for their work in broad areas such as public opinion, digital culture, labor trends, and public health. The success of efforts to save SMD for reuse by researchers will depend on aligning data management and archiving practices with evolving norms around the capture, use, sharing, and security of datasets. This paper presents an initial foray into understanding how established practices for managing and preserving data should adapt to demands from researchers who use and reuse SMD, and from people who are subjects in SMD. We examine the data management practices of researchers who use SMD through a survey, and we analyze published articles that used data from Twitter. We discuss how researchers describe their data management practices and how these practices may differ from the management of conventional data types. We explore conceptual, technical, and ethical challenges for data archives based on the similarities and differences between SMD and other types of research data, focusing on the social sciences. Finally, we suggest areas where archives may need to revise policies, practices, and services in order to create secure, persistent, and usable collections of SMD.
There is evidence that Russia's Internet Research Agency attempted to interfere with the 2016 U.S. election by running fake accounts on Twitter-often referred to as "Russian trolls". In this work, we: 1) develop machine learning models that predict whether a Twitter account is a Russian troll within a set of 170K control accounts; and, 2) demonstrate that it is possible to use this model to find active accounts on Twitter still likely acting on behalf of the Russian state. Using both behavioral and linguistic features, we show that it is possible to distinguish between a troll and a non-troll with a precision of 78.5% and an AUC of 98.9%, under cross-validation. Applying the model to out-of-sample accounts still active today, we find that up to 2.6% of top journalists' mentions are occupied by Russian trolls. These findings imply that the Russian trolls are very likely still active today. Additional analysis shows that they are not merely software-controlled bots, and manage their online identities in various complex ways. Finally, we argue that if it is possible to discover these accounts using externally-accessible data, then the platformswith access to a variety of private internal signals-should succeed at similar or better rates.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.