Abstract:Every person who sends email, text messages, tweets, or simply surfs the Web leaves a digital trace. Researchers are just starting to comprehend the possibilities of “big data” for creating a new picture of social behavior. The potential for innovative work on social and cultural topics far outstrips current data collection and analysis techniques for a variety of reasons, including researchers' lack of access to corporate data sets, technical skills, and analytical lenses. This article draws a distinction bet… Show more
“…Another type of popular WWW imageries is based on the coverage of usually US‐based IT giants such as Facebook and Twitter, which veils the political and economic specificities of the particular online applications (see Menchen‐Trevino, ).…”
mentioning
confidence: 99%
“…This is consistent with the use of the term in the literature (e.g., DeNardis, 2012;Dodge, 2008). 2 Another type of popular WWW imageries is based on the coverage of usually US-based IT giants such as Facebook and Twitter, which veils the political and economic specificities of the particular online applications (see Menchen-Trevino, 2013). 3 Additionally, hyperlinks may contain erroneous links, irrelevant information, and inconsequential relationships (Weber & Monge, 2011).…”
“…Another type of popular WWW imageries is based on the coverage of usually US‐based IT giants such as Facebook and Twitter, which veils the political and economic specificities of the particular online applications (see Menchen‐Trevino, ).…”
mentioning
confidence: 99%
“…This is consistent with the use of the term in the literature (e.g., DeNardis, 2012;Dodge, 2008). 2 Another type of popular WWW imageries is based on the coverage of usually US-based IT giants such as Facebook and Twitter, which veils the political and economic specificities of the particular online applications (see Menchen-Trevino, 2013). 3 Additionally, hyperlinks may contain erroneous links, irrelevant information, and inconsequential relationships (Weber & Monge, 2011).…”
“…Perhaps I should have added surveys to the digital data, as some researchers have suggested (Stier, Breuer, Siegers, & Thorson, 2019). Perhaps I was too narrow in my focus, looking only at the fans or users as they participated on Reddit but missing their activity on other platforms (Menchen-Trevino, 2013). Researchers have noted how Skyrim (Bethesda Softworks, 2011) fans (Puente & Tosca, 2013) and people in general (Baym, 2007) use more than one online space for their online activity.…”
Section: Problems and Solutionsmentioning
confidence: 99%
“…The advanced questions we can ask now, building on decades of research and methods, call for advanced methods and understanding. Advanced methods require more than just large amounts of raw computing power; they call for both quantitative and qualitative methodological approaches (Menchen-Trevino, 2013), a broad understanding of theories, topical expertise, and computational skills. In short, CSS done well requires teams (Lazer et al, 2009).…”
The questions we can ask currently, building on decades of research, call for advanced methods and understanding. We now have large, complex data sets that require more than complex statistical analysis to yield human answers. Yet as some researchers have pointed out, we also have challenges, especially in computational social science. In a recent project I faced several such challenges and eventually realized that the relevant issues were familiar to users of free and open-source software. I needed a team with diverse skills and knowledge to tackle methods, theories, and topics. We needed to iterate over the entire project: from the initial theories to the data to the methods to the results. We had to understand how to work when some data was freely available but other data that might benefit the research was not. More broadly, computational social scientists may need creative solutions to slippery problems, such as restrictions imposed by terms of service for sites from which we wish to gather data. Are these terms legal, are they enforced, or do our institutional review boards care? Lastly—perhaps most importantly and dauntingly—we may need to challenge laws relating to digital data and access, although so far this conflict has been rare. Can we succeed as open-source advocates have?
“…Among the prom is ing new sources are dig i tal trace data (Deville et al 2014;FriasMartinez et al 2012;Hawelka et al 2014). Generated as a byproduct of ev ery day in for ma tion tech nol ogy use, dig i tal trace data con sist of in di vid u allevel re cords of dig i tal be hav ior, which may in clude in for ma tion on a per son's phys i cal lo ca tion (MenchenTrevino 2013). With the global pro lif er a tion of dig i tal tech nol o gy, dig i tal trace data are in creas ingly com mon and are avail able in a wide range of forms that are po ten tially use ful to mi gra tion schol ars, such as metadata as so ci ated with cel lu lar calls and texts, GPS in for ma tion cap tured pas sively by smartphone ap pli ca tions, and geotags posted to so cial me dia or other lo ca tionbased so cial net works (LBSNs) (Girardin et al 2008).…”
Georeferenced digital trace data offer unprecedented flexibility in migration estimation. Because of their high temporal granularity, many migration estimates can be generated from the same data set by changing the definition parameters. Yet despite the growing application of digital trace data to migration research, strategies for taking advantage of their temporal granularity remain largely underdeveloped. In this paper, we provide a general framework for converting digital trace data into estimates of migration transitions and for systematically analyzing their variation along a quasi-continuous time scale, analogous to a survival function. From migration theory, we develop two simple hypotheses regarding how we expect our estimated migration transition functions to behave. We then test our hypotheses on simulated data and empirical data from three platforms in two internal migration contexts: geotagged Tweets and Gowalla check-ins in the United States, and cell-phone call detail records in Senegal. Our results demonstrate the need for evaluating the internal consistency of migration estimates derived from digital trace data before using them in substantive research. At the same time, however, common patterns across our three empirical data sets point to an emergent research agenda using digital trace data to study the specific functional relationship between estimates of migration and time and how this relationship varies by geography and population characteristics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.