2014
DOI: 10.1177/2329496514524543
|View full text |Cite
|
Sign up to set email alerts
|

From Words to Networks and Back

Abstract: Digital text has revolutionized how we consume and produce information, and also provides seemingly limitless sources of data from Twitter feeds to online historical archives. Such new data challenge traditional boundaries between quantitative and qualitative research, and exciting horizons have emerged. New analytic approaches are warranted, however, given the typically unstructured, respondent-generated format of such data. In this article, I examine how sociologists have handled text data prior to digitizat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 22 publications
(12 citation statements)
references
References 53 publications
(69 reference statements)
0
10
0
Order By: Relevance
“…• In [40] the analysed data consists of the corpus of United States' presidential Inaugural Addresses from 1789-2009. Also in this case, the dataset is a sensible subset of our corpus.…”
Section: Statistical Indicatormentioning
confidence: 99%
“…• In [40] the analysed data consists of the corpus of United States' presidential Inaugural Addresses from 1789-2009. Also in this case, the dataset is a sensible subset of our corpus.…”
Section: Statistical Indicatormentioning
confidence: 99%
“…In the context of the US Presidents' communications studies, there are not many works devoted to the analysis of such a wide Presidents' speeches corpus with a combination of text mining and network analysis approaches strictly comparable to our methodological combination. For example, in Light (2014), the author combined text mining and network analysis to analyse the Presidents' Inaugural Addresses, but he has used the Stanford POS-Tagger and a different similarity measure. Hence, we use some excellent studies as reference for validating our approach.…”
Section: Recent Contributions Likementioning
confidence: 99%
“…Inaugurals in the same cluster are also part of the same time interval, which is something we also observe as a strong temporal bias in the data. Light (2014) uses part-of-speech tagging (PoS) to develop a sophisticated cluster network analysis of the Inaugurals. He creates a "co-word network for the inaugural addresses" using the "147 most prominent (…) words".…”
Section: From Periodisation To Timelinesmentioning
confidence: 99%
“…Again, length normalisation is applied. We employ Pearson, as it has also been used in (Light 2014) to analyse the Inaugurals. For all words in two documents x and y, the Pearson correlation divides the sum of their frequencies in x and y minus their respective mean frequencies m(x) and m(y) by the square roots of the squares of (xm(x)) and (y -m(y)): .…”
Section: From Periodisation To Timelinesmentioning
confidence: 99%