2018
DOI: 10.21105/joss.00774

quanteda: An R package for the quantitative analysis of textual data

Abstract: quanteda is an R package providing a comprehensive workflow and toolkit for natural language processing tasks such as corpus management, tokenization, analysis, and visualization. It has extensive functions for applying dictionary analysis, exploring texts using keywords-in-context, computing document and feature similarities, and discovering multi-word expressions through collocation scoring. Based entirely on sparse operations, it provides highly efficient methods for compiling document-feature matrices and …

Cited by 933 publications (653 citation statements)
References 11 publications
“…Shock includes words and phrases associated with the crisis itself (financial crisis, crisis lessons, sovereign debt, and subprime). Core includes those that capture the key elements of the original monetary policy paradigm 4 We use software by Benoit et al (2018) to remove stop words, numbers, and punctions; and to remove inflections from words in order to reduce them to their roots. We use software by Roberts et al (2018) to estimate a Correlated Topic Model (Blei and Lafferty 2007).…”
Section: Methods (mentioning)
confidence: 99%
“…The full list of removed tokens/features is available upon request. All preprocessing steps along with subsequent analyses were performed in R/R Studio with the quanteda package (Benoit et al, 2018).…”
Section: Appendix A: Methods Used For Data Collection and Analysis (mentioning)
confidence: 99%
“…Note: Sources: General textual network analysis tools: http://www.eladsegev.com/tools, Count Words: http://www.countwordsfree.com, DMI: https://wiki.digitalmethods.net/Dmi/ToolDatabase, Mozdeh: http://mozdeh.wlv.ac.uk, (Thelwall, ) TAGS: https://tags.hawksey.info, Visone: https://visone.info, Gephi: https://gephi.org, NodeXL: https://nodexl.com, quanteda: https://quanteda.io/reference/textplot_network.html, (Benoit et al, ) textnets: https://github.com/cbail/textnets, Text-Network Analysis: https://github.com/michal-pikusa/text-network-analysis…”
Section: Guidelines For Conducting Textual Network Analysis (mentioning)
confidence: 99%