2021
DOI: 10.48550/arxiv.2107.01198
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DRIFT: A Toolkit for Diachronic Analysis of Scientific Literature

Abstract: In this work, we present to the NLP community, and to the wider research community as a whole, an application for the diachronic analysis of research corpora. We open source an easy-to-use tool coined: DRIFT, which allows researchers to track research trends and development over the years. The analysis methods are collated from well-cited research works, with a few of our own methods added for good measure. Succinctly put, some of the analysis methods are: keyword extraction, word clouds, predicting declining/… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 8 publications
0
3
0
Order By: Relevance
“…Mohammad (2020c; 2020b) used the NLP Scholar dataset to examine citation patterns, the gender gap between female and male first-time authors, and n−gram distributions through interactive visualization. DRIFT (Sharma et al, 2021) tracks the changes in cs.CL, the computer science computational linguistics tag from arXiv, focusing on single-word terms and their word embeddings over time. In NLPExplorer, (Parmar et al, 2020) provide an exploratory tool for NLP publications based on ACL Anthology, including information on most-cited authors, areas, and venues; similar to NLP Scholar.…”
Section: Existing Resourcesmentioning
confidence: 99%
“…Mohammad (2020c; 2020b) used the NLP Scholar dataset to examine citation patterns, the gender gap between female and male first-time authors, and n−gram distributions through interactive visualization. DRIFT (Sharma et al, 2021) tracks the changes in cs.CL, the computer science computational linguistics tag from arXiv, focusing on single-word terms and their word embeddings over time. In NLPExplorer, (Parmar et al, 2020) provide an exploratory tool for NLP publications based on ACL Anthology, including information on most-cited authors, areas, and venues; similar to NLP Scholar.…”
Section: Existing Resourcesmentioning
confidence: 99%
“…For massive analysis reports on the field of NLP, Mohammad (2020) surveyed the literature with 1.1 million paper information dataset collected from Google Scholar. Additionally, Sharma et al (2021) proposed DRIFT, a data analysis tool that presents an overview of the landscape of a queried topic. They constructed their dataset from arXiv papers' abstracts.…”
Section: Related Workmentioning
confidence: 99%
“…The global aim of this series of studies was to investigate the speech and natural language processing (SNLP), research area through the related scientific publications, using a set of NLP tools, in harmony with the growing interest for scientometrics in SNLP [refer to Banchs, 2012 ; Jurafsky, 2016 ; Atanassova et al, 2019 ; Goh and Lepage, 2019 ; Mohammad, 2020a , b , c ; Wang et al, 2020 ; Sharma et al, 2021 and many more] or in various domains such as economics (Muñoz-Céspedes et al, 2021 ), finance (Daudert and Ahmadi, 2019 ), or disinformation (Monogarova et al, 2021 ).…”
Section: Introductionmentioning
confidence: 99%