2020
DOI: 10.1162/qss_a_00021
|View full text |Cite
|
Sign up to set email alerts
|

Microsoft Academic Graph: When experts are not enough

Abstract: An ongoing project explores the extent to which artificial intelligence (AI), specifically in the areas of natural language processing and semantic reasoning, can be exploited to facilitate the studies of science by deploying software agents equipped with natural language understanding capabilities to read scholarly publications on the web. The knowledge extracted by these AI agents is organized into a heterogeneous graph, called Microsoft Academic Graph (MAG), where the nodes and the edges represent the entit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
195
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 319 publications
(211 citation statements)
references
References 41 publications
0
195
0
Order By: Relevance
“…Although Google Scholar and Microsoft Academic are the two most comprehensive bibliographic data sources analysed in this study, their search functionalities have a number of limitations, such as limited support of Boolean and other types of search operators, limited filtering capabilities (Google Scholar), and non-transparent algorithms to process queries and rank the documents in the results page (Microsoft Academic uses artificial intelligence, and Google Scholar uses publicly unknown heuristics to rank documents by relevance) (Beel and Gipp 2009a , b , c ; Martin-Martin et al 2017 ; Orduña-Malea et al 2016 ; Rovira et al 2019 ; Wang et al 2020 ). These characteristics, which prevent users from being able to generate complex search equations that are guaranteed to stay reproducible over time, have led some authors to consider Google Scholar and Microsoft Academic inadequate for query-based search (Gusenbauer and Haddaway 2020 ).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Although Google Scholar and Microsoft Academic are the two most comprehensive bibliographic data sources analysed in this study, their search functionalities have a number of limitations, such as limited support of Boolean and other types of search operators, limited filtering capabilities (Google Scholar), and non-transparent algorithms to process queries and rank the documents in the results page (Microsoft Academic uses artificial intelligence, and Google Scholar uses publicly unknown heuristics to rank documents by relevance) (Beel and Gipp 2009a , b , c ; Martin-Martin et al 2017 ; Orduña-Malea et al 2016 ; Rovira et al 2019 ; Wang et al 2020 ). These characteristics, which prevent users from being able to generate complex search equations that are guaranteed to stay reproducible over time, have led some authors to consider Google Scholar and Microsoft Academic inadequate for query-based search (Gusenbauer and Haddaway 2020 ).…”
Section: Resultsmentioning
confidence: 99%
“…In 2016, Microsoft launched a new platform called Microsoft Academic, based on Bing’s web crawling infrastructure. Like Google Scholar, Microsoft Academic is a free academic search engine, but unlike Google Scholar, Microsoft Academic facilitates bulk access to its data via an Applications Programming Interface (API) (Wang et al 2020 ).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Open Academic Graph (OAG) [34,38,44] contains more than 178 million nodes and 2.236 billion edges. It is the largest publicly available heterogeneous academic dataset to date.…”
Section: Methodsmentioning
confidence: 99%
“…Typically, the two sectors are either analysed separately [15,[17][18][19][20] or together on a small scale [10,11], using a limited sample of papers and patents. Most of these analyses rely on knowledge graphs describing research publications, such as Microsoft Academic Graph [21], Scopus 2 , Semantic Scholar 3 , Aminer [22], Core [23], OpenCitations [24], and others. Other resources, such as Dimensions 4 , the United States Patent and Trademark Office corpus 5 , the PatentScope corpus 6 and the European Patent Office dataset 7 , offer a similar description of patents.…”
Section: Literature Reviewmentioning
confidence: 99%