2016
DOI: 10.1007/s11192-016-1892-7
|View full text |Cite
|
Sign up to set email alerts
|

Using Monte Carlo simulations to assess the impact of author name disambiguation quality on different bibliometric analyses

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
25
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(25 citation statements)
references
References 52 publications
0
25
0
Order By: Relevance
“…Recently, such distortive effects of ambiguous bibliographic data have been discussed for bibliometrics in general as well as network measures (e.g., Schulz, 2016;Strotmann & Zhao, 2012;van den Besselaar & Sandström, 2016). First, scholars should be warned that author name ambiguity can be detrimental to the study of collaboration networks by generating merged and/or split nodal entities.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…Recently, such distortive effects of ambiguous bibliographic data have been discussed for bibliometrics in general as well as network measures (e.g., Schulz, 2016;Strotmann & Zhao, 2012;van den Besselaar & Sandström, 2016). First, scholars should be warned that author name ambiguity can be detrimental to the study of collaboration networks by generating merged and/or split nodal entities.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…A challenge is that if we use many features, we cannot distinguish the impact of different positive-negative training data ratios from the impact of feature effectiveness. So, we tried to select a minimum set of featurescoauthor names and title wordswhich are commonly used in most disambiguation studies and have been found to be effective in disambiguating names (Ferreira et al, 2012;Schulz, 2016;Wang et al, 2012). Another reason is that these two features are available across all labeled datasets used in this study, while other features such as affiliation, journal names, and references are recorded in some data but not in another.…”
Section: Machine Learning Settingsmentioning
confidence: 99%
“…This challenge also holds at the disciplinary level, as investigated for Chemistry, Physics, Medicine, and Economics and Business by Harzing (2015). Even if the problem would be less pronounced at the specialty level, results may still benefit from more advanced disambiguation methods (Schulz, 2016).…”
Section: Fa-a(1)mentioning
confidence: 99%