2013
DOI: 10.1007/978-3-642-35647-6_11
|View full text |Cite
|
Sign up to set email alerts
|

Methodology for Evaluating Citation Parsing and Matching

Abstract: Bibliographic references between scholarly publications contain valuable information for researchers and developers involved with digital repositories. They are indicators of topical similarity between linked texts, impact of the referenced document, and improve navigation in user interfaces of digital libraries. Consequently, several approaches to extraction, parsing and resolving said references have been proposed to date. In this paper we develop a methodology for evaluating parsing and matching algorithms … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
2
2
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 11 publications
(8 reference statements)
0
4
0
Order By: Relevance
“…This is in addition to different ways to abbreviate journal names and nonstandard uses of punctuation marks. These factors made it difficult to use citation parsing software (assuming the results will not be reliable and will need further manual confirmation) and necessitated the tedious task of manual extraction of journal names (Fedoryszak et al, 2013). Finally, journal names could be identified within 33,216 citations to journal articles.…”
Section: Disambiguation Of Npl Citationsmentioning
confidence: 99%
“…This is in addition to different ways to abbreviate journal names and nonstandard uses of punctuation marks. These factors made it difficult to use citation parsing software (assuming the results will not be reliable and will need further manual confirmation) and necessitated the tedious task of manual extraction of journal names (Fedoryszak et al, 2013). Finally, journal names could be identified within 33,216 citations to journal articles.…”
Section: Disambiguation Of Npl Citationsmentioning
confidence: 99%
“…From the very beginning the main afford in CoAnSys have been put on document analysis algorithms, i.e. author name disambiguation [7,8,9], metadata extraction [10], document similarity and classification calculations [11,12], citation matching [13,14], etc. Some of algorithms can be used in Hadoop environment out-of-box, some need further amendments and some are entirely not applicable [15].…”
Section: Well-suited Algorithmsmentioning
confidence: 99%
“…indexes of ERNIE data that can facilitate user searches that return informative, ranked, and scored query results and is the basis for a process that takes advantage of indexed Web of Science publications in ERNIE. A similar approach has been described for citation matching of raw text strings [28]. In our process, a text file containing references is passed as input 120 to a Python script.…”
Section: Introductionmentioning
confidence: 99%