Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research W 2021
DOI: 10.18653/v1/2021.eacl-srw.7
|View full text |Cite
|
Sign up to set email alerts
|

Automatically Cataloging Scholarly Articles using Library of Congress Subject Headings

Abstract: Institutes are required to catalog their articles with proper subject headings so that the users can easily retrieve relevant articles from the institutional repositories. However, due to the rate of proliferation of the number of articles in these repositories, it is becoming a challenge to manually catalog the newly added articles at the same pace. To address this challenge, we explore the feasibility of automatically annotating articles with Library of Congress Subject Headings (LCSH). We first use web scra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 19 publications
(16 reference statements)
0
2
0
Order By: Relevance
“…Wartena and Franke-Maier (2018) studied the possibility of assigning LCSH automatically by training classifiers for terms used frequently in a large collection of abstracts of the literature on hand and by extracting headings from those abstracts by combining both methods [17]. Kazi et al (2021) use web-scraping techniques to extract keywords from article sets in the Literature Analysis and Metrics Portal, and map these keywords to biologically relevant LCSH names to develop a gold standard dataset that demonstrates the feasibility of this approach for predicting LCSH of scholarly articles [10].…”
Section: Automatic Subject Indexingmentioning
confidence: 99%
See 1 more Smart Citation
“…Wartena and Franke-Maier (2018) studied the possibility of assigning LCSH automatically by training classifiers for terms used frequently in a large collection of abstracts of the literature on hand and by extracting headings from those abstracts by combining both methods [17]. Kazi et al (2021) use web-scraping techniques to extract keywords from article sets in the Literature Analysis and Metrics Portal, and map these keywords to biologically relevant LCSH names to develop a gold standard dataset that demonstrates the feasibility of this approach for predicting LCSH of scholarly articles [10].…”
Section: Automatic Subject Indexingmentioning
confidence: 99%
“…However, LDA tends to produce noisier and difficult-to-interpret results, and its parameter specification setting is also a significant problem. Other state-of-the-art approaches either classify papers in a top-down fashion, taking advantage of pre-existent categories from domain vocabularies, such as Medical Subject Headings (MeSH), and Library of Congress Subject Headings (LCSH) [9,10]. The advantage of this method is that it relies on a set of artificially predefined research subject vocabularies and then uses trained machine learning classifiers to transform the labels of the nearest neighbors into predicted terms.…”
Section: Introductionmentioning
confidence: 99%