2007
DOI: 10.1045/july2007-hagedorn
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing Search and Browse Using Automated Clustering of Subject Metadata

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
3
0

Year Published

2009
2009
2015
2015

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 8 publications
0
3
0
Order By: Relevance
“…This protocol implements a standardized metadata model for facilitating exchange between repositories. Approaches to document clustering in digital libraries have focused, among other things, on extending search queries and metadata entries of documents (Hagedorn et al , 2007; Rosenberg and Borgman, 1992). In this case, clustering is performed to detect the subject area of documents based on a predefined classification scheme, that is, a closed topic model (Newman et al , 2007).…”
Section: Introductionmentioning
confidence: 99%
“…This protocol implements a standardized metadata model for facilitating exchange between repositories. Approaches to document clustering in digital libraries have focused, among other things, on extending search queries and metadata entries of documents (Hagedorn et al , 2007; Rosenberg and Borgman, 1992). In this case, clustering is performed to detect the subject area of documents based on a predefined classification scheme, that is, a closed topic model (Newman et al , 2007).…”
Section: Introductionmentioning
confidence: 99%
“…Examples of unsupervised learning approaches include Krowne and Halbert (2005), who used a text-clustering approach to analyze the title, description and subject fields from the "americansouth.org" digital library, and Newman et al (2007) and Hagedorn et al (2007), who used a statistical topic model to enrich subject metadata in 7.5 million records in the OAIster Digital Library. Recently, Tuarob et al (2013) described a method for generating tags from a domain-specific controlled vocabulary to augment metadata for resources from four different environmental data repositories associated with the DataONE program.…”
mentioning
confidence: 99%
“…With various information retrieval (IR) and text categorization (TC, also known as automatic classification) models becoming more and more available for DLs and generating local demand for new, automated solutions [23], in this paper we test a new TC model in a real world setting for the above purpose. As TC research typically uses standard test collections of documents, we replace them by a small database, the institutional repository of the University of Strathclyde, Glasgow, called Strathprints 2 , indexed by the Library of Congress Subject Headings (LCSH).…”
Section: Introductionmentioning
confidence: 99%