1993
DOI: 10.1016/0306-4573(93)90104-l
|View full text |Cite
|
Sign up to set email alerts
|

Controlled and uncontrolled subject descriptions in the CF database: A comparison of optimal cluster-based retrieval results

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

1994
1994
2012
2012

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(4 citation statements)
references
References 18 publications
0
4
0
Order By: Relevance
“…These were then used to produce single-term, uncontrolled representations for the collections. Following Shaw (1993) weights for the document vector collections were based on the customary inverse document frequency formula, log,(d/d,), where d is the total number of documents in the collection, and dk is the number of documents in which term k appears. Term weights were then normalized from zero to 999 so that M?k = 0 if term k is assigned to all documents and lvk = 999 if term k is assigned to one document.…”
Section: Test Collectionsmentioning
confidence: 99%
See 1 more Smart Citation
“…These were then used to produce single-term, uncontrolled representations for the collections. Following Shaw (1993) weights for the document vector collections were based on the customary inverse document frequency formula, log,(d/d,), where d is the total number of documents in the collection, and dk is the number of documents in which term k appears. Term weights were then normalized from zero to 999 so that M?k = 0 if term k is assigned to all documents and lvk = 999 if term k is assigned to one document.…”
Section: Test Collectionsmentioning
confidence: 99%
“…In a series of articles, Shaw has explored the effectiveness of cluster-based retrieval as a function of indexing exhaustivity. These investigations have included examinations of four subject representations based on MeSH subject headings and subheadings employed in the Medline database (Shaw, 1990), four composite representations that include subject and citation representations (Shaw, 1991), and both controlled and uncontrolled sub-ject representations (Shaw, 1993). The results suggest that the performance of a retrieval system based on single-link clustering varies as a function of indexing exhaustivity.…”
Section: Introductionmentioning
confidence: 99%
“…The retrieval process itself , i.e., the process of ranking and retrieving documents in response to a set of queries, may also be simulated, as in several articles by Shaw (1990aShaw ( , 1990bShaw ( , 1991Shaw ( , 1993. As with other simulation studies, these simulations of an optimal cluster-based retrieval system allow the variability of the retrieval mechanism to be controlled and thereby allow differences in other aspects of the retrieval process to be more carefully examined.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Retrieval by titles, abstracts, and subject headings, in Compendex, was investigated by Byrne (1975). Controlled (subject headings) and uncontrolled subject descriptions (word-stems from titles and abstracts) produce similar levels of performance in retrieval and are thus complementary (Shaw Jr, 1993). Jenuwine and Floyd (2004) also concluded that MeSH descriptors and text-words should be used together for maximal retrieval.…”
Section: Introductionmentioning
confidence: 99%