1996
DOI: 10.1002/(sici)1097-4571(199610)47:10<749::aid-asi3>3.0.co;2-2
|View full text |Cite
|
Sign up to set email alerts
|

Filtered document retrieval with frequency-sorted indexes

Abstract: Ranking techniques are effective at finding answers in document collections but can be expensive to evaluate. We propose an evaluation technique that uses early recognition of which documents are likely to be highly ranked to reduce costs; for our test data, queries are evaluated in 2% of the memory of the standard implementation without degradation in retrieval effectiveness. Cpu time and disk traffic can also be dramatically reduced by designing inverted indexes explicitly to support the technique. The princ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
44
0
1

Year Published

1999
1999
2012
2012

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 87 publications
(45 citation statements)
references
References 5 publications
0
44
0
1
Order By: Relevance
“…In the list above the first and second chunk contain one pointer, and the third one two. The final frequency-sorted index size is rarely larger than when generated the classic way [29].…”
Section: Special Index Construction For Ranked Queriesmentioning
confidence: 99%
“…In the list above the first and second chunk contain one pointer, and the third one two. The final frequency-sorted index size is rarely larger than when generated the classic way [29].…”
Section: Special Index Construction For Ranked Queriesmentioning
confidence: 99%
“…Top-k query processing has received much attention in a variety of settings such as similarity search on multimedia data [7,24,29,30,45,46], ranked retrieval on text and semistructured documents in digital libraries and on the Web [3,6,36,40,48,52,55], network and stream monitoring [4,14] collaborative recommendation and preference queries on ecommerce product catalogs [17,31,42,56], and ranking of SQL-style query results on structured data sources in general [1,11,18]. Among the ample work on top-k query processing, the TA family of algorithms for monotonic score aggregation [25,30,46] stands out as an extremely efficient and highly versatile method.…”
Section: Related Workmentioning
confidence: 99%
“…By modifying the output criterion, also incremental-tunable algorithms are possible. As a specific instance of this class of algorithms, the Nosferatu algorithm (Pfeifer and Pennekamp 1997) assumes that inverted list entries are sorted by decreasing indexing weights (a similar algorithm has been described in Persin et al 1996). In this case, inverted lists are processed in parallel, and entries are read in the order of decreasing RSV increments.…”
Section: Blockmentioning
confidence: 99%