Proceedings of the 2015 International Conference on the Theory of Information Retrieval 2015
DOI: 10.1145/2808194.2809489
|View full text |Cite
|
Sign up to set email alerts
|

The Feasibility of Brute Force Scans for Real-Time Tweet Search

Abstract: The real-time search problem requires making ingested documents immediately searchable, which presents architectural challenges for systems built around inverted indexing. In this paper, we explore a radical proposition: What if we abandon document inversion and instead adopt an architecture based on brute force scans of document representations? In such a design, "indexing" simply involves appending the parsed representation of an ingested document to an existing buffer, which is simple and fast. Quite surpri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 6 publications
0
1
0
Order By: Relevance
“…Obviously, a brute-force scan of sizeable collections is impractical for low-latency querying, with the exception of a few specialized cases [Lempel et al, 2007, Wang andLin, 2015]. For dense vector representations, the top-k retrieval problem is often called nearest neighbor (NN) search, and for a small set of φ comparison functions (inner products, L1 distance, and a few others), there exist efficient, scalable solutions.…”
Section: Logical/physical Separationmentioning
confidence: 99%
“…Obviously, a brute-force scan of sizeable collections is impractical for low-latency querying, with the exception of a few specialized cases [Lempel et al, 2007, Wang andLin, 2015]. For dense vector representations, the top-k retrieval problem is often called nearest neighbor (NN) search, and for a small set of φ comparison functions (inner products, L1 distance, and a few others), there exist efficient, scalable solutions.…”
Section: Logical/physical Separationmentioning
confidence: 99%