2009
DOI: 10.1145/1462198.1462202
|View full text |Cite
|
Sign up to set email alerts
|

Information filtering and query indexing for an information retrieval model

Abstract: In the information filtering paradigm, clients subscribe to a server with continuous queries or profiles that express their information needs. Clients can also publish documents to servers. Whenever a document is published, the continuous queries satisfying this document are found and notifications are sent to appropriate clients. This article deals with the filtering problem that needs to be solved efficiently by each server: Given a database of continuous queries db and a document d , find all queries q ∈ db… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
40
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 26 publications
(40 citation statements)
references
References 81 publications
(92 reference statements)
0
40
0
Order By: Relevance
“…The robustness of the proposed methodology is highlighted not only by the publications in top-class venues that utilize it (e.g., [11,12,16,18,22]), but also by the different document corpora it was applied on (TREC .gov, TREC ClueWeb09, OHSUMED, NN, and others). Finally, interesting directions for future work include the design and implementation of modules for creating realistic vector space and semi-structured continuous queries.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…The robustness of the proposed methodology is highlighted not only by the publications in top-class venues that utilize it (e.g., [11,12,16,18,22]), but also by the different document corpora it was applied on (TREC .gov, TREC ClueWeb09, OHSUMED, NN, and others). Finally, interesting directions for future work include the design and implementation of modules for creating realistic vector space and semi-structured continuous queries.…”
Section: Discussionmentioning
confidence: 99%
“…Some of these approaches include the systems XFilter [1], YFilter [6], DFA [10], the Boolean version of SIFT [17], and the agent-based DIAS [11]. Other approaches focus more on the algorithmic aspect by providing efficient treebased data structures such as [12,16,18,19,20] for dealing with documents that are free text and profiles that are conjunctions of keywords. To the best of our knowledge the only work that is somewhat relevant to ours is [15], where a corpus of documents (but no continuous queries) is built for adaptive filtering tasks.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…We opt to use the data traces of traditional web search services, which are truly representative of the end user behavior to use keywords in the real world. Similar approaches are used by many previous works [23] We use an input query history file (containing 4,000,000 pre-processed queries) collected from the Microsoft MSN search engine (MSN in short). On average, the number of terms per query is 2.843 in MSN.…”
Section: A Experimental Settingsmentioning
confidence: 99%