Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval 2019
DOI: 10.1145/3331184.3331401
|View full text |Cite
|
Sign up to set email alerts
|

Solr Integration in the Anserini Information Retrieval Toolkit

Abstract: Anserini is an open-source information retrieval toolkit built around Lucene to facilitate replicable research. In this demonstration, we examine different architectures for Solr integration in order to address two current limitations of the system: the lack of an interactive search interface and support for distributed retrieval. Two architectures are explored: In the first approach, Anserini is used as a frontend to index directly into a running Solr instance. In the second approach, Lucene indexes built dir… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 8 publications
(5 reference statements)
0
2
0
Order By: Relevance
“…Third, the integration of search capabilities with a document store allows dstlr to focus analyses on subsets of documents, as demonstrated in Clancy et al (2019b). For convenience, our open-source search toolkit Anserini (Yang et al, 2018) provides a number of connectors for ingesting document collections into Solr (Clancy et al, 2019a), under different index architectures. The execution layer, which relies on Apache Spark, coordinates the two major phases of knowledge graph construction: extraction and enrichment.…”
Section: System Overviewmentioning
confidence: 99%
“…Third, the integration of search capabilities with a document store allows dstlr to focus analyses on subsets of documents, as demonstrated in Clancy et al (2019b). For convenience, our open-source search toolkit Anserini (Yang et al, 2018) provides a number of connectors for ingesting document collections into Solr (Clancy et al, 2019a), under different index architectures. The execution layer, which relies on Apache Spark, coordinates the two major phases of knowledge graph construction: extraction and enrichment.…”
Section: System Overviewmentioning
confidence: 99%
“…Third, the integration of search capabilities with a document store allows dstlr to focus analyses on subsets of documents, as demonstrated in Clancy et al (2019b). For convenience, our open-source search toolkit Anserini provides a number of connectors for ingesting document collections into Solr (Clancy et al, 2019a), under different index architectures. The execution layer, which relies on Apache Spark, coordinates the two major phases of knowledge graph construction: extraction and enrichment.…”
Section: System Overviewmentioning
confidence: 99%