2007
DOI: 10.1007/978-3-540-74999-8_45
|View full text |Cite
|
Sign up to set email alerts
|

N-Gram vs. Keyword-Based Passage Retrieval for Question Answering

Abstract: Abstract. In this paper we describe the participation of the Universidad Politécnica of Valencia to the 2006 edition, which was focused on the comparison between a Passage Retrieval engine (JIRS) specifically aimed to the Question Answering task and a standard, general use search engine such as Lucene. JIRS is based on n-grams, Lucene on keywords. We participated in three monolingual tasks: Spanish, Italian and French. The obtained results show that JIRS is able to return high quality passages, especially in S… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2009
2009
2012
2012

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 6 publications
(4 reference statements)
0
3
0
Order By: Relevance
“…This PR system uses a weighting scheme based on n-grams density. It was proved in [1] that this approach is more effective in the PR and QA tasks than other commonly used IR systems based on keywords and the well-known TF.IDF weighting scheme. So, JIRS works under the premise that, in a sufficiently large document collection, question n-grams should appear near the answer at least once.…”
Section: The Jirs Passage Retrieval Systemmentioning
confidence: 99%
“…This PR system uses a weighting scheme based on n-grams density. It was proved in [1] that this approach is more effective in the PR and QA tasks than other commonly used IR systems based on keywords and the well-known TF.IDF weighting scheme. So, JIRS works under the premise that, in a sufficiently large document collection, question n-grams should appear near the answer at least once.…”
Section: The Jirs Passage Retrieval Systemmentioning
confidence: 99%
“…Therefore, our system cannot solve anaphoras. We refer the reader to the description in [2] for a detailed description of the base system.…”
Section: Wordnet-based Index Expansionmentioning
confidence: 99%
“…Our system is constituted by a modified version of the QUASAR system described in [2]. For this task the search engine (JIRS) has been replaced by Lucene 2 , which can work with multiple indices.…”
Section: Introductionmentioning
confidence: 99%