Untitled

A general re-weighting method, called contextualization, for more efficient element ranking in XML retrieval is introduced. Reweighting is based on the idea of using the ancestors of an element as a context: if the element appears in a good contextgood interpreted as probability of relevance -its weight is increased in relevance scoring; if the element appears in a bad context, its weight is decreased. The formal presentation of contextualization is given in a general XML representation and manipulation frame, which is based on utilization of structural indices. This provides a general approach independent of weighting schemas or query languages.Contextualization is evaluated with the INEX test collection. We tested four runs: no contextualization, parent, root and tower contextualizations. The contextualization runs were significantly better than no contextualization. The root contextualization was the best among the re-weighted runs.

show abstract

Contextualization models for XML retrieval

Arvola

Kekäläinen

Junkkari

2011

Information Processing & Management

View full text Add to dashboard Cite

Expected reading effort in focused retrieval evaluation

2010

View full text Add to dashboard Cite

This study introduces a novel framework for evaluating passage and XML retrieval. The framework focuses on a user's effort to localize relevant content in a result document. Measuring the effort is based on a system guided reading order of documents. The effort is calculated as the quantity of text the user is expected to browse through. More specifically, this study seeks evaluation metrics for retrieval methods following a specific fetch and browse approach, where in the fetch phase documents are ranked in decreasing order according to their document score, like in document retrieval. In the browse phase, for each retrieved document, a set of non-overlapping passages representing the relevant text within the document is retrieved. In other words, the passages of the document are re-organized, so that the best matching passages are read first in sequential order. We introduce an application scenario motivating the framework, and propose sample metrics based on the framework. These metrics give a basis for the comparison of effectiveness between traditional document retrieval and passage/XML retrieval and illuminate the benefit of passage/XML retrieval.

show abstract

TRIX 2004 – Struggling with the Overlap

Kekäläinen

Junkkari

Arvola

et al. 2005

View full text Add to dashboard Cite

Query Evaluation with Structural Indices

Arvola

Kekäläinen

Junkkari

2006

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Marko Junkkari

Generalized contextualization method for XML information retrieval

Contextualization models for XML retrieval

Expected reading effort in focused retrieval evaluation

TRIX 2004 – Struggling with the Overlap

Query Evaluation with Structural Indices

Contact Info

Product

Resources

About