Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval 2003
DOI: 10.1145/860435.860464
|View full text |Cite
|
Sign up to set email alerts
|

Searching XML documents via XML fragments

Abstract: Most of the work on XML query and search has stemmed from the publishing and database communities, mostly for the needs of business applications. Recently, the Information Retrieval community began investigating the XML search issue to answer information discovery needs. Following this trend, we present here an approach where information needs can be expressed in an approximate manner as pieces of XML documents or "XML fragments" of the same nature as the documents that are being searched. We present an extens… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
81
0

Year Published

2005
2005
2012
2012

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 140 publications
(82 citation statements)
references
References 9 publications
0
81
0
Order By: Relevance
“…The ancestor context similarity ancSim between two nodes (n i , n j ) is based on the resemblance measure between their paths (p i , p j ). This is done by calculating three scores established in [30]. These scores are combined and weighted by the linguistic similarity between (n i , n j ) to compute the ancestor context similarity:…”
Section: Structural Similarity Computationmentioning
confidence: 99%
“…The ancestor context similarity ancSim between two nodes (n i , n j ) is based on the resemblance measure between their paths (p i , p j ). This is done by calculating three scores established in [30]. These scores are combined and weighted by the linguistic similarity between (n i , n j ) to compute the ancestor context similarity:…”
Section: Structural Similarity Computationmentioning
confidence: 99%
“…Let us advert that in VSM model, two documents are presented in a space whose dimensions correspond each to a distinct indexing unit [2]. Indexing units are words that are in their root forms.…”
Section: Text Document Similaritymentioning
confidence: 99%
“…At INEX 2002), a broad spectrum of techniques was used to exploit non-content aspects of XML documents in addressing the XML element retrieval task. For instance, the JuruXML system by Mass et al (2003) and Carmel et al (2003) extends the traditional vector space model by allowing XML collections to be searched through so-called "XML fragments" which combine content and structure features. Similarly, Gövert et al (2003) exploit content and structure features to identify relevant elements and to redistribute relevancy from elements to their enclosing elements.…”
Section: Xml Retrievalmentioning
confidence: 99%