2004
DOI: 10.1145/1042046.1042050

Incremental validation of XML documents

Abstract: We investigate the incremental validation of XML documents with respect to DTDs, specialized DTDs, and XML Schemas, under updates consisting of element tag renamings, insertions, and deletions. DTDs are modeled as extended context-free grammars. "Specialized DTDs" allow the decoupling of element types from element tags. XML Schemas are abstracted as specialized DTDs with limitations on the type assignment. For DTDs and XML Schemas, we exhibit an O(m log n) incremental validation algorithm using an auxiliary st…
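To make the abstract's modeling step concrete, here is a minimal Python sketch of a DTD treated as an extended context-free grammar: each element tag maps to a regular expression over the tags of its children. The `Node` class, the toy `DTD` table, and the function names are all hypothetical, chosen only for illustration. This is a naive validator plus a local recheck after a tag renaming; it is not the paper's O(m log n) algorithm and uses no auxiliary structure.

```python
import re
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    tag: str
    children: List["Node"] = field(default_factory=list)


# Hypothetical toy DTD: tag -> regular expression over child tags.
# Every child tag is followed by a space so tag names cannot run together.
DTD = {
    "library": r"(book )*",
    "book": r"title (author )+",
    "title": r"",
    "author": r"",
}


def children_ok(node: Node) -> bool:
    """Check one rule: the word of child tags must match the content model."""
    model = DTD.get(node.tag)
    if model is None:  # undeclared element tag
        return False
    word = "".join(child.tag + " " for child in node.children)
    return re.fullmatch(model, word) is not None


def validate(node: Node) -> bool:
    """Validate the whole tree from scratch by checking every node."""
    return children_ok(node) and all(validate(c) for c in node.children)


def rename_and_recheck(parent: Node, index: int, new_tag: str) -> bool:
    """Rename one child of a previously valid tree and recheck locally.

    Only two checks can change: the parent's child word and the renamed
    node's own content model. Both are rescanned in full here.
    """
    child = parent.children[index]
    child.tag = new_tag
    return children_ok(parent) and children_ok(child)


doc = Node("library", [Node("book", [Node("title"), Node("author")])])
print(validate(doc))
```

The local recheck still rescans the parent's entire child sequence, so its cost grows with the number of siblings; avoiding that rescan is the kind of saving an auxiliary structure is meant to provide.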

Citations: cited by 77 publications (112 citation statements)
References: 23 publications (32 reference statements)

Citation statements
“…The set Q of states is the disjoint union of the vertices V of G, plus two distinguished states $q_0$ (which is D's initial state) and $q_f$ (which is D's only accepting state). The alphabet Σ is defined as $(V \cup \{q_0, q_f\})^2$. The transitions of D are defined as follows.…”
Section: Theorem 20 DRE-Definability(minDFA) Is NLOGSPACE-hard (mentioning)
confidence: 99%
“…Primarily, schema information is crucial for automatic error detection in the data itself (which is called validation, see, e.g., [5,26,2,20]) or in the procedures that transform the data [24,23,22]. Furthermore, schemas provide information for optimization of XML querying and processing [25,28], they are inevitable when integrating data through schema matching [1], and they provide users with a high-level overview of the structure of the data.…”
Section: Introduction (mentioning)
confidence: 99%
“…The basic idea adopted in this family of methods [9,10,14,24,45,79] is to abstract DTDs/XSDs as extended Context-Free Grammars (CFG).…”
Section: XML Document/Grammar Validation (mentioning)
confidence: 99%
“…2 (b)). However, we cannot add such an address node because of (2). Hence there is no edit script between t and D.…”
Section: Overview of Operations (mentioning)
confidence: 99%
“…G(t, j, k) exactly covers all the patterns of children of $n_i$; that is, (1) for every edit script s for t (not deleting $n_i$), G(t, j, k) contains a path from $m_{j-1}$ to $m_k$ that represents ch(s(t), $n_i$), and (2) for every path p from $m_{j-1}$ to $m_k$ in G(t, j, k), there exists an edit script s for t such that ch(s(t), $n_i$) is represented by p. Similarly, for every siblings $n_j$, $n_k$ with j ≤ k, G (t, j, k) exactly covers all the patterns of children of $n_{j,k}$. Proof (sketch): Condition (1) can easily be shown by induction on the length of s. As for Condition (2), let p be a path from $m_{j-1}$ to $m_k$ in G(t, j, k). Let s be an edit script obtained from p by replacing (i) each leaf edge Step 3 We first show some definitions related to NFA, which is necessary to determine if a path in G(t, j, k) or G (t, j, k) from $m_{j-1}$ to $m_k$, representing a sequence of children of a node, matches a regular expression.…”
Section: First G(t, j, k) Is Defined As the Subgraph Of G(t) Consist… (mentioning)
confidence: 99%