Incremental validation of XML documents

Balmin, Andrey; Papakonstantinou, Yannis; Vianu, Victor

doi:10.1145/1042046.1042050

Cited by 77 publications

(112 citation statements)

References 23 publications

(32 reference statements)

Supporting

Mentioning

110

Contrasting


“…The set Q of states is the disjoint union of the vertices V of G, plus two distinguished states q_0 (which is D's initial state) and q_f (which is D's only accepting state). The alphabet Σ is defined as (V ∪ {q_0, q_f})^2. The transitions of D are defined as follows.…”

Section: Theorem 20 Dre-definability(mindfa) Is Nlogspace-hard (mentioning)

confidence: 99%

“…Primarily, schema information is crucial for automatic error detection in the data itself (which is called validation, see, e.g., [5,26,2,20]) or in the procedures that transform the data [24,23,22]. Furthermore, schemas provide information for optimization of XML querying and processing [25,28], they are inevitable when integrating data through schema matching [1], and they provide users with a high-level overview of the structure of the data.…”

Section: Introduction (mentioning)

confidence: 99%


Deciding Definability by Deterministic Regular Expressions

Czerwiński

David

Losemann

et al. 2013

Lecture Notes in Computer Science


Abstract. We investigate the complexity of deciding whether a given regular language can be defined with a deterministic regular expression. Our main technical result shows that the problem is PSPACE-complete if the input language is represented as a regular expression or nondeterministic finite automaton. The problem becomes EXPSPACE-complete if the language is represented as a regular expression with counters.


“…The basic idea adopted in this family of methods [9,10,14,24,45,79] is to abstract DTDs/XSDs as extended Context-Free Grammars (CFG).…”

Section: Xml Document/grammar Validation (mentioning)

confidence: 99%
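The abstraction described in the statement above can be illustrated with a minimal sketch. It assumes a toy schema, not one taken from any of the cited works: each element name maps to a regular expression over the names of its children (an extended CFG production), and a document is valid when every node's child sequence matches its production. The names `RULES` and `is_valid` and the element names are hypothetical.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical schema written as extended CFG productions:
# element name -> regular expression over child element names.
# Each child name is followed by one space so that names cannot run together.
RULES = {
    "library": r"(book )*",
    "book": r"title (author )+",
    "title": r"",
    "author": r"",
}


def is_valid(node):
    """Return True if the subtree rooted at node conforms to RULES."""
    rule = RULES.get(node.tag)
    if rule is None:
        # Element name not declared in the schema.
        return False
    # Encode the sequence of child labels as a word over element names.
    word = "".join(child.tag + " " for child in node)
    if re.fullmatch(rule, word) is None:
        return False
    # Recursively check every child subtree.
    return all(is_valid(child) for child in node)


good = ET.fromstring(
    "<library><book><title/><author/><author/></book></library>"
)
bad = ET.fromstring("<library><book><title/></book></library>")
print(is_valid(good), is_valid(bad))
```

This sketch revalidates the whole tree on every call; it shows only the grammar abstraction, not an incremental validation algorithm.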

XML document-grammar comparison: related problems and applications

Tekli

Chbeir

Traina

et al. 2011

Open Computer Science


doi:10.2478/s13537-011-0005-1

XML document comparison is becoming an ever more popular research issue due to the increasingly abundant use of XML. Likewise, a growing interest fosters the development of XML grammar matching and comparison, due to the proliferation of heterogeneous XML data sources, particularly on the Web. Nonetheless, the process of comparing XML documents with XML grammars, i.e., XML document and grammar similarity evaluation, has not yet received the attention it deserves. In this paper, we provide an overview on existing research related to XML document/grammar comparison, presenting the background and discussing the various techniques related to the problem. We also discuss some prominent application domains, ranging over document classification and clustering, document transformation, grammar evolution, selective dissemination of XML information, XML querying, as well as alert filtering in intrusion detection systems and Web Services matching and communications.


“…2 (b)). However, we cannot add such an address node because of (2). Hence there is no edit script between t and D.…”

Section: Overview Of Operations (mentioning)

confidence: 99%

“…G(t, j, k) exactly covers all the patterns of children of n_i; that is, (1) for every edit script s for t (not deleting n_i), G(t, j, k) contains a path from m_{j−1} to m_k that represents ch(s(t), n_i), and (2) for every path p from m_{j−1} to m_k in G(t, j, k), there exists an edit script s for t such that ch(s(t), n_i) is represented by p. Similarly, for every siblings n_j, n_k with j ≤ k, G (t, j, k) exactly covers all the patterns of children of n_{j,k}. Proof (sketch): Condition (1) can easily be shown by induction on the length of s. As for Condition (2), let p be a path from m_{j−1} to m_k in G(t, j, k). Let s be an edit script obtained from p by replacing (i) each leaf edge […] Step 3: We first show some definitions related to NFA, which is necessary to determine if a path in G(t, j, k) or G (t, j, k) from m_{j−1} to m_k, representing a sequence of children of a node, matches a regular expression.…”

Section: First G(t J K) Is Defined As the Subgraph Of G(t) Consist (mentioning)

confidence: 99%

On Finding an Edit Script between an XML Document and a DTD

Suzuki

2006

IPSJ Digital Courier


Finding an edit script between data has played an important role in data retrieval and data transformation. So far many methods for finding an edit script between two XML documents have been proposed, but few studies on finding an edit script between an XML document and a DTD have been made. In this paper, we first present a polynomial-time algorithm for finding an edit script between an XML document and a DTD, which is optimum under some restrictions on operations. We next prove the correctness of the algorithm.
