2006
DOI: 10.1007/11871842_9
|View full text |Cite
|
Sign up to set email alerts
|

Learning Stochastic Tree Edit Distance

Abstract: Abstract. Trees provide a suited structural representation to deal with complex tasks such as web information extraction, RNA secondary structure prediction, or conversion of tree structured documents. In this context, many applications require the calculation of similarities between tree pairs. The most studied distance is likely the tree edit distance (ED) for which improvements in terms of complexity have been achieved during the last decade. However, this classic ED usually uses a priori fixed edit costs w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2007
2007
2019
2019

Publication Types

Select...
4
2

Relationship

4
2

Authors

Journals

citations
Cited by 9 publications
(15 citation statements)
references
References 9 publications
(11 reference statements)
0
15
0
Order By: Relevance
“…These probabilities are the parameters of a generative model describing a joint distribution over (input,output) pairs of trees. In [8], we proposed a first solution to this problem, in a restrictive case of tree edit distance, when a deletion (resp. an insertion) implies the removal (resp.…”
Section: Definitions and Notationsmentioning
confidence: 99%
See 1 more Smart Citation
“…These probabilities are the parameters of a generative model describing a joint distribution over (input,output) pairs of trees. In [8], we proposed a first solution to this problem, in a restrictive case of tree edit distance, when a deletion (resp. an insertion) implies the removal (resp.…”
Section: Definitions and Notationsmentioning
confidence: 99%
“…Recently, the Pascal network of excellence funded a pump priming project on the learning of a stochastic tree ED for musical recognition. A first learning algorithm, where deletions and insertions only concern entire subtrees, has been proposed in [8]. Although this type of tree ED is costless from an algorithmic standpoint (quadratic complexity [9] rather than a polynomial complexity of order 4 for a more general case [10]), it is not the most used in the literature because of a clear loss of generality.…”
Section: Introductionmentioning
confidence: 99%
“…A parametric approach has also been presented in [6] in the 2 A c c e p t e d m a n u s c r i p t context of graph ED, where each edit operation is modeled by a Gaussian Mixture Density. With the exception of our preliminary work [7], as far as we know, no method has been proposed to directly learn edit costs for a stochastic tree ED. The aim of this paper is to fill this gap by a non parametric stochastic method specifically adapted to trees.…”
Section: Introductionmentioning
confidence: 99%
“…This joint work has lead to publications in the previous conferences ECML'06 [7] and ECML'07 [4], and in Pattern Recognition [3,8]. This research has also received funding from the RedEx PASCAL in the form of a pump-priming project in 2007.…”
Section: Introductionmentioning
confidence: 96%
“…However, in many real world applications, such a strategy clearly appears insufficient. To overcome this drawback and to capture background knowledge, supervised learning has been used during the last few years for learning the parameters of edit distances [1,2,3,4,7,8,9], often by maximizing the likelihood of a learning set. The learned models usually take the form of state machines such as stochastic transducers or probabilistic automata.…”
Section: Introductionmentioning
confidence: 99%