“…As shown in [9], a probabilistic DTA (PDTA) incorporates a probability, p m (σ, t 1 , ..., t m ), for every transition in the automaton, with the normalization that the probabilities of the transitions leading to the same state q ∈ Q must add up to one. For this purpose, one should note that, in this kind of deterministic models, the likelihood of the training sample is maximized if the stochastic model assigns to every tree t in the sample a probability equal to its relative frequency in Ω [8].…”