Abstract:We study the profile $X_{n,k}$ of random search trees including binary search
trees and $m$-ary search trees. Our main result is a functional limit theorem
of the normalized profile $X_{n,k}/\mathbb{E}X_{n,k}$ for $k=\lfloor\alpha\log
n\rfloor$ in a certain range of $\alpha$. A central feature of the proof is the
use of the contraction method to prove convergence in distribution of certain
random analytic functions in a complex domain. This is based on a general
theorem concerning the contraction method for ra… Show more
“…In the context of the analysis of performance of algorithms, the method was first employed by Rösler [29] who proved convergence in distribution for the rescaled total cost of the randomized version of quicksort. The method was then further developed by Rachev and Rüschendorf [27], Rösler [30], and later on in [5,9,22,24,25,31] and has permitted numerous analyses in distribution for random discrete structures. So far, the method has mostly been used to analyze random variables taking real values, though a few applications on functions spaces have been made, see [5,9,16].…”
Section: Main Results and Implicationsmentioning
confidence: 99%
“…The method was then further developed by Rachev and Rüschendorf [27], Rösler [30], and later on in [5,9,22,24,25,31] and has permitted numerous analyses in distribution for random discrete structures. So far, the method has mostly been used to analyze random variables taking real values, though a few applications on functions spaces have been made, see [5,9,16]. Here we are interested in the function space D[0, 1] with the uniform topology, but the main idea persists: (1) devise a recursive equation for the quantity of interest (here the process(C n (s), s ∈ [0, 1])), and (2) prove that a properly rescaled version of the quantity converges to a fixed point of a certain map related to the recursive equation ; (3) if the map is a contraction in a certain metric space, then a fixed point is unique and may be obtained by iteration.…”
We consider the problem of recovering items matching a partially specified pattern in multidimensional trees (quad trees and k-d trees). We assume the traditional model where the data consist of independent and uniform points in the unit square. For this model, in a structure on n points, it is known that the number of nodes Cn(ξ) to visit in order to report the items matching an independent and uniformly on [0, 1] random query ξ satisfies E[Cn(ξ)] ∼ κn β , where κ and β are explicit constants. We develop an approach based on the analysis of the cost Cn(x) of any fixed query x ∈ [0, 1], and give precise estimates for the variance and limit distribution of the cost Cn(x). Our results permit to describe a limit process for the costs Cn(x) as x varies in
“…In the context of the analysis of performance of algorithms, the method was first employed by Rösler [29] who proved convergence in distribution for the rescaled total cost of the randomized version of quicksort. The method was then further developed by Rachev and Rüschendorf [27], Rösler [30], and later on in [5,9,22,24,25,31] and has permitted numerous analyses in distribution for random discrete structures. So far, the method has mostly been used to analyze random variables taking real values, though a few applications on functions spaces have been made, see [5,9,16].…”
Section: Main Results and Implicationsmentioning
confidence: 99%
“…The method was then further developed by Rachev and Rüschendorf [27], Rösler [30], and later on in [5,9,22,24,25,31] and has permitted numerous analyses in distribution for random discrete structures. So far, the method has mostly been used to analyze random variables taking real values, though a few applications on functions spaces have been made, see [5,9,16]. Here we are interested in the function space D[0, 1] with the uniform topology, but the main idea persists: (1) devise a recursive equation for the quantity of interest (here the process(C n (s), s ∈ [0, 1])), and (2) prove that a properly rescaled version of the quantity converges to a fixed point of a certain map related to the recursive equation ; (3) if the map is a contraction in a certain metric space, then a fixed point is unique and may be obtained by iteration.…”
We consider the problem of recovering items matching a partially specified pattern in multidimensional trees (quad trees and k-d trees). We assume the traditional model where the data consist of independent and uniform points in the unit square. For this model, in a structure on n points, it is known that the number of nodes Cn(ξ) to visit in order to report the items matching an independent and uniformly on [0, 1] random query ξ satisfies E[Cn(ξ)] ∼ κn β , where κ and β are explicit constants. We develop an approach based on the analysis of the cost Cn(x) of any fixed query x ∈ [0, 1], and give precise estimates for the variance and limit distribution of the cost Cn(x). Our results permit to describe a limit process for the costs Cn(x) as x varies in
“…In this respect, a promising direction of investigation is provided by the approach of the problem via continuous time branching processes [8][9][10]. Indeed, within this framework, it is possible to relate the mass distribution to other asymptotic large time distributions which are themeselves determined by coupled integral equations.…”
Section: Resultsmentioning
confidence: 99%
“…These problems have been analyzed using continuous time branching processes [8][9][10] and we believe that the mass distribution exponents can in principle be deduced using those methods. However, we find it illuminating to study the model directly using discrete time and elementary methods.…”
We investigate the statistics of trees grown from some initial tree by attaching links to preexisting vertices, with attachment probabilities depending only on the valence of these vertices. We consider the asymptotic mass distribution that measures the repartition of the mass of large trees between their different subtrees. This distribution is shown to be a broad distribution and we derive explicit expressions for scaling exponents that characterize its behavior when one subtree is much smaller than the others. We show in particular the existence of various regimes with different values of these mass distribution exponents. Our results are corroborated by a number of exact solutions for particular solvable cases, as well as by numerical simulations.
“…An alternative proof could be given relying on a distributional recurrence for the sequence W n (z). This approach was taken by Drmota, Janson and Neininger [11] studying the profile of m-ary search trees including the case of BSTs. As for the martingale approach, this methodology can be worked out for any β ∈ N 0 or β = −1.…”
For a martingale (Xn) converging almost surely to a random variable X, the sequence (Xn − X) is called martingale tail sum. Recently, Neininger [Random Structures Algorithms, 46 (2015), 346-361] proved a central limit theorem for the martingale tail sum of Régnier's martingale for the path length in random binary search trees. Grübel and Kabluchko [to appear in Annals of Applied Probability, (2016)] gave an alternative proof also conjecturing a corresponding law of the iterated logarithm. We prove the central limit theorem with convergence of higher moments and the law of the iterated logarithm for a family of trees containing binary search trees, recursive trees and plane-oriented recursive trees.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.