The estimation of the distribution function of a population is an important problem in sampling finite populations. The existing literature focuses on the problem of estimating the population distribution function (p.f.d.) at a single point, or at a finite number of points. In this paper the main interest consists in estimating the whole p.d.f.. In many respects, the starting point is close to classical nonparametric statistics, although the approach to inference is based on sampling design. It is shown here that the Hajek estimator of the p.d.f., if properly centered and scaled, converges weakly to a Gaussian process with covariance kernel proportional to that of a Brownian bridge. The proportionality factor essentially depends on the sample design. Applications to (i) construction of a confidence band for the p.d.f., (ii) comparison of the p.d.f.s of two populations, and (iii) testing for independence of two characters are provided
The goal of statistical matching is the estimation of a joint distribution having observed only samples from its marginals. The lack of joint observations on the variables of interest is the reason of uncertainty about the joint population distribution function. In the present article, the notion of matching error is introduced, and upper-bounded via an appropriate measure of uncertainty. Then, an estimate of the distribution function for the variables not jointly observed is constructed on the basis of a modification of the conditional independence assumption in the presence of logical constraints. The corresponding measure of uncertainty is estimated via sample data. Finally, a simulation study is performed, and an application to a real case is provided. Supplementary materials for this article are available online
In this paper, a class of resampling techniques for finite populations under complex sampling design is introduced. The basic idea on which they rest is a two-step procedure consisting in:(i) constructing a "pseudo-population" on the basis of sample data; (ii) drawing a sample from the predicted population according to an appropriate resampling design. From a logical point of view, this approach is essentially based on the plug-in principle by Efron, at the "sampling design level". Theoretical justifications based on large sample theory are provided. New approaches to construct pseudo populations based on various forms of calibrations are proposed. Finally, a simulation study is performed.
The aim of the paper is to study the problem of estimating the quantile function of a finite population. Attention is first focused on point estimation, and asymptotic results are obtained. Confidence intervals are then constructed, based on both the following: (i) asymptotic results and (ii) a resampling technique based on rescaling the ‘usual’ bootstrap. A simulation study to compare asymptotic and resampling-based results, as well as an application to a real population, is finally performed
a b s t r a c tA new method, based on the maximum likelihood principle, through the numerical Expectation-Maximization algorithm, is proposed to estimate traffic matrices when traffic exhibits long-range dependence. The methods proposed so far in the literature do not account for long-range dependence. The method proposed in the present paper also provides an estimate of the Hurst parameter. Simulation results show that: (i) the estimate of the traffic matrix is more efficient than those obtained via existing techniques; (ii) the estimation error of the traffic matrix is lower for larger values of the true traffic intensity; (iii) the estimate of the Hurst parameter is slightly negatively biased.
Ìnthis paper, a nonparametric Bayesian analysis of queueing models with geometric input and general service time is performed. In particular, Ž . statistical inference for the probability generating function p.g.f. of the equilibrium waiting time distribution is considered. The consistency of the posterior distribution for such a p.g.f., as well as the weak convergence to a Gaussian process of a suitable rescaling, are proved. As by-products, results on statistical inference for queueing characteristics are also obtained. Finally, the problem of estimating the probability of a long delay is considered.
Statistical matching consists in estimating the joint characteristics of two variables observed in two distinct and independent sample surveys, respectively. In a parametric setup, ranges of estimates for non identifiable parameters are the only estimable items, unless restrictive assumptions on the probabilistic relationship between the non jointly observed variables are imposed. These ranges correspond to the uncertainty due to the absence of joint observations on the pair of variables of interest. The aim of this paper is to analyze the uncertainty in statistical matching in a non parametric setting. A measure of uncertainty is introduced, and its properties studied: this measure studies the "intrinsic" association between the pair of variables, which is constant and equal to 1/6 whatever the form of the marginal distribution functions of the two variables when knowledge on the pair of variables is the only one available in the two samples. This measure becomes useful in the context of the reduction of uncertainty due to further knowledge than data themselves, as in the case of structural zeros. In this case the proposed measure detects how the introduction of further knowledge shrinks the intrinsic uncertainty from 1/6 to smaller values, zero being the case of no uncertainty. Sampling properties of the uncertainty measure and of the bounds of the uncertainty intervals are also proved.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.