Pier Luigi Conti scite author profile

The estimation of the distribution function of a population is an important problem in sampling finite populations. The existing literature focuses on the problem of estimating the population distribution function (p.f.d.) at a single point, or at a finite number of points. In this paper the main interest consists in estimating the whole p.d.f.. In many respects, the starting point is close to classical nonparametric statistics, although the approach to inference is based on sampling design. It is shown here that the Hajek estimator of the p.d.f., if properly centered and scaled, converges weakly to a Gaussian process with covariance kernel proportional to that of a Brownian bridge. The proportionality factor essentially depends on the sample design. Applications to (i) construction of a confidence band for the p.d.f., (ii) comparison of the p.d.f.s of two populations, and (iii) testing for independence of two characters are provided

show abstract

Statistical Matching Analysis for Complex Survey Data With Applications

Conti

Marella

Scanu

2016

Journal of the American Statistical Association

View full text Add to dashboard Cite

The goal of statistical matching is the estimation of a joint distribution having observed only samples from its marginals. The lack of joint observations on the variables of interest is the reason of uncertainty about the joint population distribution function. In the present article, the notion of matching error is introduced, and upper-bounded via an appropriate measure of uncertainty. Then, an estimate of the distribution function for the variables not jointly observed is constructed on the basis of a modification of the conditional independence assumption in the presence of logical constraints. The corresponding measure of uncertainty is estimated via sample data. Finally, a simulation study is performed, and an application to a real case is provided. Supplementary materials for this article are available online

show abstract

A unified principled framework for resampling based on pseudo-populations: Asymptotic theory

Conti¹,

Marella²,

Mecatti³

et al. 2020

Bernoulli

View full text Add to dashboard Cite

In this paper, a class of resampling techniques for finite populations under complex sampling design is introduced. The basic idea on which they rest is a two-step procedure consisting in:(i) constructing a "pseudo-population" on the basis of sample data; (ii) drawing a sample from the predicted population according to an appropriate resampling design. From a logical point of view, this approach is essentially based on the plug-in principle by Efron, at the "sampling design level". Theoretical justifications based on large sample theory are provided. New approaches to construct pseudo populations based on various forms of calibrations are proposed. Finally, a simulation study is performed.

show abstract

On the asymptotic distribution of a general measure of monotone dependence

Cifarelli¹,

Conti²,

Regazzini³

1996

Ann. Statist.

View full text Add to dashboard Cite

Inference for Quantiles of a Finite Population: Asymptotic versus Resampling Results

Conti

Marella

2014

Scandinavian J Statistics

View full text Add to dashboard Cite

The aim of the paper is to study the problem of estimating the quantile function of a finite population. Attention is first focused on point estimation, and asymptotic results are obtained. Confidence intervals are then constructed, based on both the following: (i) asymptotic results and (ii) a resampling technique based on rescaling the ‘usual’ bootstrap. A simulation study to compare asymptotic and resampling-based results, as well as an application to a real population, is finally performed

show abstract

Blind maximum likelihood estimation of traffic matrices under long-range dependent traffic

Conti¹,

Giovanni

Naldi

2010

Computer Networks

View full text Add to dashboard Cite

a b s t r a c tA new method, based on the maximum likelihood principle, through the numerical Expectation-Maximization algorithm, is proposed to estimate traffic matrices when traffic exhibits long-range dependence. The methods proposed so far in the literature do not account for long-range dependence. The method proposed in the present paper also provides an estimate of the Hurst parameter. Simulation results show that: (i) the estimate of the traffic matrix is more efficient than those obtained via existing techniques; (ii) the estimation error of the traffic matrix is lower for larger values of the true traffic intensity; (iii) the estimate of the Hurst parameter is slightly negatively biased.

show abstract

Large sample Bayesian analysis for ${\rm Geo}/G/1$ discrete-time queueing models

Conti¹

1999

Ann. Statist.

View full text Add to dashboard Cite

Ìnthis paper, a nonparametric Bayesian analysis of queueing models with geometric input and general service time is performed. In particular, Ž . statistical inference for the probability generating function p.g.f. of the equilibrium waiting time distribution is considered. The consistency of the posterior distribution for such a p.g.f., as well as the weak convergence to a Gaussian process of a suitable rescaling, are proved. As by-products, results on statistical inference for queueing characteristics are also obtained. Finally, the problem of estimating the probability of a long delay is considered.

show abstract

How far from identifiability? A systematic overview of the statistical matching problem in a non parametric framework

Conti

Marella

Scanu

2016

Communications in Statistics - Theory and Methods

View full text Add to dashboard Cite

Statistical matching consists in estimating the joint characteristics of two variables observed in two distinct and independent sample surveys, respectively. In a parametric setup, ranges of estimates for non identifiable parameters are the only estimable items, unless restrictive assumptions on the probabilistic relationship between the non jointly observed variables are imposed. These ranges correspond to the uncertainty due to the absence of joint observations on the pair of variables of interest. The aim of this paper is to analyze the uncertainty in statistical matching in a non parametric setting. A measure of uncertainty is introduced, and its properties studied: this measure studies the "intrinsic" association between the pair of variables, which is constant and equal to 1/6 whatever the form of the marginal distribution functions of the two variables when knowledge on the pair of variables is the only one available in the two samples. This measure becomes useful in the context of the reduction of uncertainty due to further knowledge than data themselves, as in the case of structural zeros. In this case the proposed measure detects how the introduction of further knowledge shrinks the intrinsic uncertainty from 1/6 to smaller values, zero being the case of no uncertainty. Sampling properties of the uncertainty measure and of the bounds of the uncertainty intervals are also proved.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Pier Luigi Conti

On the Estimation of the Distribution Function of a Finite Population Under High Entropy Sampling Designs, with Applications

Statistical Matching Analysis for Complex Survey Data With Applications

A unified principled framework for resampling based on pseudo-populations: Asymptotic theory

On the asymptotic distribution of a general measure of monotone dependence

Inference for Quantiles of a Finite Population: Asymptotic versus Resampling Results

Blind maximum likelihood estimation of traffic matrices under long-range dependent traffic

Large sample Bayesian analysis for ${\rm Geo}/G/1$ discrete-time queueing models

How far from identifiability? A systematic overview of the statistical matching problem in a non parametric framework

Contact Info

Product

Resources

About