2019
DOI: 10.1093/imaiai/iaz016
|View full text |Cite
|
Sign up to set email alerts
|

A central limit theorem for Lp transportation cost on the real line with application to fairness assessment in machine learning

Abstract: We provide a central limit theorem for the Monge–Kantorovich distance between two empirical distributions with sizes $n$ and $m$, $\mathcal{W}_p(P_n,Q_m), \ p\geqslant 1,$ for observations on the real line. In the case $p>1$ our assumptions are sharp in terms of moments and smoothness. We prove results dealing with the choice of centring constants. We provide a consistent estimate of the asymptotic variance, which enables to build two sample tests and confidence intervals to certify the similarity betwe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
14
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 20 publications
(15 citation statements)
references
References 30 publications
1
14
0
Order By: Relevance
“…We refer to [19] for a more detailed account about the history of the problem. The problem has received a renewed interest in the last few years, both in the setup P = Q (see [3,25,39]) or for general P and Q (see [36] and [41] for finitely and countably supported probabilities, [19] for the case p = 2 and general probabilities and dimension and [18,6] for dimension d = 1 and general costs).…”
Section: Introductionmentioning
confidence: 99%
“…We refer to [19] for a more detailed account about the history of the problem. The problem has received a renewed interest in the last few years, both in the setup P = Q (see [3,25,39]) or for general P and Q (see [36] and [41] for finitely and countably supported probabilities, [19] for the case p = 2 and general probabilities and dimension and [18,6] for dimension d = 1 and general costs).…”
Section: Introductionmentioning
confidence: 99%
“…These have then been used to assess a new criterion of dataset fairness in classification. These contributions correspond to the publication del Barrio et al [2019b]. Additionally, we have provided a moderate deviation principle for the empirical transportation cost in general dimension.…”
Section: 73mentioning
confidence: 81%
“…Yet all other fairness criteria should be given with the calculation of a confidence interval. For instance in del Barrio et al [2019b] we propose confidence intervals for Wasserstein distance which is used in many methods in fair learning.…”
Section: Bootstraping Vs Direct Calculation Of Ic Intervalmentioning
confidence: 99%
See 2 more Smart Citations