2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS)
DOI: 10.1109/focs.2018.00064

Dispersion for Data-Driven Algorithm Design, Online Learning, and Private Optimization

Abstract: A crucial problem in modern data science is data-driven algorithm design, where the goal is to choose the best algorithm, or algorithm parameters, for a specific application domain. In practice, we often optimize over a parametric algorithm family, searching for parameters with high performance on a collection of typical problem instances. While effective in practice, these procedures generally have not come with provable guarantees. A recent line of work initiated by a seminal paper of Gupta and Roughgarden […]
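The setup the abstract describes, selecting algorithm parameters by their average performance on a collection of typical problem instances, can be sketched in a few lines. The sketch below is purely illustrative: the toy performance measure, the parameter grid, and the function names are assumptions, not part of the paper.

```python
# Minimal sketch of empirical parameter selection over a sample of problem
# instances. `performance`, `select_parameter`, and the toy data are
# hypothetical placeholders, not the paper's framework.
import random


def performance(param, instance):
    """Toy utility: fraction of instance values 'covered' by a threshold parameter."""
    return sum(1.0 for x in instance if x <= param) / len(instance)


def select_parameter(candidate_params, instances):
    """Return the candidate with the highest average performance over the sample."""
    return max(
        candidate_params,
        key=lambda p: sum(performance(p, inst) for inst in instances) / len(instances),
    )


if __name__ == "__main__":
    random.seed(0)
    sample = [[random.random() for _ in range(50)] for _ in range(20)]
    grid = [0.1 * k for k in range(1, 10)]
    print("selected parameter:", select_parameter(grid, sample))
```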

Cited by 48 publications (129 citation statements)
References 41 publications
“…As we describe more in Section 3.1.2, this requires us to prove that with high probability, each function sequence from several infinite families of sequences is dispersed. This facet of our analysis is notably different from prior research by Balcan et al [2018a]: in their applications, it is enough to show that with high probability, a single, finite sequence of functions is dispersed. Our proofs thus necessitate that we carefully examine the structure of the utility functions that we analyze.…”
Section: Our Contributions
confidence: 73%
“…Rather, we must prove that for all type vectors, the dispersion property holds. This facet of our analysis is notably different from prior work by Balcan et al [2018a]: in their applications, it is enough to show that with high probability, a single, finite sequence of functions is dispersed. In contrast, we show that under mild assumptions, with high probability, each function sequence from an infinite family is dispersed.…”
Section: Dispersion and Pseudo-dimension Guarantees
confidence: 78%
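As a rough illustration of the dispersion condition these excerpts refer to, the sketch below checks a one-dimensional (w, k)-style criterion: no interval of width w may contain discontinuities from more than k functions in the sequence. The helper names and the exact criterion are simplifying assumptions, not the paper's formal definition (which is probabilistic and applies in higher dimensions).

```python
# Minimal sketch of a one-dimensional (w, k)-dispersion check, assuming the
# discontinuity locations of each piecewise-Lipschitz function are known.
# Function and variable names are illustrative, not from the paper.
from bisect import bisect_right


def is_dispersed(discontinuities_per_fn, w, k):
    """Return True if every interval of width w contains discontinuities
    from at most k of the functions in the sequence."""
    # Pool all discontinuity points, remembering which function each came from.
    points = sorted(
        (x, i) for i, xs in enumerate(discontinuities_per_fn) for x in xs
    )
    coords = [x for x, _ in points]
    for start, (x0, _) in enumerate(points):
        # The worst interval can be shifted to start at a discontinuity,
        # so it suffices to check windows of the form [x0, x0 + w].
        end = bisect_right(coords, x0 + w)
        touched = {i for _, i in points[start:end]}
        if len(touched) > k:
            return False
    return True


if __name__ == "__main__":
    # Three functions, each with two discontinuities in [0, 1].
    seq = [[0.1, 0.8], [0.15, 0.5], [0.45, 0.9]]
    print(is_dispersed(seq, w=0.1, k=2))  # True: no width-0.1 window hits all 3
    print(is_dispersed(seq, w=0.5, k=2))  # False: [0.1, 0.6] touches all 3
```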
“…The authors of [37] studied the problem with smooth loss functions and proposed using the ℓ₂ gradient norm of a private estimator, i.e., ‖∇L(w_priv, D)‖₂, to measure the utility, which was then extended in [34, 31] to the cases of non-smooth loss functions and high-dimensional spaces. It is well known that the ℓ₂ gradient norm can certify only a first-order stationary point (or critical point), and thus may lead to inferior generalization performance [10]. The authors of [28] are the first to show that the utility of general non-convex loss functions can also be measured in the same way as convex loss functions by the expected excess empirical risk.…”
Section: L(w, D)
confidence: 99%
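To make the two utility measures contrasted above concrete, the sketch below computes both the ℓ₂ gradient norm at a stand-in private estimator and its excess empirical risk on a toy least-squares objective. The data, loss, and noise scale are illustrative assumptions, not taken from the cited works.

```python
# Minimal sketch contrasting two utility measures for a private estimator
# w_priv on a toy least-squares objective L(w, D) = (1/n) * sum_i (x_i . w - y_i)^2.
# The data, loss, and noise level are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
n, d = 200, 5
X = rng.normal(size=(n, d))
w_star = rng.normal(size=d)
y = X @ w_star + 0.1 * rng.normal(size=n)


def loss(w):
    return np.mean((X @ w - y) ** 2)


def grad(w):
    return 2.0 * X.T @ (X @ w - y) / n


# A stand-in "private" estimator: the least-squares solution plus Gaussian noise.
w_hat = np.linalg.lstsq(X, y, rcond=None)[0]
w_priv = w_hat + 0.05 * rng.normal(size=d)

# Utility measure 1: l2 norm of the gradient at w_priv (first-order stationarity).
grad_norm = np.linalg.norm(grad(w_priv))

# Utility measure 2: excess empirical risk L(w_priv, D) - min_w L(w, D).
excess_risk = loss(w_priv) - loss(w_hat)

print(f"gradient norm at w_priv: {grad_norm:.4f}")
print(f"excess empirical risk:   {excess_risk:.4f}")
```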
“…Here, we consider a simple private logistic regression model with ℓ₂ regularization trained on the Adult dataset [28]. The model is privatized by training with mini-batched projected SGD, then applying a Gaussian perturbation at the output using the method from [49, Algorithm 2] with default parameters. The only hyperparameters tuned in this example are the regularization γ and the noise standard deviation σ, while the rest are fixed.…”
Section: Private Logistic Regression
confidence: 99%
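A minimal version of the pipeline this excerpt describes, ℓ₂-regularized logistic regression trained with projected mini-batch SGD followed by Gaussian output perturbation, might look as follows. The synthetic data and the noise scale σ are placeholders; this is not the exact calibration of [49, Algorithm 2] nor the Adult dataset setup.

```python
# Minimal sketch: l2-regularized logistic regression trained with projected
# mini-batch SGD, then Gaussian output perturbation. The synthetic data and the
# noise scale sigma are placeholders, not the calibration of [49, Algorithm 2].
import numpy as np

rng = np.random.default_rng(0)
n, d = 1000, 10
X = rng.normal(size=(n, d))
y = (X @ rng.normal(size=d) + 0.5 * rng.normal(size=n) > 0).astype(float)

gamma = 0.1    # l2 regularization strength (tuned hyperparameter)
sigma = 0.05   # std of the Gaussian output perturbation (tuned hyperparameter)
radius = 5.0   # projection radius for the parameter vector
lr, epochs, batch = 0.1, 20, 50


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


w = np.zeros(d)
for _ in range(epochs):
    for idx in np.array_split(rng.permutation(n), n // batch):
        Xb, yb = X[idx], y[idx]
        g = Xb.T @ (sigmoid(Xb @ w) - yb) / len(idx) + gamma * w
        w -= lr * g
        # Project back onto the l2 ball of the given radius.
        norm = np.linalg.norm(w)
        if norm > radius:
            w *= radius / norm

# Gaussian perturbation applied at the output of training.
w_priv = w + sigma * rng.normal(size=d)
acc = np.mean((sigmoid(X @ w_priv) > 0.5) == y)
print(f"training accuracy of the perturbed model: {acc:.3f}")
```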
“…Recent work on data-driven algorithm configuration has considered the problem of tuning the hyperparameters of combinatorial optimization algorithms while maintaining DP [5]. The setting considered in [5] assumes there is an underlying distribution of problem instances, and a sample from this distribution is used to select hyperparameters that will have good computational performance on future problem instances sampled from the same distribution. In this case, the authors consider a threat model where the whole sample of problem instances used to tune the algorithm needs to be protected.…”
Section: Related Work
confidence: 99%
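One standard way to select hyperparameters from a finite grid while protecting the whole sample of problem instances, as in the threat model described above, is the exponential mechanism. The sketch below is a generic instance of that mechanism under an assumed bounded per-instance utility; it is not necessarily the mechanism used in [5], and the `utility` function is hypothetical.

```python
# Minimal sketch of privately selecting a hyperparameter from a finite grid
# with the exponential mechanism, treating the sample of problem instances as
# the protected dataset. Generic DP selection sketch, not necessarily the
# mechanism of [5]; `utility` and its [0, 1] range are illustrative.
import numpy as np

rng = np.random.default_rng(0)


def utility(param, instance):
    """Hypothetical per-instance score in [0, 1] for running the algorithm
    configured with `param` on `instance`."""
    return float(np.clip(1.0 - abs(param - instance), 0.0, 1.0))


def exponential_mechanism(candidates, instances, epsilon):
    """Pick a candidate with probability proportional to
    exp(epsilon * total_utility / (2 * sensitivity))."""
    totals = np.array(
        [sum(utility(c, inst) for inst in instances) for c in candidates]
    )
    sensitivity = 1.0  # changing one instance changes each total by at most 1
    scores = epsilon * totals / (2.0 * sensitivity)
    probs = np.exp(scores - scores.max())
    probs /= probs.sum()
    return candidates[rng.choice(len(candidates), p=probs)]


if __name__ == "__main__":
    sample_instances = rng.uniform(0.3, 0.5, size=30)  # the protected sample
    grid = [0.1 * k for k in range(11)]
    print("selected hyperparameter:",
          exponential_mechanism(grid, sample_instances, epsilon=1.0))
```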