2023
DOI: 10.48550/arxiv.2302.01428
Preprint

Dataset Distillation Fixes Dataset Reconstruction Attacks

Abstract: Modern deep learning requires large volumes of data, which may contain sensitive or private information that must not be leaked. Recent work has shown that, for homogeneous neural networks, a large portion of the training data can be reconstructed with access only to the trained network parameters. While the attack was shown to work empirically, there is little formal understanding of the regime in which it is effective, or of ways to defend against it. In this work, we first build a stronger version of the dataset recons…

Cited by 1 publication (1 citation statement). References 17 publications (20 reference statements).
“…This type of problem arises in many deep learning fields such as hyperparameter optimization (Domke, 2012; MacKay et al., 2019; Maclaurin et al., 2015), meta-learning (Finn et al., 2017; Rajeswaran et al., 2019), and adversarial training (Madry et al., 2017; Szegedy et al., 2013), as well as safety and verification methods (Gruenbacher et al., 2022; Grunbacher et al., 2021; Xiao et al., 2022). Similarly, dataset distillation can also be framed as a bilevel optimization problem, with θ the set of network parameters and ψ our distilled dataset parameters, given by the coreset images and labels (Loo et al., 2023; Nguyen et al., 2021a; Wang et al., 2018; Zhou et al., 2022).…”
Section: Introduction
confidence: 99%
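
As a rough sketch of the bilevel framing described in the quoted statement (the loss names and dataset symbols below are placeholders, not notation taken from the cited paper), dataset distillation can be written as

\[
\min_{\psi} \; \mathcal{L}_{\text{outer}}\big(\theta^{*}(\psi);\, \mathcal{D}_{\text{real}}\big)
\quad \text{s.t.} \quad
\theta^{*}(\psi) \in \arg\min_{\theta} \; \mathcal{L}_{\text{inner}}\big(\theta;\, \psi\big),
\]

where the inner problem trains the network parameters θ on the distilled data ψ (the coreset images and labels), and the outer problem evaluates the resulting network on the real training set and updates ψ accordingly.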