Privacy accounting and quality control in the sage differentially private ML platform

Lécuyer, Mathias; Spahn, Riley; Vodrahalli, Kiran; Geambasu, Roxana; Hsu, Daniel

doi:10.1145/3341301.3359639

Cited by 25 publications

(23 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Some of the most prominent demands, which are close in spirit to the requirements of CL, as we need here, are listed as follows: R1: Endless execution R2: Multiple usage of data subsets R3: Capability of changing DP parameters during the execution Satisfying all the R1-R3 together is hard, therefore related papers address only one or two of these requirements. Along this line, two recent DL-based papers of [23], [24] have enabled DP to work on growing databases (dynamic datasets). More explicitly, to address R1 Cummings et al have considered a scheduler to re-execute the DL algorithms whenever the new received data is sufficient [24].…”

Section: Related Workmentioning

confidence: 99%

“…Note that, since each block of data might be used by different DL algorithms corresponding to the pipelines, calculating the PB spent by the whole pipelines would be challenging. To reach this goal, the authors of [23], have proposed the so-called block composition theorem by which the DL algorithms are executed till the PB consumption of each block 1 does not exceed the predefined GPB. To achieve the desired accuracy, with the aim of re-training the pipelines, either the relevant PB of each pipeline or the number of available samples is doubled.…”

Section: Related Workmentioning

confidence: 99%

“…The everlasting approach of CL is a serious impediment to deploy either of the proposed solutions in [23] or [24]. In particular, if one intends to add DP to CL, the limited GPB hinders the process to be continued.…”

Section: How To Add Dp While CL Is Executed Endlessly (Addressing R1)?mentioning

confidence: 99%

“…A serious impediment to deploy either of the proposed solutions in [23] or [24] in a CL process, is the data coming from the stream as well as samples stored in the EM to avoid catastrophic forgetting (CF). Note that, although the learners observe most of the data coming from the stream just once, a small portion stored in the EM is observed several times.…”

Section: How To Add Dp While Subsets Of Data Are Used Repeatedly (Add...mentioning

confidence: 99%

“…where 𝑃𝐵 F $ ,D % stands for the consumed PB of 𝑖th iteration of the learner 𝑙 $ . By doing so, we can calculate the total consumed PB for the sample via feeding 𝑃𝐵 E $ to the Block Composition Theorem (BCT) [23]. Tracking the behavior of 𝑃𝐵 E $ , if it exceeds the GPB, we no longer use that sample in our CL procedure.…”

Section: How To Add Dp While Subsets Of Data Are Used Repeatedly (Add...mentioning

confidence: 99%

See 4 more Smart Citations