We give the first polynomial-time algorithm for list-decodable covariance estimation. For any α > 0, our algorithm takes as input a sample Y ⊆ ℝ^d of size n ≥ d^poly(1/α) obtained by adversarially corrupting (1 − α)n points in an i.i.d. sample X of size n from the Gaussian distribution with unknown mean μ* and covariance Σ*. In n^poly(1/α) time, it outputs a constant-size list of k = k(α) = (1/α)^poly(1/α) candidate parameters that, with high probability, contains a (μ̂, Σ̂) such that the total variation distance TV(N(μ*, Σ*), N(μ̂, Σ̂)) < 1 − O_α(1). This is the statistically strongest notion of distance and implies multiplicative spectral and relative Frobenius distance approximation with dimension-independent error. Our algorithm works more generally for any distribution D that possesses low-degree sum-of-squares certificates of two natural analytic properties: 1) anti-concentration of one-dimensional marginals and 2) hypercontractivity of degree-2 polynomials.

Prior to our work, the only known results for estimating covariance in the list-decodable setting were for the special cases of list-decodable linear regression and subspace recovery [Karmalkar-Klivans-Kothari 2019]. Even for these special cases, the known error guarantees are weak; in particular, the algorithms need super-polynomial time for any sub-constant (in dimension d) target error in natural norms. As a corollary, our result yields the first polynomial-time exact algorithms for list-decodable linear regression and subspace recovery that, in particular, obtain 2^(−poly(d)) error in time polynomial in the underlying dimension.
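For concreteness, the two analytic properties can be stated roughly as follows. This is an informal sketch with simplified constants and quantifiers; the algorithm additionally requires low-degree sum-of-squares certificates of these inequalities, which is a stronger condition than the inequalities merely holding.

```latex
% Anti-concentration of one-dimensional marginals: for x drawn from D with
% mean \mu and covariance \Sigma, no direction v carries too much mass near 0
% (C is an absolute constant in this sketch):
\Pr_{x \sim D}\Big[\, |\langle x - \mu,\, v \rangle| \le \delta \sqrt{v^\top \Sigma v} \,\Big] \;\le\; C\,\delta
\quad \text{for every } v \in \mathbb{R}^d \text{ and } \delta > 0.

% Hypercontractivity of degree-2 polynomials: fourth moments of any
% degree-2 polynomial p are controlled by its second moments
% (C' is an absolute constant in this sketch):
\mathop{\mathbb{E}}_{x \sim D}\big[ p(x)^4 \big] \;\le\; C' \Big( \mathop{\mathbb{E}}_{x \sim D}\big[ p(x)^2 \big] \Big)^2.
```

Both properties hold, for example, for the standard Gaussian distribution, which is the canonical setting for the main result above.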
CCS CONCEPTS
• Theory of computation → Design and analysis of algorithms; Complexity theory and logic.