2008
DOI: 10.1016/j.jcss.2007.04.011

The complexity of properly learning simple concept classes

Abstract: We consider the complexity of properly learning concept classes, i.e. when the learner must output a hypothesis of the same form as the unknown concept. We present the following new upper and lower bounds on well-known concept classes:
• We show that unless NP = RP, there is no polynomial-time PAC learning algorithm for DNF formulas where the hypothesis is an OR-of-thresholds. Note that as special cases, we show that neither DNF nor OR-of-thresholds are properly learnable unless NP = RP. Previous hardness resul…
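The abstract's remark that DNF is a special case of OR-of-thresholds follows because each AND term over Boolean literals is itself a threshold function: the term is satisfied exactly when the number of satisfied literals reaches the term's length. Below is a minimal illustrative sketch of this inclusion; the example formula and helper names are made up for illustration and are not code from the paper.

```python
# Sketch: a DNF formula evaluated directly and as an OR-of-thresholds.
# Each term is a list of (variable_index, sign) literals; sign=True means x_i,
# sign=False means NOT x_i. Inputs x are tuples of 0/1 values.

def eval_dnf(terms, x):
    """True iff some term has all of its literals satisfied."""
    return any(all((x[i] == 1) == sign for i, sign in term) for term in terms)

def term_as_threshold(term):
    """An AND term as a threshold function: satisfied iff the count of
    satisfied literals reaches len(term)."""
    k = len(term)
    return lambda x: sum((x[i] == 1) == sign for i, sign in term) >= k

def eval_or_of_thresholds(thresholds, x):
    return any(t(x) for t in thresholds)

# Example: f = (x0 AND NOT x2) OR (x1 AND x2)
terms = [[(0, True), (2, False)], [(1, True), (2, True)]]
thresholds = [term_as_threshold(t) for t in terms]

for x in [(1, 0, 0), (0, 1, 1), (0, 0, 1)]:
    assert eval_dnf(terms, x) == eval_or_of_thresholds(thresholds, x)
```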

Cited by 35 publications (35 citation statements) · References 31 publications
“…Valiant's original definition required that the learning algorithm output a DNF expression but this restriction was later relaxed to any efficiently-evaluatable hypothesis with the stricter version being referred to as proper learning. All these variants of the DNF learning question remained open until a recent result by Alekhnovich et al that established NP-hardness of the hardest variant: proper learning from random examples only [2]. Building on their proof, we resolve one more of Valiant's questions: Theorem 1.2 (Informal).…”
mentioning
confidence: 84%
“…These results were strengthened by Nock et al. [29] who proved similar hardness even when learning by formulas of size k^α n^β (where α < 2 and β is any constant). Finally, Alekhnovich et al. removed any bounds on the size of the hypothesis (other than those naturally imposed by the polynomial running time of the learning algorithm) [2]. Angluin and Kharitonov prove that if non-uniform one-way functions exist then MQs do not help predicting DNF formulae [5].…”
Section: Relation To Other Work
mentioning
confidence: 99%
“…However, a result such as Theorem 1 was not known for learning intersections of (two) halfspaces. Blum and Rivest [8] showed that it is NP-hard to learn an intersection of two halfspaces with an intersection of two halfspaces, and Alekhnovich, Braverman, Feldman, Klivans and Pitassi [1] proved a similar result even when the hypothesis is an intersection of k halfspaces for any constant k. Both the results are only NP-hardness results and do not prove APX-hardness for the underlying optimization problem.…”
Section: Previous Work
mentioning
confidence: 96%
“…Note that the imperfect completeness is necessary in the above theorem, since via linear programming, one can always efficiently find a halfspace that correctly classifies all the points, if one exists. The theorem is optimal, since one can easily classify a 1/2 fraction of the data points correctly, by taking an arbitrary halfspace or its complement as a hypothesis. From the learning theory perspective, such an optimal hardness result is especially satisfying, since if one could efficiently find (1/2 + ε)-consistent hypothesis (i.e.…”
Section: Previous Work
mentioning
confidence: 99%
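The excerpt above notes that a halfspace consistent with all labeled points, if one exists, can be found efficiently by linear programming. Below is a minimal sketch of that observation (my own illustration, not code from either paper), using scipy.optimize.linprog and made-up toy data; the constraint y_i (w·x_i + b) ≥ 1 is feasible exactly when the points are strictly separable, since any strictly separating (w, b) can be rescaled to satisfy it.

```python
# Sketch: find a halfspace consistent with all labeled points via an LP
# feasibility problem (no objective, only constraints).
import numpy as np
from scipy.optimize import linprog

def consistent_halfspace(X, y):
    """X: (m, n) array of points, y: array of +/-1 labels.
    Returns (w, b) with y_i * (w . x_i + b) >= 1 for all i, or None."""
    m, n = X.shape
    # Variables: [w_1, ..., w_n, b]; constraint rows: -y_i * [x_i, 1] . v <= -1
    A_ub = -y[:, None] * np.hstack([X, np.ones((m, 1))])
    b_ub = -np.ones(m)
    c = np.zeros(n + 1)                    # feasibility only, no objective
    bounds = [(None, None)] * (n + 1)      # all variables free
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds)
    if not res.success:
        return None
    return res.x[:n], res.x[n]

# Toy data, separable by the first coordinate.
X = np.array([[0.0, 0.2], [0.1, 0.9], [0.9, 0.1], [1.0, 0.8]])
y = np.array([-1, -1, 1, 1])
sol = consistent_halfspace(X, y)
if sol is not None:
    w, b = sol
    assert np.all(y * (X @ w + b) >= 1 - 1e-6)
```

The excerpt's second point is the trivial baseline: either a fixed halfspace or its complement agrees with at least half of any labeled sample, so classifying a 1/2 fraction correctly requires no learning at all.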
“…While there has been significant recent progress on the problem, including that random DNF are learnable under the uniform distribution [12,24,25], virtually nothing is known about their learnability in the worst case, even in the classical noiseless PAC model. In fact, there are serious impediments to learning DNF, including hardness results for their proper learnability [1]. The previous best algorithm for learning s-term DNF from random examples is due to Verbeurgt [27] and runs in quasipolynomial time O(n^{log(s/ε)}), where ε is the error rate.…”
Section: Implications To Related Problems
mentioning
confidence: 99%