Determining whether saddle points exist or are approximable for nonconvex-nonconcave problems is usually intractable. We take a step toward understanding certain nonconvex-nonconcave minimax problems that do remain tractable. Specifically, we study minimax problems cast in geodesic metric spaces, which provide a vast generalization of the usual convex-concave saddle point problems. The first main result of the paper is a geodesic metric space version of Sion's minimax theorem; we believe our proof is novel and transparent, as it relies only on Helly's theorem. In our second main result, we specialize to geodesically complete Riemannian manifolds: we devise and analyze the complexity of first-order methods for smooth minimax problems.
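For reference, the classical Euclidean statement that this paper generalizes is Sion's minimax theorem; in the geodesic version, convexity of the sets and convexity/concavity of f are replaced by their geodesic counterparts. The notation below is the standard one for the classical theorem, not necessarily the paper's. With X compact and convex, Y convex, f(·, y) quasiconvex and lower semicontinuous for each y, and f(x, ·) quasiconcave and upper semicontinuous for each x,

\[
\min_{x \in X} \, \sup_{y \in Y} f(x, y) \;=\; \sup_{y \in Y} \, \min_{x \in X} f(x, y).
\]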
The learning theory community has recently made progress in characterizing the generalization error of gradient methods for general convex losses. In this work, we focus on how training longer affects generalization in smooth stochastic convex optimization (SCO) problems. We first provide tight lower bounds for general non-realizable SCO problems. Existing upper bounds further suggest that sample complexity can be improved by assuming the loss is realizable, i.e., a single optimal solution simultaneously minimizes the loss on every data point. However, this improvement deteriorates when the training horizon is long, and matching lower bounds have been lacking. Our paper examines this observation by providing excess risk lower bounds for gradient descent (GD) and stochastic gradient descent (SGD) in two realizable settings: (1) realizable with T = O(n), and (2) realizable with T = Ω(n), where T denotes the number of training iterations and n is the size of the training dataset. These bounds are novel and informative in characterizing the relationship between T and n. In the first, small-training-horizon case, our lower bounds almost tightly match the corresponding upper bounds and provide the first optimality certificates for them. For the realizable case with T = Ω(n), however, a gap remains between the lower and upper bounds. We conjecture that this gap can be closed by improving the upper bounds, a conjecture supported by our analyses in the one-dimensional and linear regression settings.
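For concreteness, the realizability condition referred to above can be written as follows; the loss f, samples z_i, and minimizer w* are generic placeholders rather than the paper's exact notation:

\[
\exists\, w^{*} \ \text{such that} \quad f(w^{*}; z_i) = \min_{w} f(w; z_i) \quad \text{for all } i = 1, \dots, n,
\]

i.e., one solution is simultaneously optimal for every training point, not merely for the average (empirical) risk.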