2023
DOI: 10.48550/arxiv.2302.09160
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

On Equivalent Optimization of Machine Learning Methods

Abstract: At the core of many machine learning methods resides an iterative optimization algorithm for their training. Such optimization algorithms often come with a plethora of choices regarding their implementation. In the case of deep neural networks, choices of optimizer, learning rate, batch size, etc. must be made. Despite the fundamental way in which these choices impact the training of deep neural networks, there exists no general method for identifying when they lead to equivalent, or non-equivalent, optimizati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 39 publications
0
1
0
Order By: Relevance
“…Recent work [50] has used data-driven tools [8587] from Koopman operator theory [88, 89] to extract properties of RNN dynamics that can be compared using a precise notion of equivalence, called “topological conjugacy” [90]. Similar approaches have been used to study the dynamics associated with training deep neural networks [91], as well as iterative optimization algorithms, more generally [92]. Therefore, we complimented the geometric and topological analysis performed on the structure of the population level representations (Fig.…”
Section: Population Representation Detailsmentioning
confidence: 99%
“…Recent work [50] has used data-driven tools [8587] from Koopman operator theory [88, 89] to extract properties of RNN dynamics that can be compared using a precise notion of equivalence, called “topological conjugacy” [90]. Similar approaches have been used to study the dynamics associated with training deep neural networks [91], as well as iterative optimization algorithms, more generally [92]. Therefore, we complimented the geometric and topological analysis performed on the structure of the population level representations (Fig.…”
Section: Population Representation Detailsmentioning
confidence: 99%