2018
DOI: 10.1088/1361-6420/aaea2a

On the regularizing property of stochastic gradient descent

Abstract: Stochastic gradient descent (SGD) and its variants are among the most successful approaches for solving large-scale optimization problems. At each iteration, SGD employs an unbiased estimator of the full gradient computed from a single randomly selected data point. Hence, it scales well with problem size, is very attractive for handling truly massive datasets, and holds significant potential for solving large-scale inverse problems. In this work, we rigorously establish its regularizing property under a p…
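For orientation only, here is a minimal sketch of the iteration the abstract describes, for a discretized linear inverse problem A x = y^δ with n equations: each step picks one data point (one row of A) uniformly at random and takes a gradient step for that single residual, which is an unbiased estimator of the full gradient of (1/2n)‖Ax − y^δ‖². The matrix, data, step-size schedule, and iteration count below are illustrative assumptions, not taken from the paper.

import numpy as np

def sgd_linear_inverse(A, y_delta, x0, num_iters=1000, eta0=1.0, alpha=0.5, rng=None):
    # Sketch of SGD for the least-squares functional (1/2n) * ||A x - y_delta||^2.
    # The schedule eta_k = eta0 / (k + 1)^alpha and the fixed iteration count
    # are illustrative choices, not the paper's prescription.
    rng = np.random.default_rng() if rng is None else rng
    n = A.shape[0]
    x = x0.astype(float)
    for k in range(num_iters):
        i = rng.integers(n)                    # one randomly selected data point
        residual = A[i] @ x - y_delta[i]       # scalar residual for that row
        grad_est = residual * A[i]             # unbiased estimator of the full gradient
        x = x - (eta0 / (k + 1) ** alpha) * grad_est
    return x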

Cited by 28 publications (44 citation statements). References 64 publications.

“…The stochastic gradient descent algorithm [23,36,37] is used to optimize equation (3). For better accuracy in prediction, the algorithm loops through all ratings in the training data and estimates the model parameters.…”
Section: Mathematical Modeling of the S3D Video Recommendation System (mentioning)
confidence: 99%
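To make the quoted description concrete, here is a hypothetical sketch of that kind of loop: SGD over (user, item, rating) triples for a regularized matrix-factorization model. The cited paper's equation (3) is not reproduced in the quote, so the squared-error objective, the factor dimension, and the parameter names below are assumptions for illustration only.

import numpy as np

def sgd_rating_factorization(ratings, n_users, n_items, n_factors=10,
                             lr=0.01, reg=0.05, n_epochs=20, rng=None):
    # Hypothetical stand-in for the cited model: minimize the regularized
    # squared error (r_ui - p_u . q_i)^2 + reg * (|p_u|^2 + |q_i|^2), summed over
    # ratings, by looping through all training ratings and updating the factors.
    rng = np.random.default_rng() if rng is None else rng
    P = 0.1 * rng.standard_normal((n_users, n_factors))   # user latent factors
    Q = 0.1 * rng.standard_normal((n_items, n_factors))   # item latent factors
    for _ in range(n_epochs):
        for u, i, r in ratings:                 # loop through all ratings in the training data
            pu, qi = P[u].copy(), Q[i].copy()
            err = r - pu @ qi                   # prediction error for this rating
            P[u] += lr * (err * qi - reg * pu)  # SGD step on the user factors
            Q[i] += lr * (err * pu - reg * qi)  # SGD step on the item factors
    return P, Q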
“…However, they do not give a rate of convergence, which remains an open problem. Numerically, we observe that the convergence rate obtained by the discrepancy principle is nearly order-optimal for low-regularity solutions, as the a priori rule in the regime in [JL19], and the performance is competitive with the standard Landweber method. Thus, the method is especially attractive for finding a low-accuracy solution.…”
Section: Proofs (mentioning)
confidence: 93%
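For context, the discrepancy principle referred to above stops the iteration at the first index whose residual norm falls below a multiple of the noise level δ. Below is a minimal sketch of such an a posteriori stopping rule wrapped around a generic SGD iteration; τ, the step-size schedule, and the per-step residual check are illustrative assumptions, not the cited analysis verbatim.

import numpy as np

def sgd_discrepancy_stop(A, y_delta, delta, x0, tau=1.1,
                         eta0=1.0, alpha=0.5, max_iters=100_000, rng=None):
    # Stop at the first iterate with ||A x_k - y_delta|| <= tau * delta (tau > 1).
    # Sketch only: the SGD update and step sizes are generic choices.
    rng = np.random.default_rng() if rng is None else rng
    n = A.shape[0]
    x = x0.astype(float)
    for k in range(max_iters):
        if np.linalg.norm(A @ x - y_delta) <= tau * delta:   # discrepancy principle check
            return x, k
        i = rng.integers(n)
        x = x - (eta0 / (k + 1) ** alpha) * (A[i] @ x - y_delta[i]) * A[i]
    return x, max_iters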
“…Remark 3.2.3. The condition r < 1 is related to an apparent saturation phenomenon with SGD: for any ν > 1, the SGD iterate x δ k with a priori stopping can only achieve a convergence rate comparable with that for ν = 1 in the setting of Assumption 3.1.1, at least for the current analysis [JL19]. It remains unclear whether this is an intrinsic drawback of SGD or due to limitations of the proof technique.…”
Section: Proofs (mentioning)
confidence: 94%
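For readers unfamiliar with the notation in the remark: ν indexes the smoothness of the exact solution through a source condition, and the rate below is the standard benchmark. The precise form of Assumption 3.1.1 is not reproduced in the quote, so the following LaTeX only records the usual setting:

x^{\dagger} - x_0 = (A^{*}A)^{\nu} w, \qquad \|w\| \le \rho \quad (\text{source condition of index } \nu),
\mathbb{E}\bigl[\|x^{\delta}_{k(\delta)} - x^{\dagger}\|\bigr] = O\bigl(\delta^{\frac{2\nu}{2\nu+1}}\bigr) \quad \text{for } 0 < \nu \le 1 .

As the remark notes, for ν > 1 the current analysis in [JL19] still yields only the ν = 1 rate O(δ^{2/3}); this apparent saturation is the phenomenon in question.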