“…In particular, it is shown that under a suitable scaling, as the widths tend to infinity, the neural network's learning dynamics converge to a nonlinear deterministic limit, known as the mean field (MF) limit [14,17]. This line of work began with analyses of the shallow case under various settings and has led to a number of nontrivial and exciting results [18,14,5,23,25,9,22,19,29,24,30,12,1,16]. The generalization to multilayer neural networks, which is conceptually and technically much more challenging, has also been pursued in earnest by different groups of authors, who have contributed various novel ideas and insights [15,17,20,2,26,6].…”