“…In nonparametric regression estimation we aim to recover an unknown d-variate function g 0 based on n observed input-output pairs (X i , Y i ) ∈ R d × R, i = 1, ..., n. Various regression estimating function classes, including wavelets, polynomials, splines and kernel estimates have been studied in the literature (see, e.g., [2], [5], [6], [7] and references therein). Along with the development of practical and theoretical applications of neural networks, regression estimations with neural networks are becoming popular in the recent literature (see, e.g., [1], [8], [9], [10], [13], [15], [18], [19], [21] and references therein). Usually a class of neural networks with properly chosen architecture and with weight vectors belonging to some regularized set W n is determined and the estimator ĝn of g 0 is selected to be either the regularized empirical risk minimizer…”