“…Since the hidden nodes number is an important parameter which may greatly affect the learning accuracy and generalization performance of the neural network, for each algorithm we set = 20, = 50, and = 100, respectively, for experiments, and the corresponding numbers of training data for initialization are taken as 50, 100, and 200, respectively. Besides, other parameters for each algorithm are set as follows: for R-OSELM, = 10 −8 ; for FR-OSELM, = 10 −8 , = 0.98; for DFF-OSELM, all the parameters are set the same as in [23], and an extra regularization parameter, = 10 −8 , is added to stabilize the algorithm; for our AFGR-OSELM, = 10 −8 , = 0.1, + = 0.999, − = 0.8, the initial value for is set as 0.995, and the S and can be simply initialized to identity matrix and zero vector, respectively. Table 1 presents the mean and standard deviations (SD) of the prediction RMSE over 30 independent trials of the five models for performing different prediction steps on the simulated time-varying system.…”