“…Please note the sizes of X r i , W r i,i+1 and b r i must match. For example, in Figure 1, the transformation between H1 layer and H2 layer, the sizes of X r 3 , X r 2 , W [2,3] , and b r 2 are separately [1, 1, 64], [1,1,64], [64,64], and [1,64].…”