“…Afterwards, Transformers spring up and make splendid breakthroughs on various vision tasks [13,14,15,16,17,18,19,20,21,22,23,24,25,26,27]. Most recently, the multi-layer perceptrons (MLPs) based architectures [28,29] have regained their light and been demonstrated capable of achieving stunning results on vision tasks [30,28,31,32,29,33,34]. A situation in which these three families of backbone architectures are contending has been formed.…”