“…Wu et al [2020] are the first who applied variance reduction mechanism to tolerate Byzantine attacks (see the discussion above Q1). We also refer reader to , Rajput et al, 2019, Rodríguez-Barroso et al, 2020, Xu and Lyu, 2020, Alistarh et al, 2018, Allen-Zhu et al, 2021, Regatti et al, 2020, Yang and Bajwa, 2019a,b, Gupta et al, 2021, Peng et al, 2021 for other advances in Byzantine-robustness (see the detailed summaries in , Gorbunov et al, 2021a). We further progress the field by obtaining new theoretical SOTA convergence results in our work.…”