“…Therefore, it is very important to design FL algorithms to reduce the overall communication cost, which takes into account both the number of communication rounds and the cost per communication round for reaching a desired accuracy. With these two quantities in mind, there are two principal approaches for communication-efficient FL: 1) local methods, where in each communication round, clients run multiple local update steps before communicating with the server, in the hope of reducing the number of communication rounds, e.g., [47,43,36,24,35,61,51,2,67,50,49,42]; 2) compression methods, where clients send compressed communication message to the server, in the hope of reducing the cost per communication round, e.g., [4,37,60,28,34,48,52,25,53,19,41,68]. While both categories have garnered significant attention in recent years, we will focus on the second approach based on communication compression to enhance communication efficiency.…”