“…On GPU architecture, there are many implementations to parallelize BS using di↵erent optimization techniques (such as memory coalescing, data transfer, kernel overlapping, divergent branch elimination, and e cient register usage) [8][9][10]. The research described in [11] proposed a parallel implementation of the CodeBook model on GPU to achieve BS. On distributed memory systems, the authors of [12] proposed a parallel algorithm of the classical Gaussian model to support real-time video applications depending on CD.…”