“…Het-Pipe [Park et al, 2020] addresses the additional problem of heterogeneous GPUs by grouping them into virtual workers and running pipeline parallelism within each virtual worker, while relying on data parallelism between workers. Varuna [Athlur et al, 2021] [Park et al, 2020] DP, PP Parameter Server LinProg for PP Pipe-torch [Zhan and Zhang, 2019] DP, PP Async Update DynProg for DP, PP, GPU allocation Varuna [Athlur et al, 2021] DP, PP Opportunistic Backward Scheduling Heuristic PP partition, Bruteforce for DP, PP depth Gems [Jain et al, 2020a] DP, PP Bidirectional Pipeline -Chimera [Li and Hoefler, 2021] DP,PP 1F1B, Bidirectional Pipeline Greedy mini-batch size, Bruteforce for DP, PP depth tivation recomputations and respective backward passes are scheduled opportunistically.…”