Abstract. We present and analyze a new space-time parallel multigrid method for parabolic equations. The method is based on arbitrarily high order discontinuous Galerkin discretizations in time, and a finite element discretization in space. The key ingredient of the new algorithm is a block Jacobi smoother. We present a detailed convergence analysis when the algorithm is applied to the heat equation, and determine asymptotically optimal smoothing parameters, a precise criterion for semi-coarsening in time or full coarsening, and give an asymptotic two grid contraction factor estimate. We then explain how to implement the new multigrid algorithm in parallel, and show with numerical experiments its excellent strong and weak scalability properties.Key words. Space-time parallel methods, multigrid in space-time, DG-discretizations, strong and weak scalability, parabolic problems AMS subject classifications. 65N55, 65F10, 65L601. Introduction. About ten years ago, clock speeds of processors have stopped increasing, and the only way to obtain more performance is by using more processing cores. This has led to new generations of supercomputers with millions of computing cores, and even today's small devices are multicore. In order to exploit these new architectures for high performance computing, algorithms must be developed that can use these large numbers of cores efficiently. When solving evolution partial differential equations, the time direction offers itself as a further direction for parallelization, in addition to the spatial directions, and the parareal algorithm [29,31,1,37,18,9] has sparked renewed interest in the area of time parallelization, a field that is now just over fifty years old, see the historical overview [8]. We are interested here in space-time parallel methods, which can be based on the two fundamental paradigms of domain decomposition or multigrid. Domain decomposition methods in spacetime lead to waveform relaxation type methods, see [17,7,19] for classical Schwarz waveform relaxation, [12,13,10,11,2] for optimal and optimized variants, and [28,33,15] for Dirichlet-Neumann and Neumann-Neumann waveform relaxation. The spatial decompositions can be combined with parareal to obtain algorithms that run on arbitrary decompositions of the space-time domain into space-time subdomains, see [32,14]. Space-time multigrid methods were developed in [20,30,41,25,40,26,27,43], and reached good F-cycle convergence behavior when appropriate semi-coarsening and extension operators are used. For a variant for non-linear problems, see [4,36,35].We present and analyze here a new space-time parallel multigrid algorithm that has excellent strong and weak scalability properties on large scale parallel computers. As a model problem we consider the heat equation in a bounded domain Ω ⊂ R d , d = 1, 2, 3 with boundary Γ := ∂Ω on the bounded time interval [0, T ],