“…Consequently, we briefly discuss their implementation here. In MPI the MPI_Barrier is implemented using the Butterfly or Pair Exchange (PE) [18,24] algorithm. This can easily be visualized by letting each of the n processes be represented by a vertex in a hypercube of dimension log n, The barrier is performed in log n stages: in the first stage each process synchronizes with their neighbour along the first dimension of the hypercube, in the second stage each process synchronizes with their neighbour along the second dimension, and so on.…”