We thank Mdege et al. [1] for their response to our critical appraisal [2] of the stepped wedge design (SWD). They agree with us that the SWD has many disadvantages compared with the cluster randomized controlled trial (cluster RCT) and that the cluster RCT is preferable in most circumstances. However, they also maintain that the SWD may be useful when the alternative is to conduct no randomized trial at all and that some of the cluster RCT variants that we suggest in our article are in fact variants of the SWD. To clarify the latter and better understand the discussion about the advantages and disadvantages of the SWD compared with the cluster RCT, it is important to point out the key difference between the two designs.So, what actually differentiates the SWD from the cluster RCT? In our view, it is the repeated measurement of data from all clusters at each step. As we explained in our critical appraisal, other features of the SWD are not unique to this design but can also be part of a cluster RCT: rolling out an intervention to all clusters and sequential implementation of the intervention [2]. We also explained that rolling out the intervention to control clusters after the final data collection will result in a shorter duration to conduct a cluster RCT and, in addition, has the advantage that the intervention will only be rolled out if proven to be effective [2]. The repeated measurement of data from all clusters at each step, which is the key characteristic that differentiates the SWD from the cluster RCT, has two very important drawbacks: it puts a heavy burden on patients, caregivers, and researchers, and increases the risk of contamination and attrition [2]. Thus, in our opinion, the SWD should therefore only be considered in the absence of these two drawbacks.Then when can the SWD be used instead of the cluster RCT? We agree with Mdege et al. that the SWD could be applied under some circumstances. However, the set of circumstances in which the SWD may be the preferred alternative (i.e., in the absence of the two aforementioned drawbacks) are rather limited. For one thing, all necessary data must be routinely collected at the appropriate time intervals (i.e., at the inclusion of each additional cluster) and in a reliable and valid fashion without additional burden for patients, caregivers, and researchers. Unfortunately, situations in which this applies are rather exceptional, which in our opinion severely limits the applicability of the SWD. Often, the routine collection of data is incomplete and not all necessary data are collected (in particular relevant covariates).There may be one other situation in which the SWD may be preferable to a cluster RCT: when the number of clusters is extremely low and the researcher has no control over the number of clusters. In both designs, the power increases much more with an increasing number of clusters than with