“…In the two-step procedure, the F -test is not robust against outliers (Ronchetti, 1982). Hence, even one outlier is enough to inflate the first-stage F leading to a false impression that the instrument is strong, while it is weak (Klooster and Zhelonkin, 2024), which eventually results in incorrect inference in the second stage. On the other hand, an outlier could also deflate the F -statistic, i.e., the instrument is strong, but due to an outlier it seems weak.…”