“…Most of the existing methods inspect a pre-specified model component (e.g., individual BERT layers) in a top-down manner. A typical approach first takes aim at specific linguistic phenomena that would be captured by the target components, and then trains a probing classifier that predicts the chosen linguistic phenomena from the target components (Bau et al, 2018;Giulianelli et al, 2018;Dalvi et al, 2019;Lakretz et al, 2019;Kovaleva et al, 2019;Goldberg, 2019;Petroni et al, 2019;Hewitt and Manning, 2019;Jawahar et al, 2019;Durrani et al, 2020;Zhou and Srikumar, 2021;Cao et al, 2021;Jumelet et al, 2021).…”