“…Jiang et al (2020a) build a multilingual knowledge probing benchmark based on LAMA. Many studies focus on probing specific kinds of knowledge in PLMs, such as linguistic knowledge (Lin et al, 2019; Tenney et al, 2019; Liu et al, 2019a; Hewitt and Manning, 2019; Goldberg, 2019; Warstadt et al, 2019), semantic knowledge (Tenney et al, 2019; Wallace et al, 2019; Ettinger, 2020) and world knowledge (Davison et al, 2019; Bouraoui et al, 2020; Forbes et al, 2019; Zhou et al, 2019; Roberts et al, 2020; Tamborrino et al, 2020). Recently, some studies have questioned the reliability of PLMs as knowledge bases by uncovering spurious correlations with surface forms (Poerner et al, 2020; Shwartz et al, 2020) and sensitivity to "negation" and "mispriming" (Kassner and Schütze, 2020b).…”