“…To circumvent manual data annotation, previous work used a distant supervision process requiring a knowledge base aligned to the website targeted for extraction (Gentile et al, 2015;Lockard et al, 2018), including for OpenIE extraction (Banko et al, 2007;Bronzi et al, 2013;Lockard et al, 2019). These methods, however, can only learn a website-specific model based on seed knowledge for the site, but cannot be generalized to the majority of websites with knowledge from new verticals, by long-tail specialists, and in different languages.…”