“…These preferences are usually stated in terms of information-theoretic measures, such as pointwise mutual information. Since hand-annotated treebanks usually do not contain enough material to yield reliable bilexical statistics, these statistics have instead been extracted from raw text (Volk, 2001) or from automatically tagged (Ratnaparkhi, 1998), chunk-parsed (Volk, 2002) or parsed (Hindle and Rooth, 1993; Pantel and Lin, 2000; Mirroshandel et al., 2012) corpora, resulting in auxiliary distributions. Since these seminal works on PP attachment, parsers have become faster (Kübler et al., 2009) and more accurate (Chen and Manning, 2014), opening the possibility of obtaining better co-occurrence statistics.…”
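
As an illustration of the kind of bilexical statistic mentioned in the passage, the following minimal sketch computes pointwise mutual information, PMI(h, d) = log2( p(h, d) / (p(h) p(d)) ), from a list of (head, dependent) pairs. The function name `pmi_table`, the toy pairs and the use of unsmoothed maximum-likelihood estimates are assumptions made for this example only; they are not taken from any of the cited works, which differ in how they extract and smooth their counts.

```python
import math
from collections import Counter


def pmi_table(pairs):
    """Return the PMI (in bits) of every observed (head, dependent) pair.

    Probabilities are plain relative frequencies over the pair list,
    with no smoothing (an assumption made for this sketch).
    """
    pair_counts = Counter(pairs)
    head_counts = Counter(head for head, _ in pairs)
    dep_counts = Counter(dep for _, dep in pairs)
    total = len(pairs)

    scores = {}
    for (head, dep), n in pair_counts.items():
        p_joint = n / total
        p_head = head_counts[head] / total
        p_dep = dep_counts[dep] / total
        scores[(head, dep)] = math.log2(p_joint / (p_head * p_dep))
    return scores


# Hypothetical attachment pairs, as they might be read off an
# automatically parsed corpus (toy data, not real counts).
pairs = [
    ("eat", "with_fork"),
    ("eat", "with_fork"),
    ("pizza", "with_anchovies"),
    ("eat", "with_anchovies"),
]

scores = pmi_table(pairs)
for pair, value in sorted(scores.items()):
    print(pair, round(value, 3))
```

In this toy data the pair (pizza, with_anchovies) receives a positive score and (eat, with_anchovies) a negative one, which is the kind of lexical preference a parser could consult when choosing between a noun and a verb attachment. With real corpora, low-frequency pairs make unsmoothed PMI unreliable, which is one reason larger automatically analysed corpora are attractive.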