“…The metrics were the number of pages accepted into the correct category, the number of pages failed (not categorised that should have been), and those ambiguously categorised (accepted into more than one category, both where this double classification was valid and where one of the classifications was incorrect). Double classification is characteristic of subject classification [10], however, this is not necessarily a problem since many 'distinct' subject areas overlap. For instance, if the page subject is an icebreaking trawler then from our categories, oceans and transport could both be deemed correct classifications.…”