“…How to effectively learn with such data has attracted much attention in the data mining community. Formally, the paradigm is referred to as Partial Label (PL) learning [6-9, 15, 27, 28, 37], also superset label learning [11,16,17] or ambiguous label learning [2,3,5,13,33]. The PL scenarios are mainly caused by the expensive cost of acquiring explicit labels for instances, and they emerge in many real-world applications, e.g., multimedia content analysis [5,33], web mining [18], and ecoinformatics [16], etc.…”