“…Refer to (Li et al 2016a) for their definitions and explanations. We compare our approach with 14 existing counterparts, including HPNet (Liu et al 2017), JRL (Wang et al 2017), VeSPA (Sarfraz et al 2017), WPAL (Yu et al 2016), GAM (Fabbri, Calderara, and Cucchiara 2017), GRL (Zhao et al 2018), LGNet (Liu et al 2018), PGDM (Li et al 2018), VSGR, RCRA (Zhao et al 2019), I²ANet (Ji et al 2019), JLPLS (Tan et al 2019), CoCNN, and DCL (Wang et al 2019), as shown in Table 2. The samples in the RAP dataset are collected from real-world surveillance scenarios and contain fewer distractions than those in WIDER-Attribute.…”