“…Recently, a lot of CNN-based methods have been proposed and advanced the performance of crowd counting. Most of them mainly solve various challenges of crowd counting in a fullysupervised manner, including large scale variations [4], [5], [24], [25], [26], [27], [28], [29], [30], attentive feature extraction [31], [32], [33], [34], [35], label noises [6], [7], empirical gaussian kernel [36], [37], [38], estimation uncertainty [39], [40], structural constraints [8], [9], [41], [42], and etc. These methods require a great number of labeled data in the training process which are rather burdensome for crowd counting.…”