“…The typical classifiers include SVM [22,23], boosting [6,24], random forest [25], Hough forest [26], structural learning [7,8,27], sparse coding [28][29][30][31][32][33], discriminative feature learning [34], multiple instance learning [35], co-training technique [36], tracking-learning-detection [37], weakly supervised learning [38,39], and-or graphs [14], coupled 2-layer model [40], etc. However, most of these classifiers are limited by their shallow or linear nature structures while object appearance variations are complex, highly nonlinear, and time-varying.…”