One of the most sought-after research areas in object detection is pedestrian detection owing to its applications especially in automated surveillance and robotics. Traditional methods use hand-crafted features to characterize pedestrians. In this work, we have pro-posed a new hand-crafted feature extraction method that concatenates shape, color and texture features; which is then classified by using Support Vector Machine (SVM). As in recent years, deep learning models such as Convolutional Neural Networks (CNNs) have become an eminent state of the art in detection challenges, which unlike the manually designed feature extraction mechanism, results in more accuracy. Therefore, we have also proposed a CNN network, a modification of the pre-trained ResNet-18 named as Multi-layer Feature Fused-ResNet (MF2-ResNet). We have used the proposed modification for (1) feature extraction; which is then classified by using Support Vector Machine (SVM); (2) End-to-End feature extraction and classification by the CNN network and (3) MF2-ResNet based Faster-RCNN to include region proposals in the detection pipeline. To evaluate the proposed method, it is compared with existing pre-trained CNNs. The MF2-ResNet based Faster R-CNN is compared with state-of-the-art detection methods. Three benchmark pedestrian datasets are used in this work: INRIA, NICTA and Daimler.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.