“…In the past, many different methods have been proposed to understand the role of context [38,1,25], scale [5,37,20,26] and sampling [21,32,2,3]. Considerable importance has been given to leveraging features of different layers of the network and designing architectures for explicitly encoding context/multi-scale information [26,23,39,40] for classification.…”