“…Convolutional neural networks (CNN) [1,2] have recently demonstrated superior performance on many tasks such as image classification [3,4,5], object detection [6,7,8,9,10,11], object tracking [12,13,14], text detection [15,16], text recognition [17,18,19], local feature description [20], video classification [21,22,23], human pose estimation [24,25,26], scene recognition [27,28] and scene labelling [29,30].…”