“…Thus, to avoid comparing apples to oranges, we break the listed works into four main groups based on which component of the SP model each improves. Specifically, works are grouped into (1) those that improve the coding step, including works by Yang et al. [53], Wang et al. [51] and Wang et al. [52]; (2) those that improve the pooling operator, including works by Yang et al. [53] and Koniusz et al. [33]; (3) those that enrich the spatial information captured by the model, namely the works by Khan et al. [31,32]; and finally (4) those that pool locally in the feature space, including works by Boureau et al. [8], Fanello et al. [19], and ours. Table 1 also includes the studies by Boureau et al. [7] and Chatfield et al. [10], two widely cited benchmarking studies that extensively evaluated the model using different combinations of components and parameters.…”