“…Moreover, the point of view is consistently at street level, without large changes in vertical orientation. Non-robotics datasets are typically not collected as sequences of frames [22], [23], [38], [69], [80], [81], [116]-[118], [122], [125], [190], [215], [242], [244], [246], [251]. In many cases, they are assembled from collections of online images, with variable viewpoints …

[122] 2008 Urban City ∼6k Label
Holidays [118] 2008 Outdoor World ∼2k Label
Eynsham [21] 2009 Urban City ∼70k GPS
St. Lucia [240], [241] 2010 Urban City ∼66k GPS
European Cities 50k [22] 2010 Urban Continent ∼50k Label
Geotagged StreetView [23] 2010 Urban City ∼17k GPS
Rome 16k [242] 2010 Urban City ∼16k Pose
Dubrovnik 6k [242] 2010 Urban City ∼6.8k Pose
San Francisco [243] 2011 Urban City ∼1.06M GPS
Alderley [45] 2012 Urban City ∼31k GPS
7 Scenes [244] 2013 Indoor Building ∼43k Pose
Nordland [155] 2013 Outdoor Region ∼143k GPS
Google StreetView 62k [114] 2014 Urban City ∼62k GPS
Freiburg Across Seasons [192], [245] 2014 Urban City ∼43k GPS
Cambridge Landmarks [215] 2015 Urban City ∼10.8k Pose
Paris500k [246] 2015 Urban City ∼504k Label
Pittsburgh [117] 2015 Urban City ∼278k GPS
Landmarks-full [80], [125] 2016 Urban World ∼192k Label
NCLT [247] 2016 Outdoor + Indoor...…”