Places: A 10 Million Image Database for Scene Recognition

Zhou, Bolei; Lapedriza, Àgata; Khosla, Aditya; Oliva, Aude; Torralba, Antonio

doi:10.1109/tpami.2017.2723009

Cited by 2,921 publications

(2,329 citation statements)

References 33 publications

(61 reference statements)

Supporting

Mentioning

2,188

Contrasting

Unclassified

Order By: Relevance

“… FABMAP [1]: This is a state-of-the-art place recognition algorithm built on top of handcrafted features  SeqSLAM [14]: This is a sequence-based place recognition approach, which have demonstrated state-of-the-art performances on mapping environments across seasons, weather conditions and different times of a day.  Places365 [22]: This is a CNN-based scene recognition model trained to recognize 365 scene types. The model was trained on over two million pictures.…”

Section: E Comparison Methodsmentioning

confidence: 99%

See 1 more Smart Citation

Learning Context Flexible Attention Model for Long-Term Visual Place Recognition

Chen

Liu

et al. 2018

IEEE Robot. Autom. Lett.

101

View full text Add to dashboard Cite

Section: E Comparison Methodsmentioning

confidence: 99%

“…Rather than recognizing specific places, networks have also been trained for recognizing the types of places [22]. However, this scene recognition task is different in nature from visual place recognition: image under the same scene category can come from different places.…”

Section: A Visual Place Recognition With Cnnsmentioning

confidence: 99%

Learning Context Flexible Attention Model for Long-Term Visual Place Recognition

Chen

Liu

et al. 2018

IEEE Robot. Autom. Lett.

101

View full text Add to dashboard Cite

“…The Places dataset [44] consists Urban Sci. 2018, 2, 78 7 of 19 of around 10 million images depicting indoor and outdoor views.…”

Section: Scene Classificationmentioning

confidence: 99%

Quantifying Urban Surroundings Using Deep Learning Techniques: A New Proposal

2018

View full text Add to dashboard Cite

Abstract:The assessments on human perception of urban spaces are essential for the management and upkeep of surroundings. A large part of the previous studies is dedicated towards the visual appreciation and judgement of various physical features present in the surroundings. Visual qualities of the environment stimulate feelings of safety, pleasure, and belongingness. Scaling such assessments to cover city boundaries necessitates the assistance of state-of-the-art computer vision techniques. We developed a mobile-based application to collect visual datasets in the form of street-level imagery with the help of volunteers. We further utilised the potential of deep learning-based image analysis techniques in gaining insights into such datasets. In addition, we explained our findings with the help of environment variables which are related to individual satisfaction and wellbeing.

show abstract

“…As the foundation for our spatial attention model, we used a goal-directed CNN with a deep architecture 10 trained for scene categorization 18,19 . This architecture consists of 18 spatiallyselective layers that compute alternating convolution and non-linear max-pooling operations (Fig.…”

Section: Spatial Attention Model Definitionmentioning

confidence: 99%

“…To facilitate straightforward interpretation of which factors drive significant prediction of eye movements from spatial priority maps reconstructed from fMRI activity, these steps were excluded from our model. 10,18,19 . (b) Unit activity was extracted from the five pooling layers to sample activity from across the CNN hierarchy.…”

Section: Spatial Attention Model Definitionmentioning

confidence: 99%

Predicting eye movements from deep neural network activity decoded from fMRI responses to natural scenes

Chun

2017

Preprint

View full text Add to dashboard Cite

Computational models of selective spatial attention can reliably predict eye movements to complex images. However, researchers lack a simple way to measure covert representations of spatial attention in the brain and their link to overt eye movement behavior, especially in response to natural scenes. Here, we predict eye movement patterns from spatial priority maps reconstructed from brain activity measured with functional magnetic resonance imaging (fMRI). First, we define a computational spatial attention model using a deep convolutional neural network (CNN) pre-trained for scene categorization. Next, we decode CNN unit activity from fMRI activity and reconstruct spatial priority maps by applying our computational spatial attention model to decoded CNN activity. Finally, we predict eye movements in a subsequent behavioral experiment within and between individuals using reconstructed spatial priority maps. These results demonstrate that features represented in CNN unit activity can guide spatial attention and eye movements, providing a crucial link between CNN models, brain activity, and behavior.

show abstract

Places: A 10 Million Image Database for Scene Recognition

Cited by 2,921 publications

References 33 publications

Learning Context Flexible Attention Model for Long-Term Visual Place Recognition

Learning Context Flexible Attention Model for Long-Term Visual Place Recognition

Quantifying Urban Surroundings Using Deep Learning Techniques: A New Proposal

Predicting eye movements from deep neural network activity decoded from fMRI responses to natural scenes

Contact Info

Product

Resources

About