From photos to sketches - how humans and deep neural networks process objects across different levels of visual abstraction

Singer, Johannes; Seeliger, Katja; Kietzmann, Tim C.; Hebart, Martin N.

doi:10.1167/jov.22.2.4

“…This suggests that object recognition can be resolved with the same amount of processing resources for different levels of visual abstraction of the image. This is consistent with previous computational work showing that representations for photographs and drawings at different levels of visual abstraction become highly similar when being processed in feedforward deep convolutional neural networks trained to categorize natural object images (Fan et al, 2018;Singer et al, 2022). While other work has demonstrated that additional recurrent processing is necessary for resolving degraded (Wyatte et al, 2012), occluded (Rajaei et al, 2019;Tang et al, 2018) or otherwise challenging images (Kar et al, 2019), our findings indicate that no additional mechanisms are needed for the robust recognition of abstract drawings.…”

Section: Discussionsupporting

confidence: 91%

“…As expected, photos and drawings showed the highest RDM correlation (r=0.79) while the correlation for photos and sketches (r=0.41) as well as the correlation between drawings and sketches (r=0.45) were lower. Next, to confirm that human subjects perceive the object images in the different types of depiction similarly at a conceptual level, we used previously acquired data (Singer et al, 2022) where workers on Amazon Mechanical Turk indicated which of three object images they thought was the odd-one out (Hebart et al, 2020). These triplet judgments were used to construct perceptual similarity matrices for each type of depiction separately, which we subsequently correlated to each other to estimate their representational similarity.…”

Section: Natural Object Images and Line Drawings Differ In Low-level ...mentioning

confidence: 99%

The spatiotemporal neural dynamics of object recognition for natural images and line drawings

Singer

¹

,

Cichy

²

,

Hebart

³

2022

Preprint

Self Cite

0

View full text Add to dashboard Cite

Drawings offer a simple and efficient way to communicate meaning. While line drawings capture only coarsely how objects look in reality, we still perceive them as resembling real-world objects. Previous work has shown that this perceived similarity is mirrored by shared neural representations for drawings and natural images, which suggests that similar mechanisms underlie the recognition of both. However, other work has proposed that representations of drawings and natural images become similar only after substantial processing has taken place, suggesting distinct mechanisms. To arbitrate between those alternatives, we measured brain responses resolved in space and time using fMRI and MEG, respectively, while participants viewed images of objects depicted as photographs, line drawings, or sketch-like drawings. Using multivariate decoding, we demonstrate that object category information emerged similarly fast and across overlapping regions in occipital and ventral-temporal cortex for all types of depiction, yet with smaller effects at higher levels of visual abstraction. In addition, cross-decoding between depiction types revealed strong generalization of object category information from early processing stages on. Finally, by combining fMRI and MEG data using representational similarity analysis, we found that visual information traversed similar processing stages for all types of depiction, yet with an overall stronger representation for photographs. Together our results demonstrate broad commonalities in the neural dynamics of object recognition across types of depiction, thus providing clear evidence for shared neural mechanisms underlying recognition of natural object images and abstract drawings.

show abstract

“…This suggests that object recognition can be resolved with the same amount of processing resources for different levels of visual abstraction of the image. This is consistent with previous computational work showing that representations for photographs and drawings at different levels of visual abstraction become highly similar when being processed in feedforward deep convolutional neural networks trained to categorize natural object images ( Fan et al, 2018 ; Singer et al, 2022 ). While other work has demonstrated that additional recurrent processing is necessary for resolving degraded ( Wyatte et al, 2012 ), occluded ( Tang et al, 2018 ; Rajaei et al, 2019 ), or otherwise challenging images ( Kar et al, 2019 ), our findings indicate that no additional mechanisms are needed for the robust recognition of abstract drawings.…”

Section: Discussionsupporting

confidence: 91%

“…As expected, photographs and drawings showed the highest RDM correlation ( r = 0.79) while the correlation for photographs and sketches ( r = 0.41) as well as the correlation between drawings and sketches ( r = 0.45) were lower. Next, to confirm that human subjects perceive the object images in the different types of depiction similarly at a conceptual level, we used previously acquired data ( Singer et al, 2022 ) where workers on Amazon Mechanical Turk indicated which of three object images they thought was the odd-one-out ( Hebart et al, 2020 ). These triplet judgments were used to construct perceptual similarity matrices for each type of depiction separately, which we subsequently correlated to each other to estimate their representational similarity.…”

Section: Resultsmentioning

confidence: 99%

“…To ensure that human participants perceive the stimuli in the different types of depiction similarly at a conceptual level, we used data from a previous study ( Singer et al, 2022 ) in which workers on Amazon Mechanical Turk had performed a triplet odd-one-out task ( Hebart et al, 2020 ) on the same stimuli as used here. In this task, participants were instructed to find the odd-one-out in triplets of object images belonging to the same type of depiction.…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

The Spatiotemporal Neural Dynamics of Object Recognition for Natural Images and Line Drawings

Singer

¹

,

Cichy

²

,

Hebart

³

2022

View full text Add to dashboard Cite

Drawings offer a simple and efficient way to communicate meaning. While line drawings capture only coarsely how objects look in reality, we still perceive them as resembling real-world objects. Previous work has shown that this perceived similarity is mirrored by shared neural representations for drawings and natural images, which suggests that similar mechanisms underlie the recognition of both. However, other work has proposed that representations of drawings and natural images become similar only after substantial processing has taken place, suggesting distinct mechanisms. To arbitrate between those alternatives, we measured brain responses resolved in space and time using fMRI and MEG, respectively, while human participants (female and male) viewed images of objects depicted as photographs, line drawings, or sketch-like drawings. Using multivariate decoding, we demonstrate that object category information emerged similarly fast and across overlapping regions in occipital, ventral-temporal and posterior parietal cortex for all types of depiction, yet with smaller effects at higher levels of visual abstraction. In addition, cross-decoding between depiction types revealed strong generalization of object category information from early processing stages on. Finally, by combining fMRI and MEG data using representational similarity analysis, we found that visual information traversed similar processing stages for all types of depiction, yet with an overall stronger representation for photographs. Together our results demonstrate broad commonalities in the neural dynamics of object recognition across types of depiction, thus providing clear evidence for shared neural mechanisms underlying recognition of natural object images and abstract drawings.SIGNIFICANCE STATEMENT:When we see a line drawing, we effortlessly recognize it as an object in the world despite its simple and abstract style. Here we asked to what extent this correspondence in perception is reflected in the brain. To answer this question, we measured how neural processing of objects depicted as photographs and line drawings with varying levels of detail (from natural images to abstract line drawings) evolves over space and time. We find broad commonalities in the spatiotemporal dynamics and the neural representations underlying the perception of photographs and even abstract drawings. These results indicate a shared basic mechanism supporting recognition of drawings and natural images.

show abstract

A novel feature-scrambling approach reveals the capacity of convolutional neural networks to learn spatial relations

Farahat,

Effenberger,

Vinck

2023

View full text Add to dashboard Cite

From photos to sketches - how humans and deep neural networks process objects across different levels of visual abstraction

Cited by 29 publications

References 48 publications

The spatiotemporal neural dynamics of object recognition for natural images and line drawings

The spatiotemporal neural dynamics of object recognition for natural images and line drawings

The Spatiotemporal Neural Dynamics of Object Recognition for Natural Images and Line Drawings

A novel feature-scrambling approach reveals the capacity of convolutional neural networks to learn spatial relations

Contact Info

Product

Resources

About