Self-Supervised Natural Image Reconstruction and Large-Scale Semantic Classification from Brain Activity

Gaziv, Guy; Beliy, Roman; Granot, Niv; Hoogi, Assaf; Strappini, Francesca; Golan, Tal; Irani, Michal

doi:10.1101/2020.09.06.284794

Cited by 3 publications

(11 citation statements)

References 65 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The use of CNNs is ubiquitous in image processing tasks, including image reconstruction. Specifically, encoder-decoder (Beliy et al, 2019;Gaziv et al, 2020), U-Net (Fang et al, 2020), generative adversarial network (Goodfellow et al, 2014), and variational autoencoder (Kingma and Welling, 2014) are popular architectures that adopt stacked convolutional layers to extract features at multiple levels. Shen et al (2019b) utilized a pretrained VGG-19-based DNN to extract hierarchical features from stimuli images (see Figure 3A).…”

Section: Convolutional Neural Network (Cnn)mentioning

confidence: 99%

“…In a follow-up study, Gaziv et al (2020) improved the reconstruction accuracy of BeliyEncDec by introducing a loss function based on the perceptual similarity measure (Zhang et al, 2018). To calculate perceptual similarity loss, the authors first extracted multilayer features from original and reconstructed images using VGG and then compared the extracted features layerwise.…”

Section: Deterministic Encoder-decoder Modelsmentioning

confidence: 99%

“…To calculate perceptual similarity loss, the authors first extracted multilayer features from original and reconstructed images using VGG and then compared the extracted features layerwise. To distinguish it from BeliyEncDec, we refer to the framework proposed by Gaziv et al (2020) as GazivEncDec.…”

Section: Deterministic Encoder-decoder Modelsmentioning

confidence: 99%

“…candidate images, subjects are instructed to choose the one that appears to have a higher resemblance to the original. Such behavioral studies can be conducted by employing human evaluators or using Amazon Mechanical Turk 6 (Seeliger et al, 2018;Gaziv et al, 2020).…”

Section: Human-based Evaluationmentioning

confidence: 99%

“…Despite the wide adoption of SSIM as a perceptual metric, it compares poorly with many characteristics of human perception (Zhang et al, 2018). Several studies, including Güçlütürk et al (2017), Qiao et al (2018), Mozafari et al (2020), andGaziv et al (2020), emphasize the importance of higher-level perceptual similarity over lower-level metrics in evaluation because of the better correspondence of higher-level perceptual similarity to human perceptual judgments (Zhang et al, 2018).…”

Section: Psmmentioning

confidence: 99%

See 4 more Smart Citations

Natural Image Reconstruction From fMRI Using Deep Learning: A Survey

et al. 2021

View full text Add to dashboard Cite

With the advent of brain imaging techniques and machine learning tools, much effort has been devoted to building computational models to capture the encoding of visual information in the human brain. One of the most challenging brain decoding tasks is the accurate reconstruction of the perceived natural images from brain activities measured by functional magnetic resonance imaging (fMRI). In this work, we survey the most recent deep learning methods for natural image reconstruction from fMRI. We examine these methods in terms of architectural design, benchmark datasets, and evaluation metrics and present a fair performance evaluation across standardized evaluation metrics. Finally, we discuss the strengths and limitations of existing studies and present potential future directions.

show abstract

Section: Convolutional Neural Network (Cnn)mentioning

confidence: 99%

Section: Deterministic Encoder-decoder Modelsmentioning

confidence: 99%

Section: Deterministic Encoder-decoder Modelsmentioning

confidence: 99%

Section: Human-based Evaluationmentioning

confidence: 99%

Section: Psmmentioning

confidence: 99%

See 3 more Smart Citations

Natural Image Reconstruction From fMRI Using Deep Learning: A Survey

et al. 2021

View full text Add to dashboard Cite

show abstract

Reconstructing Voice Identity from Noninvasive Auditory Cortex Recordings

Lamothe,

Thoret,

Trapeau

et al. 2024

Preprint

View full text Add to dashboard Cite

The cerebral processing of voice information is known to engage Temporal Voice Areas (TVAs) that respond preferentially to conspecific vocalizations. But how voice information related to the stable physical characteristics of the speaker such as gender, age or identity is represented by neuronal populations in these areas remains poorly understood. Here we used a deep neural network (DNN) to generate a high-level, small-dimension representational space of voice stimuli—the ‘voice latent space’ (VLS)—and examined its linear relation with cerebral activity via encoding, representational similarity and decoding analyses. We find that the VLS maps onto fMRI measures of cerebral activity in response to tens of thousands of voice stimuli from hundreds of different speaker identities, and better accounts for the representational geometry for speaker identity in the TVAs than in A1. Moreover, the VLS allowed TVA-based reconstructions of voice stimuli that preserved important aspects of speaker gender and identity as assessed by both machine classifiers and human listeners. These results demonstrate that a low-dimensional, DNN-derived space accounts well for cerebral voice representations and provide insights into representational differences between A1 and the TVAs, paving the way to noninvasive brain-computer interface applications.

show abstract

Computational reconstruction of mental representations using human behavior

Caplette¹,

Turk‐Browne²

2022

Preprint

View full text Add to dashboard Cite

Revealing the contents of mental representations is a longstanding goal of cognitive science. However, there is currently no general framework for providing direct access to representations of high-level visual concepts. We asked participants to indicate what they perceived in images synthesized from random visual features in a deep neural network. We then inferred a mapping between the semantic features of their responses and the visual features of the images. This allowed us to reconstruct the mental representation of virtually any common visual concept, both those reported and others extrapolated from the same semantic space. We successfully validated 270 of these reconstructions as containing the target concept in a separate group of participants. The visual-semantic mapping uncovered with our method further generalized to new stimuli, participants, and tasks. Finally, it allowed us to reveal how the representations of individual observers differ from each other and from those of neural networks.

show abstract

Self-Supervised Natural Image Reconstruction and Large-Scale Semantic Classification from Brain Activity

Cited by 3 publications

References 65 publications

Natural Image Reconstruction From fMRI Using Deep Learning: A Survey

Natural Image Reconstruction From fMRI Using Deep Learning: A Survey

Reconstructing Voice Identity from Noninvasive Auditory Cortex Recordings

Computational reconstruction of mental representations using human behavior

Contact Info

Product

Resources

About