High-resolution image reconstruction with latent diffusion models from human brain activity

Takagi, Yu; Nishimoto, Shinji

doi:10.1101/2022.11.18.517004

Cited by 46 publications

(51 citation statements)

References 48 publications

(97 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, variational autoencoders (VAEs) have been applied to infer low-dimensional representations of single-trial neural population dynamics (Pandarinath et al, 2018), while generative adversarial networks (GANs) have been used for the task of spike-train generation (Molano-Mazon et al, 2018, Ramesh et al, 2019), as well as to decode images from single neuron and fMRI data (Ponce et al, 2019, Lin et al, 2022). The recently proposed denoising diffusion probabilistic models (DDPMs) have also been applied to improve neural decoding performance, in particular leveraging latent diffusion models (Rombach et al, 2022) to predict viewed images from fMRI data (Takagi and Nishimoto, 2022, Chen et al, 2023).…”

Section: Introductionmentioning

confidence: 99%

Generating realistic neurophysiological time series with denoising diffusion probabilistic models

Vetter,

Macke,

Gao

2023

Preprint

View full text Add to dashboard Cite

In recent years, deep generative models have had a profound impact in engineering and sciences, revolutionizing domains such as image and audio generation, as well as advancing our ability to model scientific data. In particular, Denoising Diffusion Probabilistic Models (DDPMs) have been shown to accurately model time series as complex high-dimensional probability distributions. Experimental and clinical neuroscience also stand to benefit from this progress, since accurate modeling of neurophysiological time series, such as electroencephalography (EEG), electrocorticography (ECoG), and local field potential (LFP) recordings, and their synthetic generation can enable or improve a variety of neuroscience-specific applications. Here, we present a method for modeling multi-channel and densely sampled neurophysiological recordings using DDPMs, which can be flexibly applied to different recording modalities and experimental configurations. First, we show that DDPMs can generate realistic synthetic data for a variety of datasets including different recording techniques (LFP, ECoG, EEG) and species (rat, macaque, human). DDPM-generated time series accurately capture single- and multi-channel statistics such as frequency spectra and phase-amplitude coupling, as well as fine-grained and dataset-specific features such as sharp wave-ripples. In addition, synthetic time series can be generated based on additional information like experimental conditions or brain states. We demonstrate the utility and flexibility of DDPMs in several neuroscience-specific analyses, such as brain-state classification and imputation of missing channels to improve neural decoding. In summary, DDPMs can serve as accurate generative models of neurophysiological recordings, and have a broad utility in the probabilistic generation of synthetic time series for many applications relevant to neuroscience.

show abstract

Section: Introductionmentioning

confidence: 99%

Generating realistic neurophysiological time series with denoising diffusion probabilistic models

Vetter,

Macke,

Gao

2023

Preprint

View full text Add to dashboard Cite

show abstract

“…Recently, by incorporating the assistance of deep neural networks (DNNs) 12,13 and generative models [14][15][16][17][18][19][20][21] , several studies have achieved higher-fidelity natural image reconstruction [22][23][24][25][26] , which has become a tool for investigating the visual processing in the brain (e.g., visual representation, attention 27 , and illusion 28 ).…”

Section: Introductionmentioning

confidence: 99%

“…Other studies have decoded seen natural images 9, 10 or videos 11 using visual features inspired by neurophysiological discoveries. Recently, by incorporating the assistance of deep neural networks (DNNs) 12, 13 and generative models 14–21 , several studies have achieved higher-fidelity natural image reconstruction 22–26 , which has become a tool for investigating the visual processing in the brain (e.g., visual representation, attention 27 , and illusion 28 ).…”

Section: Introductionmentioning

confidence: 99%

Mental image reconstruction from human brain activity

Koide-Majima

Majima

2023

Preprint

Self Cite

View full text Add to dashboard Cite

Visual images perceived by humans can be reconstructed from their brain activity. However, visualization (externalization) of mental images remains challenging. In this study, we demonstrated that our previous image reconstruction method heavily relies on low-level visual information decoded from the brain and cannot efficiently utilize semantic information that would be recruited during mental imagery. To address this limitation, we extended our previous reconstruction method as a Bayesian estimation framework and introduced the assistance of semantic information into it. Our proposed framework successfully reconstructed both seen and imagined images from the brain activity. Interestingly, we found that line components in imagery reconstructions of geometric shapes were emphasized compared to those in seen image reconstructions, which might reflect the sharpening effect caused by the top-down process in the brain. These results suggest that our framework would provide a fundamental technology for directly investigating the subjective contents of the brain.

show abstract

“…Deep convolutional neural networks (DCNNs) have entered the computational modeling scene with high predictive performance of both object category and brain dynamics during object categorization tasks (1)(2)(3)(4). These predictions on brain dynamics are not limited to lowlevel image statistics but also include high-level features such as animacy, object category and semantics (5)(6)(7)(8)(9). In fact, DCNNs' predictive performance on visual processes surpassed hand-engineered, biologically-inspired models (e.g.…”

Section: Introductionmentioning

confidence: 99%

Human visual cortex and deep convolutional neural network care deeply about object background

Loke

Seijdel

Snoek

et al. 2023

Preprint

View full text Add to dashboard Cite

Deep convolutional neural networks (DCNNs) are able to predict brain activity during object categorization tasks, but factors contributing to this predictive power are not fully understood. Our study aimed to investigate the factors contributing to the predictive power of DCNNs in object categorization tasks. We compared the activity of four DCNN architectures with electroencephalography (EEG) recordings obtained from 62 human subjects during an object categorization task. Previous physiological studies on object categorization have highlighted the importance of figure-ground segregation - the ability to distinguish objects from their backgrounds. Therefore, we set out to investigate if figureground segregation could explain DCNNs predictive power. Using a stimuli set consisting of identical target objects embedded in different backgrounds, we examined the influence of object background versus object category on both EEG and DCNN activity. Crucially, the recombination of naturalistic objects and experimentally-controlled backgrounds creates a sufficiently challenging and naturalistic task, while allowing us to retain experimental control. Our results showed that early EEG activity (<100ms) and early DCNN layers represent object background rather than object category. We also found that the predictive power of DCNNs on EEG activity is related to processing of object backgrounds, rather than categories. We provided evidence from both trained and untrained (i.e. random weights) DCNNs, showing figure-ground segregation to be a crucial step prior to the learning of object features. These findings suggest that both human visual cortex and DCNNs rely on the segregation of object backgrounds and target objects in order to perform object categorization. Altogether, our study provides new insights into the mechanisms underlying object categorization as we demonstrated that both human visual cortex and DCNNs care deeply about object background.

show abstract

High-resolution image reconstruction with latent diffusion models from human brain activity

Cited by 46 publications

References 48 publications

Generating realistic neurophysiological time series with denoising diffusion probabilistic models

Generating realistic neurophysiological time series with denoising diffusion probabilistic models

Mental image reconstruction from human brain activity

Human visual cortex and deep convolutional neural network care deeply about object background

Contact Info

Product

Resources

About