2023
DOI: 10.1038/s41598-022-27332-2
|View full text |Cite
|
Sign up to set email alerts
|

Decoding of the speech envelope from EEG using the VLAAI deep neural network

Abstract: To investigate the processing of speech in the brain, commonly simple linear models are used to establish a relationship between brain signals and speech features. However, these linear models are ill-equipped to model a highly-dynamic, complex non-linear system like the brain, and they often require a substantial amount of subject-specific training data. This work introduces a novel speech decoder architecture: the Very Large Augmented Auditory Inference (VLAAI) network. The VLAAI network outperformed state-o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
12
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2
1

Relationship

3
2

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 41 publications
(87 reference statements)
1
12
0
Order By: Relevance
“…As reported in Accou et al (2023), correlation distribution of the FCNN and CNN models remained very similar when evaluated on unseen stimulus segments from the same subjects used during training (median: r = 0.14 and r = 0.14 respectively), but also when evaluating it on different subjects from the DTU dataset (median: r = 0.11 and r = 0.14). These findings indicate a good generalization of the model across subjects and speech content (see section 3.5).…”
Section: Robust Decoding Of the Speech Envelope From Eeg Recordings T...supporting
confidence: 63%
See 2 more Smart Citations
“…As reported in Accou et al (2023), correlation distribution of the FCNN and CNN models remained very similar when evaluated on unseen stimulus segments from the same subjects used during training (median: r = 0.14 and r = 0.14 respectively), but also when evaluating it on different subjects from the DTU dataset (median: r = 0.11 and r = 0.14). These findings indicate a good generalization of the model across subjects and speech content (see section 3.5).…”
Section: Robust Decoding Of the Speech Envelope From Eeg Recordings T...supporting
confidence: 63%
“…different signal-to-noise ratios, inserted phones or speakers). To illustrate good practices, some generalization experiments were conducted in Accou et al (2023): the authors trained a model on their dataset and they evaluated it on a publicly available dataset (Fuglsang et al 2017).…”
Section: Benchmarking Model Evaluation Using Public Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…Yet, previous studies applied these models to different protocols, namely auditory attention decoding or a match-mismatch classification paradigm, and were often trained subject-independently. For an identical protocol, Thornton et al (2022) and Accou et al (2023) recently demonstrated that reconstruction accuracy (decoding the speech envelope from EEG data) for subject-dependent deep neural networks is higher compared to linear models. Here, we replicate these results: neural envelope tracking benefits from techniques that capture both linear and nonlinear relationships.…”
Section: Beyond Linear Componentsmentioning
confidence: 99%
“…Later research attempted to introduce non-linearity, using deep neural networks. Such architectures relied on simple fully connected layers (de Taillez et al, 2020), recurrent layers (Monesi et al, 2020; Accou et al, 2023), or even recently transformer-based architectures (Défossez et al, 2023). For a global overview of EEG-based deep learning studies see Puffay et al (2023a).…”
Section: Introductionmentioning
confidence: 99%