2021
DOI: 10.48550/arxiv.2106.09622
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Extracting Different Levels of Speech Information from EEG Using an LSTM-Based Model

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2

Relationship

3
1

Authors

Journals

citations
Cited by 4 publications
(9 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…This paradigm can also be solved in a non-linear fashion with neural networks (e.g. Accou et al, 2021; Monesi et al, 2021; Bollens et al, 2022). Accou et al (2021) showed that the accuracy of a neural network solving a match-mismatch task could be used to estimate the speech reception threshold.…”
Section: Methods To Measure Neural Trackingmentioning
confidence: 99%
“…This paradigm can also be solved in a non-linear fashion with neural networks (e.g. Accou et al, 2021; Monesi et al, 2021; Bollens et al, 2022). Accou et al (2021) showed that the accuracy of a neural network solving a match-mismatch task could be used to estimate the speech reception threshold.…”
Section: Methods To Measure Neural Trackingmentioning
confidence: 99%
“…Most studies we report here used acoustic features, such as the temporal envelope (e.g., Ciccarelli et al 2019, de Taillez et al 2020, Lu et al 2021, Su et al 2021, Xu et al 2022a, 2022b, or the Mel spectrogram (e.g. Krishna et al 2020, Kuruvila et al 2021, Monesi et al 2021. A study even used the fundamental frequency of the voice -f0-(Puffay et al 2022).…”
Section: Speechmentioning
confidence: 99%
“…de CheveignĂ© et al 2018, Monesi et al 2020) or more (e.g. Monesi et al 2021, Accou et al 2021a, Puffay et al 2022 speech segment candidates to associate the EEG segment with.…”
Section: Introductionmentioning
confidence: 99%
“…Considering that the power spectrum of the alpha frequency band, which was adopted in [23], may not represent the disorder degree of EEG resulting from the dynamic change of the attended auditory stimuli, the EEG feature extraction method in [23] was modified in two aspects. On the one hand, for the KUL dataset, each EEG segment was decomposed into five classical frequency bands, delta (1-3 Hz), theta (4-7 Hz), alpha (8-13 Hz), beta (14-30 Hz) and gamma (31)(32)(33)(34)(35)(36)(37)(38)(39)(40)(41)(42)(43)(44)(45)(46)(47)(48)(49)(50) [45,46]. It should be noted that for both DTU and PKU datasets, the frequency band of 31-50 Hz was not included because the sampling rates of EEG in these two datasets were set as 70 Hz and 64 Hz, respectively, to keep consistent with those of the baselines.…”
Section: Multi-band De Extractionmentioning
confidence: 99%
“…In [12,34], the common spatial pattern (CSP)-based EEG spatial enhancement strategy was combined with the CNN-based AAD model to improve its performance. In [7,35,36], the long short-term memory (LSTM) was combined with the CNN-based AAD model to improve its performance by adopting LSTM to learn the long-term dependence between EEG responses and auditory stimuli.…”
Section: Introductionmentioning
confidence: 99%