2016 24th European Signal Processing Conference (EUSIPCO) 2016
DOI: 10.1109/eusipco.2016.7760204
|View full text |Cite
|
Sign up to set email alerts
|

A system identification approach to determining listening attention from EEG signals

Abstract: Abstract-We still have very little knowledge about how our brains decouple different sound sources, which is known as solving the cocktail party problem. Several approaches; including ERP, time-frequency analysis and, more recently, regression and stimulus reconstruction approaches; have been suggested for solving this problem. In this work, we study the problem of correlating of EEG signals to different sets of sound sources with the goal of identifying the single source to which the listener is attending. He… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
13
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 16 publications
(13 citation statements)
references
References 11 publications
0
13
0
Order By: Relevance
“…Other methods for attention decoding include the forward modelling approach: predicting EEG from the auditory stimulus (Akram et al, 2016;Alickovic et al, 2016), canonical correlation analysis (CCA)-based methods (de Cheveigné et al, 2018), and Bayesian state-space modeling (Miran et al, 2018). The current state-of-the-art models are capable of classifying auditory attention in a two-speaker scenario with high accuracy (80-90% correct) over a data window with a length of approximately 10 s. However, to quickly detect a switch in attention, detection in much shorter windows, down to a few seconds, is required.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Other methods for attention decoding include the forward modelling approach: predicting EEG from the auditory stimulus (Akram et al, 2016;Alickovic et al, 2016), canonical correlation analysis (CCA)-based methods (de Cheveigné et al, 2018), and Bayesian state-space modeling (Miran et al, 2018). The current state-of-the-art models are capable of classifying auditory attention in a two-speaker scenario with high accuracy (80-90% correct) over a data window with a length of approximately 10 s. However, to quickly detect a switch in attention, detection in much shorter windows, down to a few seconds, is required.…”
Section: Introductionmentioning
confidence: 99%
“…The reconstructed envelope is then correlated with the original stimulus envelopes, and the one yielding the highest correlation is then considered to belong to the attended speaker. Other methods for attention decoding include the forward modelling approach: predicting EEG from the auditory stimulus (Akram et al, 2016;Alickovic et al, 2016), canonical correlation analysis (CCA)-based methods (de Cheveigné et al, 2018), and Bayesian state-space modeling (Miran et al, 2018).All studies mentioned above are based on linear decoders. However, since the human auditory system is inherently non-linear (Faure and Korn, 2001), non-linear models (such as neural networks) could be beneficial for reliable and quick AAD.…”
mentioning
confidence: 99%
“…Furthermore, the robustness of the attended speech envelope reconstruction in noisy real world acoustic scenes has been demonstrated [27]. In contrast to the stimulus reconstruction methods, studies with system identification approaches to solve this problem, have tried to reconstruct the neural measurements using the linear forward map of sound sources [28], [29], [30], [31]. In a recent related study, a single in-Ear-EEG electrode and an adjacent scalp-EEG electrode were used for auditory attention detection in a diotic two speaker scenario [30].…”
Section: Introductionmentioning
confidence: 99%
“…This process will be referred to as auditory attention detection (AAD). These models followed mostly two approaches: define a regression model that predicts the neural responses based on the sound, so-called forward modeling [4,2,3], or reconstruct the speech envelopes of the (un)attended speaker from the EEG or MEG neural data (backward modeling) [29,11]. Others have experimented with alternative approaches, for example to extract features from the cross-correlation of neural response and speech envelope to train a linear classifier [19].…”
Section: Introductionmentioning
confidence: 99%