2011 5th International IEEE/EMBS Conference on Neural Engineering 2011
DOI: 10.1109/ner.2011.5910601
|View full text |Cite
|
Sign up to set email alerts
|

Control of a center-out reaching task using a reinforcement learning Brain-Machine Interface

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
14
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
5
3
1
1

Relationship

2
8

Authors

Journals

citations
Cited by 34 publications
(17 citation statements)
references
References 12 publications
1
14
0
Order By: Relevance
“…The assignment of reward is based on the 1-0 distance to the target, that is, dist⁡( x , d ) = 0 if x = d , and dist⁡( x , d ) = 1, otherwise. Once the cursor reaches the assigned target, the agent gets a positive reward +0.6, else it receives negative reward −0.6 [35]. Exploration rate ϵ = 0.01 and discount factor γ = 0.9 are applied.…”
Section: Experimental Results On Neural Decodingmentioning
confidence: 99%
“…The assignment of reward is based on the 1-0 distance to the target, that is, dist⁡( x , d ) = 0 if x = d , and dist⁡( x , d ) = 1, otherwise. Once the cursor reaches the assigned target, the agent gets a positive reward +0.6, else it receives negative reward −0.6 [35]. Exploration rate ϵ = 0.01 and discount factor γ = 0.9 are applied.…”
Section: Experimental Results On Neural Decodingmentioning
confidence: 99%
“…RL has also been applied in BMI research. Concentrating mainly on controlling (prosthetic/robotic) devices, several studies have been reported, including: mapping neural activity to intended behavior through coadaptive BMI (using TD(λ)) [190] and symbiotic BMI (using actor-critic) [191], a testbed targeting center-out reaching task in primates for creating more realistic BMI control models [192], Hebbian RL for adaptive control by mapping neural states to prosthetic actions [193], BMI for unsupervised decoding of cortical spikes in multistep goal-directed tracking task (using Q(λ)) [194], adaptive BMI capable of adjusting to dramatic reorganizing neural activities with minimal training and stable performance over long duration (using actor-critic) [195], 11/33 BMI for efficient nonlinear mapping of neural states to actions through sparsification of state-action mapping space using quantized attention-gated kernel RL as an approximator [196]. Also, Lampe et al proposed BMI capable of transmitting imaginary movements evoked EEG signals over the Internet to remotely control robotic device [182], and Bauer and Gharabaghi combined RL with Bayesian model to select dynamic thresholds for improved performance of restorative BMI [183].…”
Section: /33mentioning
confidence: 99%
“…As mentioned above, the differences seen in rewarding versus nonrewarding trials through various measures have potential uses in improved learning for BMIs. Toward this goal, our lab recently showed that integrated features of PSD and SFC yielded nearperfect classification accuracy between rewarding and nonrewarding trials during color-cued one-target center-out reaching tasks and thus can be used as a neural critic in autonomous BMI decoding (Bae et al, 2011;Sanchez et al, 2011;Tarigoppula et al, 2012;Marsh et al, 2015;An et al, 2018;Tarigoppula et al, 2018).…”
Section: Alpha Cycle Relation To Neural Spikingmentioning
confidence: 99%