Adaptive Optics Systems VIII 2022
DOI: 10.1117/12.2627849
|View full text |Cite
|
Sign up to set email alerts
|

Model-free reinforcement learning with a non-linear reconstructor for closed-loop adaptive optics control with a pyramid wavefront sensor

Abstract: We present a model-free reinforcement learning (RL) predictive model with a supervised learning non-linear reconstructor for adaptive optics (AO) control with a pyramid wavefront sensor (P-WFS). First, we analyse the additional problems of training an RL control method with a P-WFS compared to the Shack-Hartmann WFS. From those observations, we propose our solution: a combination of model-free RL for prediction with a non-linear reconstructor based on neural networks with a U-net architecture. We test the prop… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 27 publications
0
2
0
Order By: Relevance
“…We are also examining applications of our phase reconstruction in real-time control with impressive early results, 20 , 29 and are encouraged about its applicability given we are measuring an inference time of 0.34 ms on retail desktop GPUs. With optimization and dedicated hardware 30 , 31 and the COSMIC framework platform, 32 wavefront inference from pre-trained networks has potential for hard real-time AO loop control, and not just in postprocessing workflows via PSF reconstruction.…”
Section: Discussion and Future Workmentioning
confidence: 99%
“…We are also examining applications of our phase reconstruction in real-time control with impressive early results, 20 , 29 and are encouraged about its applicability given we are measuring an inference time of 0.34 ms on retail desktop GPUs. With optimization and dedicated hardware 30 , 31 and the COSMIC framework platform, 32 wavefront inference from pre-trained networks has potential for hard real-time AO loop control, and not just in postprocessing workflows via PSF reconstruction.…”
Section: Discussion and Future Workmentioning
confidence: 99%
“…Pou shows improvements in Strehl for a simulated 8-meter telescope with Shack-Hartmann WFS and PWFS, from 85% Strehl to 91% Strehl calculated at 1650 nm. 41 However, for the Pyramid, this performance required a nonlinear wavefront reconstructor before the RL controller.…”
Section: Reinforcement Learningmentioning
confidence: 99%