Reinforcement learning (RL) is used to design impulsive stationkeeping maneuvers for a spacecraft operating near an [Formula: see text] quasi-halo trajectory in a Sun–Earth–Moon point mass ephemeris model with solar radiation pressure. This scenario is translated into an RL problem that reflects the desired stationkeeping goals, variables, and dynamical model. An algorithm from proximal policy optimization is used to train a policy that generates stationkeeping maneuvers while transfer learning is used to reduce the computational time required for training. The trained policy successfully generates stationkeeping maneuvers that result in boundedness to the vicinity of the selected reference trajectory with low total maneuver requirements, producing comparable results to a traditionally formulated constrained optimization scheme.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.