Aiming at addressing the problem that the joints are easily destroyed by the impact torque during the process of space robot on-orbit capturing a non-cooperative spacecraft, a reinforcement learning control algorithm combined with a compliant mechanism is proposed to achieve buffer compliance control. The compliant mechanism can not only absorb the impact energy through the deformation of its internal spring, but also limit the impact torque to a safe range by combining with the compliance control strategy. First of all, the dynamic models of the space robot and the target spacecraft before capture are obtained by using the Lagrange approach and Newton-Euler method. After that, based on the law of conservation of momentum, the constraints of kinematics and velocity, the integrated dynamic model of the post-capture hybrid system is derived. Considering the unstable hybrid system, a buffer compliance control based on reinforcement learning is proposed for the stable control. The associative search network is employed to approximate unknown nonlinear functions, an adaptive critic network is utilized to construct reinforcement signal to tune the associative search network. The numerical simulation shows that the proposed control scheme can reduce the impact torque acting on joints by 76.6% at the maximum and 58.7% at the minimum in the capturing operation phase. And in the stable control phase, the impact torque acting on the joints were limited within the safety threshold, which can avoid overload and damage of the joint actuators.
Conceptual design is the key phase which confirms the principle solution that contains a great deal of innovative ideas. Theory of inventive problem solving (TRIZ) and theory of constraints (TOC) are two techniques for systematic innovation that can help designers and companies achieve innovative product solutions. TOC is used to discover contradictions and TRIZ is used to solve the conflicts in the process of product conceptual design, so we describe a process model to realize the theoretical interaction based on TRIZ and TOC. Beginning with the interaction between users and product to be designed, this paper makes a survey of user demand, summarizes in which stage of product development is and analyzes the directions of product technology evolution by selecting evolution routes of technical system. After design conflicts in product system are determined, the designer achieve the solutions of each sub-function of product according to their experience and heuristic association by using TRIZ tools. Designers let the users select the sub-function solutions from the morphological matrix and combine into a few suitable programs. Finally, the designer conduct program evaluation to obtain the optimal product solution by analytic hierarchy process (AHP). As a case study, an innovative design of the UV lampblack purification machine has been achieved with application of the process model.
In order to prevent joints from being damaged by impact force in a space robot capturing satellite, a spring-damper device (SDD) is added between the joint motor and manipulator. The device can not only absorb and attrition impact energy, but also limit impact force to a safe range through reasonable design compliance control strategy. Firstly, the dynamic mode of the space robot and target satellite systems before capture are established by using a Lagrange function based on dissipation theory and Newton-Euler function, respectively. After that, the impact effect is analyzed and the hybrid system dynamic equation is obtained by combining Newton’s third law, momentum conservation, and a kinematic geometric relationship. To realize the buffer compliance stability control of the hybrid system, a reinforcement learning (RL) control strategy based on a fuzzy wavelet network is proposed. The controller consists of a performance measurement unit (PMU), an associative search network (ASN), and an adaptive critic network (ACN). Finally, the stability of system is proved by Lyapunov theorem, and both the impact resistance of SDD and the effectiveness of buffer compliance control strategy are verified by numerical simulation.
In this article, the trajectory planning of the two manipulators of the dual-arm robot is studied to approach the patient in a complex environment with deep reinforcement learning algorithms. The shape of the human body and bed is complex which may lead to the collision between the human and the robot. Because the sparse reward the robot obtains from the environment may not support the robot to accomplish the task, a neural network is trained to control the manipulators of the robot to prepare to hold the patient up by using a proximal policy optimization algorithm with a continuous reward function. Firstly, considering the realistic scene, the 3D simulation environment is built to conduct the research. Secondly, inspired by the idea of the artificial potential field, a new reward and punishment function was proposed to help the robot obtain enough rewards to explore the environment. The function is consisting of four parts which include the reward guidance function, collision detection, obstacle avoidance function, and time function. Where the reward guidance function is used to guide the robot to approach the targets to hold the patient, the collision detection and obstacle avoidance function are complementary to each other and are used to avoid obstacles, and the time function is used to reduce the number of training episode. Finally, after the robot is trained to reach the targets, the training results are analyzed. Compared with the DDPG algorithm, the PPO algorithm reduces about 4 million steps for training to converge. Moreover, compared with the other reward and punishment functions, the function used in this paper will obtain many more rewards at the same training time. Apart from that, it will take much less time to converge, and the episode length will be shorter; so, the advantage of the algorithm used in this paper is verified.
In the control of space robots, flexible vibrations exist in the base, links and joints. When building a motion control scheme, the following three aspects should be considered: (1) the complexity in dynamic modeling; (2) the low accuracy of motion control and (3) the simultaneous suppression of multiple flexible vibrations. In this paper, we propose a motion vibration integrated saturation control scheme. First, the dynamic model of space robot with flexible-base, flexible-link and flexible-joint is established according to the assumed modes method and Lagrange equation. Second, singular perturbation theory is used to decompose the model into two subsystems: a slow subsystem containing the rigid motions of base and joints as well as the vibration of links, and a fast subsystem containing vibrations of base and joints. Third, an integrated sliding mode control with input restriction, output feedback and repetitive learning (ISMC-IOR) is designed, which can track the desired trajectories of base and joints with −3 orders of magnitude accuracy, while suppressing the multiple flexible vibrations of base, links and joints 50%–80% and 37% performance improvement over ISMC-IOR-NV were achieved. Finally, the algorithm is verified by simulations.
The dynamic modeling, motion control and flexible vibration active suppression of space robot under the influence of flexible base, flexible link and flexible joint are explored, and motion and vibration integrated fixed-time sliding mode control of fully flexible system is designed. The flexibility of the base and joints are equivalent to the vibration effect of linear springs and torsion springs. The flexible links are regarded as Euler–Bernoulli simply supported beams, which are analyzed by the hypothetical mode method, and the dynamic model of the fully flexible space robot is established by using the Lagrange equation. Then, the singular perturbation theory is used to decompose the model into slow subsystem including rigid motion and the link flexible vibrations, and fast subsystems including the base and the joint flexible vibrations. A fixed time sliding mode control based on hybrid trajectory is designed for the slow subsystem to ensure that the base and joints track the desired trajectory in a limited time while achieving vibration suppression on the flexible links. For the fast subsystem, linear quadratic optimal control is used to suppress the flexible vibration of the base and joints. The simulation results show that the controller proposed in the paper can make the system state converge within a fixed time, is robust to model uncertainty and external interference, and can effectively suppress the flexible vibration of the base, links, and joints.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.