“…Through multimodal interfaces, including image processing, gesture recognition, natural language processing, and affective computing, the operator’s intentions can be inferred and passed to the machines to enhance their reasoning ( Yang et al, 2018 ). Recently, we proposed a novel human–machine cooperative rapidly exploring random tree algorithm, which introduces human preferences and corrective actions, enhancing the safety, smoothness, and human likeness of robot planning and tracking control ( Huang et al, 2021 ). In addition, to further improve machine intelligence, we are developing a human-in-the-loop deep reinforcement learning (DRL) method by fusing human skills via real-time guidance into the DRL agent during the learning process, which effectively improves the learning curve ( Wu et al, 2021 ).…”