“…In an unstructured environment, the model is unknown, and two methods can be used: geometry estimation and a closed-loop online controller to minimize force and torque. Several studies have been conducted on geometry estimation [13,14,15,16,17,18], where articulation pose is estimated from visual input, and a motion trajectory can be planned from this estimation result. However, the estimation accuracy is insufficient for compliant manipulation (e.g., ∼ 20 • estimation error in a rotation-axis orientation on real-world data [18]), and causes the planning-based approach to fail.…”