“…Tactile sensing provides granular, localized data, complemented by the expansive overview offered by vision, encompassing attributes like an object's holistic shape and hue [3]. The merging of these senses, known as Vision-Tactile Fusion Perception (VTFP), has revealed numerous avenues for improved sensory comprehension [4,5]. Analogously, in robotic systems, while vision serves as a primary data source, tactile feedback is indispensable for discerning attributes like weight, firmness, slippage,and texture.…”