“…Some algorithms build on intermediate features such as marker position, optical flow, or depth [ 3 , 10 , 11 , 14 , 15 , 16 , 17 , 18 ], while others build and train end-to-end models [ 3 , 18 , 19 , 20 ]. The high resolution of VBTS allows robots to perform tasks such as detecting slip [ 2 , 21 , 22 ], estimating force distribution [ 11 , 23 ], classifying surface textures [ 24 , 25 ], and manipulating small and soft objects [ 26 , 27 ] as well as many tasks that require dexterity and fine-detailed pose estimation.…”