“…This is particularly true for some of the richest, most information-dense sensing modalities, such as images, audio, or tactile feedback, as well as in interaction-rich applications that need to reason about difficult-to-model contact dynamics. These systems also tend to have more complex, heteroscedastic (variable) noise profiles [6]. For example, an object position estimate from an image might rapidly switch from being extremely precise under nominal operating conditions to completely useless under occlusion or poor lighting.…”