2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
DOI: 10.1109/iros51168.2021.9636133
Using Visual Anomaly Detection for Task Execution Monitoring

Abstract: An object handover between a robot and a human is a coordinated action which is prone to failure for reasons such as miscommunication, incorrect actions and unexpected object properties. Existing works on handover failure detection and prevention focus on preventing failures due to object slip or external disturbances. However, there is a lack of datasets and evaluation methods that consider unpreventable failures caused by the human participant. To address this deficit, we present the multimodal Handover Fail…

Cited by 4 publications (2 citation statements)
References 45 publications
“…An advantage of using RGB video lies in the availability of many well-established algorithms that can be used to extract auxiliary information to aid the anomaly detection. An example is the frequent use of optical flow to help detecting motion-based anomalies [11,13,22,33]. Some works use much higher-level information, e.g.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
“…The authors adopt LSTM-based variational autoencoders to process multimodal input from a sensor set including a camera, a microphone, a joint encoder, and a force sensor. In another work [36], multimodal cues are used to detect book manipulation failures on shelves. [37] fuses visuo-tactile cues for grasp failure detection.…”
Section: Related Work (citation type: mentioning)
confidence: 99%