2022
DOI: 10.48550/arxiv.2202.13941
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Background Mixup Data Augmentation for Hand and Object-in-Contact Detection

Abstract: Detecting the positions of human hands and objects-incontact (hand-object detection) in each video frame is vital for understanding human activities from videos. For training an object detector, a method called Mixup, which overlays two training images to mitigate data bias, has been empirically shown to be effective for data augmentation. However, in hand-object detection, mixing two hand-manipulation images produces unintended biases, e.g., the concentration of hands and objects in a specific region degrades… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…All methods in this competition exploited deep learning techniques. Most of them were developed based on the state of the art in object detection, including YOLOv5 [15,16], Fast-RCNN [13,17], EfficientDet [18], Cascade R-CNN [15,19,20], CBNetV2 [21,22], CenterNet2 [23], Task-aligned One-stage object Detection (TOOD) [17,24], and RetinaNet [25,26]. These methods are convolutional neural networks with various backbone architectures, where the most popular architecture is based on ResNet blocks [17, 22-25, 27, 28].…”
Section: Methodsmentioning
confidence: 99%
“…All methods in this competition exploited deep learning techniques. Most of them were developed based on the state of the art in object detection, including YOLOv5 [15,16], Fast-RCNN [13,17], EfficientDet [18], Cascade R-CNN [15,19,20], CBNetV2 [21,22], CenterNet2 [23], Task-aligned One-stage object Detection (TOOD) [17,24], and RetinaNet [25,26]. These methods are convolutional neural networks with various backbone architectures, where the most popular architecture is based on ResNet blocks [17, 22-25, 27, 28].…”
Section: Methodsmentioning
confidence: 99%