2022
DOI: 10.1007/978-3-031-20074-8_10
|View full text |Cite
|
Sign up to set email alerts
|

OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images

Abstract: Enhancing the robustness of vision algorithms in real-world scenarios is challenging. One reason is that existing robustness benchmarks are limited, as they either rely on synthetic data or ignore the effects of individual nuisance factors. We introduce OOD-CV-v2 , a benchmark dataset that includes out-of-distribution examples of 10 object categories in terms of pose, shape, texture, context and the weather conditions, and enables benchmarking of models for image classification, object detection, and 3D pose e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(3 citation statements)
references
References 42 publications
0
3
0
Order By: Relevance
“…They show feasibility of this attack on MiniGPT-4 [17], Fuyu [40], LLaVA [16]. This attack is evaluated on Safety Evaluation Benchmark, focusing on two evaluation scenarios: (a) OODCV-VQA and Counterfactual Variant: A novel VQA dataset is proposed grounded on images from OODCV [41]. The dataset includes questions with pre-defined templates for yes/no or digit responses.…”
Section: Red Teaming Methodsmentioning
confidence: 99%
“…They show feasibility of this attack on MiniGPT-4 [17], Fuyu [40], LLaVA [16]. This attack is evaluated on Safety Evaluation Benchmark, focusing on two evaluation scenarios: (a) OODCV-VQA and Counterfactual Variant: A novel VQA dataset is proposed grounded on images from OODCV [41]. The dataset includes questions with pre-defined templates for yes/no or digit responses.…”
Section: Red Teaming Methodsmentioning
confidence: 99%
“…Regarding natural inputs, several methods have been proposed for defining perturbations and generating perturbed input images, guided by knowledge of the application domain [58], coverage metrics [59,60], or properties that the system must satisfy, such as metamorphic relations [22,61]. In addition, several perturbation benchmarks have been proposed in the literature to assess robustness [7,[62][63][64]. However, few of these methods have been developed and evaluated in the context of 2D object detection systems.…”
Section: Robustness Testing Of Ai-based Object Detection Systemsmentioning
confidence: 99%
“…It aims to verify that inserting objects into the background of an image does not change the results for the other objects. For their part, Zhao et al [63] proposed a natural perturbation benchmark for testing models in the field of computer vision, including object detection models.…”
Section: Robustness Testing Of Ai-based Object Detection Systemsmentioning
confidence: 99%