2022
DOI: 10.1007/978-3-031-20059-5_6
|View full text |Cite
|
Sign up to set email alerts
|

Rethinking Data Augmentation for Robust Visual Question Answering

Abstract: Being widely used in learning unbiased visual question answering (VQA) models, Data Augmentation (DA) helps mitigate language biases by generating extra training samples beyond the original samples. While today's DA methods can generate robust samples, the augmented training set, significantly larger than the original dataset, often exhibits redundancy in terms of difficulty or content repetition, leading to inefficient model training and even compromising the model performance. To this end, we design an Effec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(2 citation statements)
references
References 50 publications
0
1
0
Order By: Relevance
“…Improving the accuracy of machine learning models to reduce reliance on human review in terms of annotating inputs and in correcting errors will also be critical. This will necessitate foundational computer vision research into avenues such as active learning techniques (e.g., Norouzzadeh et al, 2021), data augmentations (e.g., Chen et al, 2022), new model architectures, and more consistent image processing procedures (e.g., Kellenberger et al, 2021). Increasing confidence in machine learning outputs through improved statistical rigor will be necessary, as current methods do not adequately characterize multiple sources of error and bias including the downstream propagation of uncertainty in image labels.…”
Section: Improving the Machine Learning Pipelinementioning
confidence: 99%
“…Improving the accuracy of machine learning models to reduce reliance on human review in terms of annotating inputs and in correcting errors will also be critical. This will necessitate foundational computer vision research into avenues such as active learning techniques (e.g., Norouzzadeh et al, 2021), data augmentations (e.g., Chen et al, 2022), new model architectures, and more consistent image processing procedures (e.g., Kellenberger et al, 2021). Increasing confidence in machine learning outputs through improved statistical rigor will be necessary, as current methods do not adequately characterize multiple sources of error and bias including the downstream propagation of uncertainty in image labels.…”
Section: Improving the Machine Learning Pipelinementioning
confidence: 99%
“…Additionally, generative models are computationally expensive to train and require large amounts of data to produce good-quality results. Some works that are not generation based are SimpleAug [25] and KDDAug [26]. SimpleAug works on the principle that many of the "unknowns" are indeed "known" within the dataset.…”
Section: Related Workmentioning
confidence: 99%