Proceedings of the ACL 2017 Student Research Workshop
DOI: 10.18653/v1/p17-3008
Segmentation Guided Attention Networks for Visual Question Answering

Abstract: In this paper we propose to solve the problem of Visual Question Answering by using a novel segmentation-guided, attention-based network which we call SegAttendNet. We use image segmentation maps, generated by a fully convolutional deep neural network, to refine our attention maps, and use these refined attention maps to make the model focus on the relevant parts of the image to answer a question. The refined attention maps are used by the LSTM network to learn to produce the answer. We presently train our model …
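The abstract's core idea — masking a spatial attention map with a segmentation map before pooling image features — can be sketched in a few lines. This is a hypothetical illustration, not the paper's implementation: the function names (`refine_attention`, `attend`), the binary-mask assumption, and the fallback when the mask suppresses all attention are my assumptions for the sketch.

```python
import numpy as np

def refine_attention(raw_attention, seg_mask, eps=1e-8):
    """Refine a spatial attention map with a segmentation mask.

    raw_attention: (H, W) non-negative attention scores over image regions.
    seg_mask:      (H, W) binary map from a segmentation network
                   (1 = pixel belongs to a segmented object).
    Returns an (H, W) map that sums to 1 and is concentrated on
    segmented regions.
    """
    refined = raw_attention * seg_mask      # suppress background regions
    total = refined.sum()
    if total < eps:                         # mask removed everything:
        refined = raw_attention             # fall back to the raw attention
        total = refined.sum() + eps
    return refined / total

def attend(features, refined_attention):
    """Pool (H, W, C) image features with an (H, W) attention map
    into a single (C,) vector, e.g. to feed an answer-producing LSTM."""
    return (features * refined_attention[..., None]).sum(axis=(0, 1))
```

Under this sketch, background pixels get zero weight, so the pooled feature vector is driven only by the segmented object regions that are presumably relevant to the question.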

Cited by 7 publications (5 citation statements). References 13 publications.
“…Recently, the use of ML models trained on multimodal data has gained traction, particularly the combination of image and text data modalities. Several papers have shown that multimodal models may provide some resilience against attacks [328], but other papers show that multimodal models themselves could be vulnerable to attacks mounted on all modalities at the same time [63,261,326]. See Section 4.6 for additional discussion.…”
Section: Cybersecurity
confidence: 99%
“…Without such an effort, single modality attacks can be effective and compromise multimodal models across a wide range of multimodal tasks despite the information contained in the remaining unperturbed modalities [328,335]. Moreover, researchers have devised efficient mechanisms for constructing simultaneous attacks on multiple modalities, which suggests that multimodal models might not be more robust against adversarial attacks despite improved performance [63,261,326].…”
Section: Tradeoffs Between the Attributes of Trustworthy AI
confidence: 99%
“…The field of multi-modal learning has made significant progress [2,16,19,18,17] in recent years for cross-modal understanding. The line of work on Trojan attacks on multi-modal models [20,5,27] is usually limited to a single modality, investigating the robustness to a single-modality Trojan and how the presence of such a Trojan would affect the multimodal model's performance. For instance, Attend and Attack [20] generates adversarial visual inputs to fool a visual question answering (VQA) model [30,31,3,7,21] through a compromised attention map.…”
Section: Related Work
confidence: 99%
“…Chaturvedi et al [5] presented a targeted adversarial attack on VQA using adversarial background noise in the vision input.…”
Section: Related Work
confidence: 99%