2022
DOI: 10.48550/arxiv.2201.03965
Preprint

On the Efficacy of Co-Attention Transformer Layers in Visual Question Answering

Abstract: In recent years, multi-modal transformers have shown significant progress in Vision-Language tasks, such as Visual Question Answering (VQA), outperforming previous architectures by a considerable margin. This improvement in VQA is often attributed to the rich interactions between vision and language streams. In this work, we investigate the efficacy of co-attention transformer layers in helping the network focus on relevant regions while answering the question. We generate visual attention maps using the quest…
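The co-attention mechanism the abstract refers to can be sketched as cross-attention between the two streams: image-region features form the queries while question-token features supply the keys and values, and the resulting attention map is exactly the kind of visual attention map the paper analyzes. The sketch below is illustrative only; the projection matrices, dimensions, and single-head setup are assumptions, not the paper's implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def co_attention(stream_a, stream_b, Wq, Wk, Wv):
    """One cross-attention direction of a co-attention layer:
    stream_a queries are matched against stream_b keys/values."""
    q = stream_a @ Wq                                        # (A, d_k)
    k = stream_b @ Wk                                        # (B, d_k)
    v = stream_b @ Wv                                        # (B, d_k)
    attn = softmax(q @ k.T / np.sqrt(k.shape[1]), axis=-1)   # (A, B) attention map
    return attn @ v, attn

# Toy example: 4 image regions, 6 question tokens, feature dim 8.
rng = np.random.default_rng(0)
regions = rng.standard_normal((4, 8))
tokens = rng.standard_normal((6, 8))
Wq, Wk, Wv = (rng.standard_normal((8, 8)) / np.sqrt(8) for _ in range(3))
attended_regions, visual_attention_map = co_attention(regions, tokens, Wq, Wk, Wv)
```

In a full co-attention transformer layer this runs in both directions (vision attends to language and language attends to vision), with multiple heads and learned projections; here `visual_attention_map` row *i* shows how region *i* distributes attention over the question tokens.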

Cited by 1 publication (1 citation statement). References 19 publications (33 reference statements).
“…Objective O12: Apply Attention Mechanism to the QA system Apply Attention Mechanism (Section 2.5) to the QA system in order to allow the decoder of the Seq2Seq model to pay attention to one part of the input sequence (while giving less attention to others) at different decoding steps, thus guiding the process of reasoning similar to [239], but in an RL setting. This will help achieve goal G2.…”
Section: Path Planning Systems (citation type: mentioning; confidence: 99%)
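The citing work's objective O12 describes standard decoder attention: at each decoding step, the Seq2Seq decoder weights the encoder's hidden states by their relevance to its current state, focusing on one part of the input while down-weighting the rest. A minimal dot-product sketch of one such step, with toy states chosen purely for illustration:

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a vector.
    e = np.exp(x - x.max())
    return e / e.sum()

def attend(decoder_state, encoder_states):
    """Dot-product attention at one decoding step: weight each encoder
    state by its similarity to the current decoder state."""
    scores = encoder_states @ decoder_state    # (T,) similarity scores
    weights = softmax(scores)                  # attention distribution, sums to 1
    context = weights @ encoder_states         # weighted sum of encoder states
    return context, weights

# Toy sequence of 5 encoder states (dim 4); the decoder state is aligned
# with encoder state 2, so attention should concentrate there.
encoder_states = np.eye(5, 4) * 3.0
decoder_state = np.array([0.0, 0.0, 3.0, 0.0])
context, weights = attend(decoder_state, encoder_states)
```

In the RL setting the citing work proposes, these weights would be produced at every decoding step of the QA policy; the mechanism itself is unchanged.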