Proceedings of the First International Workshop on Natural Language Processing Beyond Text 2020
DOI: 10.18653/v1/2020.nlpbt-1.7

MAST: Multimodal Abstractive Summarization with Trimodal Hierarchical Attention

Abstract: This paper presents MAST, a new model for Multimodal Abstractive Text Summarization that utilizes information from all three modalities (text, audio, and video) in a multimodal video. Prior work on multimodal abstractive text summarization utilized information from only the text and video modalities. We examine the usefulness and challenges of deriving information from the audio modality and present a sequence-to-sequence trimodal hierarchical attention-based model that overcomes these challenges by letting the…
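The trimodal hierarchical attention described in the abstract can be sketched roughly as follows. This is a minimal illustration, assuming additive (Bahdanau-style) attention and a shared encoder dimension across the three modalities; the class and parameter names are hypothetical and this is not the authors' released implementation.

```python
# Minimal sketch of two-level (hierarchical) attention over three modalities.
# Assumptions (not from the paper): additive attention, shared enc_dim.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdditiveAttention(nn.Module):
    """Scores each encoder state against the current decoder state."""
    def __init__(self, enc_dim, dec_dim, attn_dim):
        super().__init__()
        self.w_enc = nn.Linear(enc_dim, attn_dim, bias=False)
        self.w_dec = nn.Linear(dec_dim, attn_dim, bias=False)
        self.v = nn.Linear(attn_dim, 1, bias=False)

    def forward(self, enc_states, dec_state):
        # enc_states: (batch, seq_len, enc_dim); dec_state: (batch, dec_dim)
        scores = self.v(torch.tanh(
            self.w_enc(enc_states) + self.w_dec(dec_state).unsqueeze(1)
        )).squeeze(-1)                              # (batch, seq_len)
        weights = F.softmax(scores, dim=-1)
        context = torch.bmm(weights.unsqueeze(1), enc_states).squeeze(1)
        return context                              # (batch, enc_dim)

class TrimodalHierarchicalAttention(nn.Module):
    """Level 1: attend within each modality's encoder states.
    Level 2: attend over the three per-modality context vectors."""
    def __init__(self, enc_dim, dec_dim, attn_dim):
        super().__init__()
        self.modality_attn = nn.ModuleList(
            AdditiveAttention(enc_dim, dec_dim, attn_dim) for _ in range(3)
        )
        self.fusion_attn = AdditiveAttention(enc_dim, dec_dim, attn_dim)

    def forward(self, text_enc, audio_enc, video_enc, dec_state):
        contexts = torch.stack([
            attn(enc, dec_state)
            for attn, enc in zip(self.modality_attn,
                                 (text_enc, audio_enc, video_enc))
        ], dim=1)                                   # (batch, 3, enc_dim)
        return self.fusion_attn(contexts, dec_state)  # fused context vector
```

The second-level attention is what lets the decoder reweight whole modalities at each step, rather than committing to a fixed fusion of the three streams.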

Cited by 37 publications (4 citation statements)
References 27 publications
“…For each video clip, following previous works (Sanabria et al., 2018; Palaskar et al., 2019; Khullar and Arora, 2020), a 2048-dimensional feature representation is extracted for every 16 non-overlapping frames using a 3D ResNeXt-101 model (Hara et al., 2018), which is pre-trained on the Kinetics dataset (Kay et al., 2017). Therefore, each data sample will have a sequence of 2048-𝑑 vision feature vectors of length 𝑀.…”
Section: Video Feature Extraction (mentioning)
confidence: 99%
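As a concrete illustration of the extraction step this statement describes (splitting the video into non-overlapping 16-frame clips and encoding each with a Kinetics-pretrained 3D CNN), here is a minimal sketch. The citing papers use the 3D ResNeXt-101 of Hara et al. (2018), which yields 2048-dimensional features; torchvision does not ship that model, so this sketch substitutes torchvision's r3d_18 (Kinetics-400 pretrained, 512-dimensional features) purely as a stand-in, and the function name is hypothetical.

```python
# Sketch: encode non-overlapping 16-frame clips with a Kinetics-pretrained
# 3D CNN. r3d_18 stands in for the 3D ResNeXt-101 used in the papers.
import torch
import torch.nn as nn
from torchvision.models.video import r3d_18, R3D_18_Weights

def extract_clip_features(frames: torch.Tensor) -> torch.Tensor:
    """frames: (num_frames, 3, 112, 112), already normalized per the
    model's preprocessing. Returns one feature vector per 16-frame clip."""
    model = r3d_18(weights=R3D_18_Weights.KINETICS400_V1)
    model.fc = nn.Identity()   # drop the classifier; keep pooled features
    model.eval()

    clip_len = 16
    num_clips = frames.shape[0] // clip_len   # trailing frames are dropped
    # (num_clips, clip_len, 3, H, W) -> (num_clips, 3, clip_len, H, W)
    clips = (
        frames[: num_clips * clip_len]
        .reshape(num_clips, clip_len, 3, 112, 112)
        .permute(0, 2, 1, 3, 4)
    )
    with torch.no_grad():
        feats = model(clips)   # (num_clips, 512) with r3d_18;
                               # 2048-d with the papers' 3D ResNeXt-101
    return feats               # the length-M sequence of vision features
```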
“…[1] first proposed to utilise the visual and textual information of video clips into the summary generation process. Khullar and Arora [26] incorporated audio to generate a summary of video content with visual and textual modalities. Liu et al.…”
Section: Related Work (mentioning)
confidence: 99%