MAST: Multimodal Abstractive Summarization with Trimodal Hierarchical Attention

Khullar, Aman; Arora, Udit

doi:10.48550/arxiv.2010.08021

“…Multimodal summarization focuses on distilling the most significant aspects of input data that span multiple modalities. Recent years have seen extensive exploration into generating text summaries from multimodal data, which may include text, visual, and auditory information [19][20][21]. A significant body of work has concentrated on incorporating visual information to enhance the quality of text summa-rization [3][4][5]22,23].…”

Section: Related Workmentioning

confidence: 99%

HGTMFS: A Hypergraph Transformer Framework for Multimodal Summarization

Lu

¹

,

Lu

²

,

Zhang

³

2024

Applied Sciences

0

Get access via publisher Add to dashboard Cite

Exaggerated anticipatory anxiety is common in social anxiety disorder (SAD). Neuroimaging studies have revealed altered neural activity in response to social stimuli in SAD, but fewer studies have examined neural activity during anticipation of feared social stimuli in SAD. The current study examined the time course and magnitude of activity in threat processing brain regions during speech anticipation in socially anxious individuals and healthy controls (HC). Method Participants (SAD n = 58; HC n = 16) underwent functional magnetic resonance imaging (fMRI) during which they completed a 90s control anticipation task and 90s speech anticipation task.

show abstract

“…Based on the How2, Palaskar et al [1] first proposed to utilise the visual and textual information of video clips into the summary generation process. Khullar and Arora [26] incorporated audio to generate a summary of video content with visual and textual modalities. Liu et al [3] conducted multistage fusion to interact multi-source modalities together and applied the forget gate module to resist the noise flows from multimodal semantics.…”

Section: Multimodal Abstractive Summarisationmentioning

confidence: 99%

MCR: Multilayer cross‐fusion with reconstructor for multimodal abstractive summarisation

Yuan

¹

,

Yun

²

,

Zheng

³

et al. 2023

IET Computer Vision

Get access via publisher Add to dashboard Cite

Exaggerated anticipatory anxiety is common in social anxiety disorder (SAD). Neuroimaging studies have revealed altered neural activity in response to social stimuli in SAD, but fewer studies have examined neural activity during anticipation of feared social stimuli in SAD. The current study examined the time course and magnitude of activity in threat processing brain regions during speech anticipation in socially anxious individuals and healthy controls (HC). Method Participants (SAD n = 58; HC n = 16) underwent functional magnetic resonance imaging (fMRI) during which they completed a 90s control anticipation task and 90s speech anticipation task.

show abstract

“…dle the text input.Multimodal summarization aims to condense information from multimodal inputs, such as text, vision, and audio[12]. Recently, Ms has been extensively studied ([10,2,8,31]). A large number of the works focus on fusing visual information to improve the quality of text summaries [11].…”

mentioning

confidence: 99%

CISum: Learning Cross-modality Interaction to Enhance Multimodal Semantic Coverage for Multimodal Summarization

Zhang¹,

Zhang²,

Guo³

et al. 2023

Preprint

0

Get access via publisher Add to dashboard Cite

Exaggerated anticipatory anxiety is common in social anxiety disorder (SAD). Neuroimaging studies have revealed altered neural activity in response to social stimuli in SAD, but fewer studies have examined neural activity during anticipation of feared social stimuli in SAD. The current study examined the time course and magnitude of activity in threat processing brain regions during speech anticipation in socially anxious individuals and healthy controls (HC). Method Participants (SAD n = 58; HC n = 16) underwent functional magnetic resonance imaging (fMRI) during which they completed a 90s control anticipation task and 90s speech anticipation task.

show abstract

MAST: Multimodal Abstractive Summarization with Trimodal Hierarchical Attention

Cited by 4 publications

References 12 publications

HGTMFS: A Hypergraph Transformer Framework for Multimodal Summarization

HGTMFS: A Hypergraph Transformer Framework for Multimodal Summarization

MCR: Multilayer cross‐fusion with reconstructor for multimodal abstractive summarisation

CISum: Learning Cross-modality Interaction to Enhance Multimodal Semantic Coverage for Multimodal Summarization

Contact Info

Product

Resources

About