Mona Ahmadian scite author profile

Heterogeneous graphs provide a compact, efficient, and scalable way to model data involving multiple disparate modalities. This makes modeling audiovisual data using heterogeneous graphs an attractive option. However, graph structure does not appear naturally in audiovisual data. Graphs for audiovisual data are constructed manually, which is both difficult and sub-optimal. In this work, we address this problem by (i) proposing a parametric graph construction strategy for the intra-modal edges, and (ii) learning the crossmodal edges. To this end, we develop a new model, the heterogeneous graph crossmodal network (HGCN), that learns the crossmodal edges. Our proposed model can adapt to various spatial and temporal scales owing to its parametric construction, while the learnable crossmodal edges effectively connect the relevant nodes across modalities. Experiments on a large benchmark dataset (AudioSet) show that our model achieves state-of-the-art performance (0.53 mean average precision), outperforming transformer-based models and other graph-based models. Our code is available at github.com/AmirSh15/Cross modality graph

Future Image Prediction of Plantar Pressure During Gait Using Spatio-temporal Transformer

Ahmadian, Rahmani-Boldaji, Shirian (2022)

Unsupervised Generative Adversarial Network for Plantar Pressure Image-to-Image Translation

Ahmadian, Beheshti, Kalhor, et al. (2021)

Heterogeneous Graph Learning for Acoustic Event Classification
