2022
DOI: 10.1109/access.2022.3202976
CAMM: Cross-Attention Multimodal Classification of Disaster-Related Tweets

Abstract: During the past decade, social media platforms have been extensively used during a disaster for information dissemination by the affected community and humanitarian agencies. Although many studies have been done recently to classify the informative and non-informative messages from social media posts, most are unimodal, i.e., have independently used textual or visual data to build the deep learning models. In the present study, we integrate the complementary information provided by the text and image messages …
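The abstract describes fusing text and image features through cross-attention. As a rough illustration of that fusion idea (not the paper's actual architecture), a single-head cross-attention step in which text tokens attend to image regions can be sketched as follows; all shapes, dimensions, and the unprojected Q/K/V simplification are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, keys_values, d_k):
    """Queries from one modality attend to keys/values from the other."""
    scores = queries @ keys_values.T / np.sqrt(d_k)  # (T_text, T_img)
    weights = softmax(scores, axis=-1)               # each row sums to 1
    return weights @ keys_values                     # (T_text, d_k)

text_feats = rng.normal(size=(5, 16))   # 5 text tokens, feature dim 16
image_feats = rng.normal(size=(7, 16))  # 7 image regions, feature dim 16

# Text queries attend over image regions, yielding image-aware text features.
fused = cross_attention(text_feats, image_feats, d_k=16)
print(fused.shape)  # (5, 16)
```

A full implementation would add learned Q/K/V projection matrices and multiple heads; this sketch only shows the attention-weighted mixing of one modality's features by the other.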

Cited by 18 publications (11 citation statements)
References 50 publications
“…Deep learning, a subset of ML, has also found extensive applications in handling Twitter data due to its ability to automatically learn complex patterns and representations from large-scale data. In the latest literature focused on deep networks, the authors of [5] proposed a new deep neural network called Cross-Attention Multi-Modal (CAMM) to classify disaster data that contains both text and images. The authors of [6] proposed a novel stacking-based ensemble method [43], using statistical features and informative words, to tackle the challenges of damage assessment in tweets.…”
Section: Deep Learning and Neural Network Approaches
Citation type: mentioning, confidence: 99%
“…Continuous Bag of Words (CBOW): CBOW attempts to predict a target word based on the context words around it. Starting with a context window of size C (the number of words on each side of the target word), the goal is to maximize the probability of correctly predicting the target word w_t given its context words w_c, where c ranges from -C to C (excluding 0, the target word itself), as shown in equation [5]: Maximize (1/T) * Σ_t log P(w_t | w_c)…”
Section: Word To Vector (Word2vec)
Citation type: mentioning, confidence: 99%
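The CBOW objective quoted above — averaging each target word's log-probability given its context window — can be sketched numerically. This is a minimal illustration with a toy corpus and randomly initialized (untrained) embeddings; the corpus, dimensions, and the averaged-context parameterization are assumptions for demonstration, not details from the cited paper:

```python
import numpy as np

rng = np.random.default_rng(0)

corpus = "the quick brown fox jumps over the lazy dog".split()
vocab = sorted(set(corpus))
word_to_id = {w: i for i, w in enumerate(vocab)}
V, D, C = len(vocab), 8, 2  # vocab size, embedding dim, window size C

W_in = rng.normal(scale=0.1, size=(V, D))   # input (context) embeddings
W_out = rng.normal(scale=0.1, size=(D, V))  # output (target) weights

def cbow_log_prob(target_id, context_ids):
    """log P(w_t | context) under the averaged-context CBOW model."""
    h = W_in[context_ids].mean(axis=0)                  # average context vectors
    scores = h @ W_out                                  # scores over vocabulary
    log_probs = scores - np.log(np.exp(scores).sum())   # log-softmax
    return log_probs[target_id]

def cbow_objective(tokens):
    """(1/T) * sum_t log P(w_t | w_c), c in [-C, C] \ {0}, skipping edges."""
    total, T = 0.0, 0
    for t in range(C, len(tokens) - C):
        context = [word_to_id[tokens[t + c]] for c in range(-C, C + 1) if c != 0]
        total += cbow_log_prob(word_to_id[tokens[t]], context)
        T += 1
    return total / T

print(cbow_objective(corpus))
```

Training would adjust W_in and W_out by gradient ascent to maximize this quantity; with random weights the average log-probability sits near -log(V), and it rises toward 0 as the model improves.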
“…Multimodal learning is a general method for building artificial intelligence (AI) models that extract and correlate information from multimodal data (Baltrusaitis et al. 2019). Multimodal learning has been used in several areas (Khattar & Quadri 2022), such as visual question answering, emotion recognition, machine translation, cross-modal retrieval, and speech recognition. With the development of large survey telescopes, a massive amount of multi-source heterogeneous astronomical data, such as spectral and photometric data of astronomical objects, has been generated.…”
Section: Introduction
Citation type: mentioning, confidence: 99%