Multimodal Frame Identification with Multilingual Evaluation

Botschen, Teresa; Gurevych, Iryna; Klie, Jan-Christoph; Sergieh, Hatem Mousselly; Roth, Stefan

doi:10.18653/v1/n18-1134

Cited by 15 publications

(20 citation statements)

References 31 publications

(35 reference statements)

“…Our model underperforms compared to other embedding frameworks from Hermann et al (2014) and Botschen et al (2018), which can be explained through an examination of the input representation methods used by the different models, as well as their disambiguation strategies. The model by Hermann et al (2014) constructs an input representation that encodes the syntactic dependency relations found within the predicate context by concatenating the embeddings for the arguments and learning a mapping to a lower-dimensional space.…”

Section: Results (mentioning)

confidence: 80%

“…The Botschen et al (2018) model is most significantly different from ours in two respects: it uses multimodal embedding representations at the input (textual + visual), and it employs a softmax classifier at the output step, whereas we use MSE as a loss function. Prior work has shown that the first option is more powerful in the context of word sense disambiguation tasks (Popov, 2017).…”

Section: Results (mentioning)

confidence: 99%
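
The contrast drawn in the statement above, a softmax classifier over frames versus an output vector trained with an MSE loss, can be illustrated with a minimal sketch. This is not the implementation of either cited system: the frame names, embeddings, logits, and helper functions below are illustrative assumptions, and the nearest-embedding decoding step is one plausible reading of how an MSE-trained output might be turned into a frame label.

```python
import math

# Illustrative candidates and toy 2-d frame embeddings (assumed values,
# not taken from either cited paper).
FRAMES = ["Commerce_buy", "Getting"]
FRAME_EMBEDDINGS = {"Commerce_buy": [1.0, 0.0], "Getting": [0.0, 1.0]}


def softmax(logits):
    """Turn raw scores into a probability distribution over frames."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, gold_index):
    """Loss typically paired with a softmax classifier."""
    return -math.log(softmax(logits)[gold_index])


def mse(pred, target):
    """Mean squared error between a predicted vector and a target embedding."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def predict_softmax(logits, frames):
    """Classifier-style decoding: pick the frame with the highest score."""
    best = max(range(len(logits)), key=lambda i: logits[i])
    return frames[best]


def predict_nearest(pred, frame_embeddings):
    """Regression-style decoding (assumed): pick the frame whose embedding
    is closest to the predicted vector under MSE."""
    return min(frame_embeddings, key=lambda f: mse(pred, frame_embeddings[f]))


# Hypothetical model outputs for one target word.
logits = [2.0, 0.5]    # one score per candidate frame
pred_vec = [0.8, 0.1]  # a point in the frame-embedding space

print(predict_softmax(logits, FRAMES))
print(predict_nearest(pred_vec, FRAME_EMBEDDINGS))
```

In the classifier setting the model outputs one score per frame, whereas in the regression setting it outputs a point in embedding space that is then compared against the frame embeddings.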

Graph Embeddings for Frame Identification

Popov

Sikos

2019

Proceedings - Natural Language Processing in a Deep Learning World

Lexical resources such as WordNet (Miller, 1995) and FrameNet (Baker et al., 1998) are organized as graphs, where relationships between words are made explicit via the structure of the resource. This work explores how structural information from these lexical resources can lead to gains in a downstream task, namely frame identification. While much of the current work in frame identification uses various neural architectures to predict frames, those neural architectures only use representations of frames based on annotated corpus data. We demonstrate how incorporating knowledge directly from the FrameNet graph structure improves the performance of a neural network-based frame identification system. Specifically, we construct a bidirectional LSTM with a loss function that incorporates various graph- and corpus-based frame embeddings for learning and ultimately achieves strong performance gains with the graph-based embeddings over corpus-based embeddings alone.

“…Work on event semantics hints at two annotation types complementing each other: additional information about participants benefits event prediction (Ahrendt and Demberg, 2016; Botschen et al, 2018) and context information about events benefits the prediction of implicit arguments and entities (Cheng and Erk, 2018). The complementarity is further affirmed by efforts on aligning WD and the FN lexicon: the best alignment approach only maps 37% of the total WD properties to frames (Mousselly-Sergieh and Gurevych, 2016).…”

Section: Complementarity Of Annotations (mentioning)

confidence: 99%

“…Both annotation tools, the WD entity linker as well as the FN frame identifier, introduce some noise: for the entity linker, Sorokin and Gurevych (2018) report 0.73 F-score and the frame identifier has an accuracy of 0.89 (Botschen et al, 2018). We perform a manual error analysis on 50 instances of the test set to understand the effect of the noisy WD annotation.…”

Section: Error Analysis (mentioning)

confidence: 99%

“…We use two freely available systems to obtain semantic annotations for the claim (b), the reason (c) and the alternative warrants (i, ii): the frame identifier by Botschen et al (2018) for frame annotations and the entity linker by Sorokin and Gurevych (2018). We employ pre-trained vector representations to encode information from FN and WD.…”

Section: Preprocessing - Obtaining Annotations (mentioning)

confidence: 99%

Frame- and Entity-Based Knowledge for Common-Sense Argumentative Reasoning

Botschen

Sorokin

Gurevych

2018

Proceedings of the 5th Workshop on Argument Mining

Self Cite

Common-sense argumentative reasoning is a challenging task that requires holistic understanding of the argumentation where external knowledge about the world is hypothesized to play a key role. We explore the idea of using event knowledge about prototypical situations from FrameNet and fact knowledge about concrete entities from Wikidata to solve the task. We find that both resources can contribute to an improvement over the non-enriched approach and point out two persisting challenges: first, integration of many annotations of the same type, and second, fusion of complementary annotations. After our explorations, we question the key role of external world knowledge with respect to the argumentative reasoning task and rather point towards a logic-based analysis of the chain of reasoning.

Multiple POS Dependency-Aware Mixture of Experts for Frame Identification

Yan

Chai

et al.

2023

IEEE Access

Frame identification, which is finding the exact evoked frame for a target word in a given sentence, is a fundamental and crucial prerequisite for frame semantic parsing. It is generally seen as a classification task for target words, whose contextual representations are usually obtained using a neural network like BERT as an encoder, and enriched with a joint learning model or the knowledge of FrameNet. However, the distinction at a fine-grained level, such as the delicate differences in the information of syntax and PropBank roles caused by different parts-of-speech (POS) of target words, is neglected. We propose a Multiple POS Dependency-aware Mixture of Experts (MPDaMoE) network that integrates five types of information, consisting of the syntactic information of target words whose POS are nominal, adjectival, adverbial, or prepositional, and the PropBank role information of target words whose POS are only verbal. To better learn such information, a Mixture of Experts network is employed, in which every expert is a Graph Convolutional Network, to incorporate the different dependency information of target words. Our model outperforms state-of-the-art models in experiments on two benchmark datasets, which shows its effectiveness.
