Proceedings of the Fourteenth Workshop on Semantic Evaluation 2020
DOI: 10.18653/v1/2020.semeval-1.88
Pheonix at SemEval-2020 Task 5: Masking the Labels Lubricates Models for Sequence Labeling

Abstract: This paper presents the deep-learning model submitted to the SemEval-2020 Task 5 competition: "Detecting Counterfactuals". We participated in both Subtask 1 and Subtask 2. The model proposed in this paper ranked 2nd in Subtask 2: "Detecting antecedent and consequence". Our model approaches the task as sequence labeling. The architecture is built on top of BERT, and a multi-head attention layer with label masking is used to benefit from the mutual information between nearby labels. Also, for prediction, …
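As a rough illustration of the architecture the abstract describes (not the authors' released code), the sketch below pairs a BERT encoder with a multi-head attention layer over partially masked label embeddings; all class, parameter, and dimension choices here are assumptions.

    # Hypothetical sketch: BERT encoder + multi-head attention over masked label embeddings.
    import torch
    import torch.nn as nn
    from transformers import BertModel

    class LabelMaskingTagger(nn.Module):
        def __init__(self, num_labels: int, hidden: int = 768, heads: int = 8):
            super().__init__()
            self.encoder = BertModel.from_pretrained("bert-base-uncased")
            # one extra embedding index is reserved for the "masked label" symbol
            self.label_emb = nn.Embedding(num_labels + 1, hidden)
            self.attn = nn.MultiheadAttention(hidden, heads, batch_first=True)
            self.classifier = nn.Linear(hidden, num_labels)

        def forward(self, input_ids, attention_mask, masked_labels):
            # masked_labels: gold tag ids with a random subset replaced by the mask index
            tokens = self.encoder(input_ids, attention_mask=attention_mask).last_hidden_state
            labels = self.label_emb(masked_labels)
            # token states attend over the partially observed nearby labels
            ctx, _ = self.attn(tokens, labels, labels,
                               key_padding_mask=~attention_mask.bool())
            return self.classifier(tokens + ctx)  # per-token logits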

Cited by 4 publications (1 citation statement) | References 9 publications
“…To predict the start and end positions of antecedents and consequents, the model utilizes an ensemble of RoBERTa models and extends them in the same manner as BERT was extended for the SQuAD dataset (Rajpurkar et al., 2016). The second-place system, pouria babvey, uses a sequence labelling approach: the authors build the model on top of BERT with a multi-head attention layer and label masking to capture mutual information between nearby labels (Babvey et al., 2020). Label masking, in which only part of the labels is fed during training and the rest have to be predicted, can be seen as a form of regularization and has been shown to be particularly effective for improving accuracy.…”
Section: Subtask-2: Detecting Antecedent and Consequent (DAC)
Citation type: mentioning (confidence: 99%)
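To make the label-masking idea cited above concrete, here is a minimal, hypothetical training-step sketch: a random fraction of the gold labels is shown to the model while the rest are replaced by a reserved mask index and must be predicted. The masking rate, tag set, and helper names are assumptions, not details from the paper.

    # Hypothetical training step for label masking as regularization.
    import torch
    import torch.nn.functional as F

    NUM_LABELS = 5          # e.g. O, B-ANT, I-ANT, B-CON, I-CON (assumed tag set)
    MASK_IDX = NUM_LABELS   # reserved embedding index for hidden labels

    def training_step(model, batch, mask_rate=0.7):
        gold = batch["labels"]                                   # (B, T) gold tag ids
        hide = torch.rand_like(gold, dtype=torch.float) < mask_rate
        shown = gold.masked_fill(hide, MASK_IDX)                 # partially observed labels
        logits = model(batch["input_ids"], batch["attention_mask"], shown)
        # loss only on the hidden positions, i.e. the labels the model must recover
        return F.cross_entropy(logits[hide], gold[hide])

    # At inference time every position would be fed MASK_IDX so no gold labels leak in.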