Gayatri Bhat scite author profile

Training language models for code-mixed (CM) language is known to be a difficult problem because of a lack of data, compounded by the increased confusability due to the presence of more than one language. We present a computational technique for creating grammatically valid artificial CM data based on the Equivalence Constraint Theory. We show that when training examples are sampled appropriately from this synthetic data and presented in a certain order (a training curriculum) along with monolingual and real CM data, they can significantly reduce the perplexity of an RNN-based language model. We also show that randomly generated CM data does not help in decreasing the perplexity of the LMs.

* Work done during the author's internship at Microsoft Research.

1. According to some linguists, code-switching refers to inter-sentential mixing of languages, whereas code-mixing refers to intra-sentential mixing. Since the latter is more general, we will use code-mixing in this paper to mean both.
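The abstract above reports its results in terms of perplexity. As a reminder of what that metric measures, here is a minimal sketch of the standard definition (the exponential of the average negative log-probability per token). The function name `perplexity` and the toy probability lists are illustrative assumptions, not taken from the paper.

```python
import math


def perplexity(token_probs):
    """Return exp of the mean negative log-probability of the tokens.

    token_probs: probabilities a language model assigned to each
    observed token (hypothetical values, for illustration only).
    """
    if not token_probs:
        raise ValueError("need at least one token probability")
    mean_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(mean_nll)


# A model that spreads its mass evenly over 4 choices per token...
uniform = perplexity([0.25, 0.25, 0.25, 0.25])
# ...versus one that assigns each observed token probability 0.5.
confident = perplexity([0.5, 0.5, 0.5, 0.5])
```

Lower perplexity means the model assigns higher probability to the observed text, which is why a reduction is read as an improvement.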

Contextual Affective Analysis: A Case Study of People Portrayals in Online #MeToo Stories

Field

Bhat

Tsvetkov

2019

ICWSM

In October 2017, numerous women accused producer Harvey Weinstein of sexual harassment. Their stories encouraged other women to voice allegations of sexual harassment against many high-profile men, including politicians, actors, and producers. These events are broadly referred to as the #MeToo movement, named for the use of the hashtag “#metoo” on social media platforms like Twitter and Facebook. The movement has widely been referred to as “empowering” because it has amplified the voices of previously unheard women over those of traditionally powerful men. In this work, we investigate dynamics of sentiment, power and agency in online media coverage of these events. Using a corpus of online media articles about the #MeToo movement, we present a contextual affective analysis: an entity-centric approach that uses contextualized lexicons to examine how people are portrayed in media articles. We show that while these articles are sympathetic towards women who have experienced sexual harassment, they consistently present men as most powerful, even after sexual assault allegations. While we focus on media coverage of the #MeToo movement, our method for contextual affective analysis readily generalizes to other domains.

A Margin-based Loss with Synthetic Negative Samples for Continuous-output Machine Translation

Bhat

Kumar

Tsvetkov

2019

Neural models that eliminate the softmax bottleneck by generating word embeddings (rather than multinomial distributions over a vocabulary) attain faster training with fewer learnable parameters. These models are currently trained by maximizing densities of pretrained target embeddings under von Mises-Fisher distributions parameterized by corresponding model-predicted embeddings. This work explores the utility of margin-based loss functions in optimizing such models. We present syn-margin loss, a novel margin-based loss that uses a synthetic negative sample constructed from only the predicted and target embeddings at every step. The loss is efficient to compute, and we use a geometric analysis to argue that it is more consistent and interpretable than other margin-based losses. Empirically, we find that syn-margin provides small but significant improvements over both vMF and standard margin-based losses in continuous-output neural machine translation.
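For readers unfamiliar with margin-based losses over embeddings, the following is a minimal sketch of a generic cosine-similarity hinge loss. It is not the paper's syn-margin loss: the abstract does not say how the synthetic negative sample is constructed, so here the negative is simply passed in by the caller. The function names, the margin value of 0.5, and the toy vectors are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def margin_loss(predicted, target, negative, margin=0.5):
    """Generic hinge loss: penalize the prediction unless it is closer
    (in cosine similarity) to the target than to the negative sample
    by at least `margin`. The negative is supplied by the caller; how
    syn-margin builds it is not specified in the abstract.
    """
    gap = cosine(predicted, negative) - cosine(predicted, target)
    return max(0.0, margin + gap)


# Prediction aligned with the target, orthogonal to the negative.
good = margin_loss([1.0, 0.0], [1.0, 0.0], [0.0, 1.0])
# Prediction aligned with the negative, orthogonal to the target.
bad = margin_loss([0.0, 1.0], [1.0, 0.0], [0.0, 1.0])
```

The loss is zero once the prediction is sufficiently closer to the target than to the negative, and grows as the prediction drifts toward the negative.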

CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology

Chaudhary

Salesky

Bhat

et al. 2019

This paper presents the submission by the CMU-01 team to the SIGMORPHON 2019 task 2 of Morphological Analysis and Lemmatization in Context. This task requires us to produce the lemma and morpho-syntactic description of each token in a sequence, for 107 treebanks. We approach this task with a hierarchical neural conditional random field (CRF) model which predicts each coarse-grained feature (e.g., POS, Case) independently. However, most treebanks are under-resourced, which makes it challenging to train deep neural models for them. Hence, we propose a multi-lingual transfer training regime where we transfer from multiple related languages that share similar typology.

Hierarchical Encoders for Modeling and Interpreting Screenplays

Bhat

Saluja

Dye

et al. 2021

While natural language understanding of long-form documents remains an open challenge, such documents often contain structural information that can inform the design of models encoding them. Movie scripts are an example of such richly structured text: scripts are segmented into scenes, which decompose into dialogue and descriptive components. In this work, we propose a neural architecture to encode this structure, which performs robustly on two multi-label tag classification tasks without using handcrafted features. We add a layer of insight by augmenting the encoder with an unsupervised 'interpretability' module, which can be used to extract and visualize narrative trajectories. Though this work specifically tackles screenplays, we discuss how the underlying approach can be generalized to a range of structured documents.

Language Modeling for Code-Mixing: The Role of Linguistic Theory based Synthetic Data
